CLARIN (Common Language Resources and Technology Infrastructure) is a European research infrastructure providing easy and sustainable access to digital language resources and tools (written, spoken, or multimodal form) for especially, but not exclusively scholars in social sciences and humanities.
CLARIN also offers tools for e.g. discovering, exploiting, annotating and analyzing language resources through a networked federation of centres: language data repositories, service centres and knowledge centres, with single sign-on access for all members of the academic community in all participating countries. CLARIN-DK is one of these centres.
Besides providing access to digital language data and tools, CLARIN-dk offers depositing services. It is easy to upload, archive and share your own data.
In addition, CLARIN-DK has a knowledge center DANSK (DANish helpdeSK) comprising The Department of Nordic Studies and Linguistics, University of Copenhagen and The Danish Language Council.
CLARIN is short for Common Language Resources and Technology Infrastructure. CLARIN in Denmark (CLARIN-DK) is part of the European infrastructure CLARIN ERIC (European Research Infrastructure Consortium).
In the Danish repository, you find various collections of language data: a general language corpus, several collections of language for special purposes, parallel corpora, old Danish texts etc. CLARIN offers tools to discover, explore, annotate, analyse and combine data sets. The tools are supported by tutorials and use cases.
As CLARIN is part of a European infrastructure, it also offers access to language data and tools from the other members. In Denmark CLARIN offers workshops, seminars, PhD courses and training courses for scientists.
The CLARIN-DK archive contains language-based data and tools for annotation and research in a variety of disciplines in humanities and social sciences. Part of the data and tools has public access. Other resources can only be accessed and downloaded by researchers via e.g. WAYF.
The archive is evolving over time and currently contains Danish text corpora, separated in general language and professional language. In addition, there are Danish audio, video and photo collections as well as lexica, wordnet and annotations for some of the files.
The archive can host the language-based data you use for research purposes or want to share with fellow colleagues. It is possible to do linguistic annotation, convert text to the XML (TEI P5) format and deposit your own data in CLARIN-DK.