Events

Tutorial at CMC-Corpora 2017: ‘How to use TEI for the annota­tion of CMC and social media resources: a prac­tical introduction’

The goal of the event is to give a prac­tical intro­duc­tion into the annota­tion of lan­guage data from genres of com­puter­-­me­di­ated com­mu­nic­a­tion (CMC) and social media using the formats of the Text Encod­ing Ini­ti­at­ive (TEI).

In an intro­duct­ory sec­tion par­ti­cipants will learn about the gen­eral archi­tec­ture of TEI encod­ing schemas and about rules for the cre­ation of so-c­alled cus­tom­iz­a­tions which allow for extend­ing the use of TEI with tex­tual genres and in domains which are not yet covered by the cur­rent ver­sion of the TEI guidelines.

Examples for TEI cus­tom­iz­a­tions are the rep­res­ent­a­tion schemas for CMC/so­cial media genres developed in the TEI spe­cial interest group “com­puter­-­me­di­ated communication”.

In a hand­s-on ses­sion, par­ti­cipants will learn how to use these cus­tom­iz­a­tions to cre­ate a basic TEI rep­res­ent­a­tion for their own CMC/so­cial media data. For this pur­pose par­ti­cipants may bring samples from their own data/­cor­pora or select a sample from col­lec­tions of Wiki­pe­dia talk pages in sev­eral lan­guages pre­pared by the instruct­ors. Format spe­cific­a­tions for par­ti­cipants’ own data will be announced in advance.

For the hand­s-on ses­sion, par­ti­cipants will be asked to bring a laptop com­puter with WLAN and a full or trial license of the oXy­gen XML editor.

The tutorial is fun­ded as a CLARIN User Involve­ment Event and will be held in asso­ci­ation with the 5th Con­fer­ence on CMC and Social Media Cor­pora for the Human­it­ies (cmccorpora17), held Oct 3rd & 4th @ Eurac Resarch, Italy.

Further information and registration here

 

 

Workshop details

Loading Map....

04/10/2017
2:30 pm - 6:30 pm

See map above

Dighumlab

Secretariat
Digital Humanities Lab Denmark

Aarhus University
Jens Chr. Skous Vej 4
DK-8000 Aarhus C

info@dighumlab.org

Menu