In search of comity: TEI for distant reading
Any expansion of the TEI beyond its traditional user base involves a recognition that there are many differing answers to the traditional question “What is text, really?” We report on some work carried out in the context of the COST Action Distant Reading for European Literary History (CA16204), in...
Guardado en:
Autores principales: | , , |
---|---|
Formato: | article |
Lenguaje: | DE EN ES FR IT |
Publicado: |
OpenEdition
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/703527f0a9f141958993ac8429837a78 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:703527f0a9f141958993ac8429837a78 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:703527f0a9f141958993ac8429837a782021-12-02T11:31:13ZIn search of comity: TEI for distant reading2162-560310.4000/jtei.3500https://doaj.org/article/703527f0a9f141958993ac8429837a782021-07-01T00:00:00Zhttp://journals.openedition.org/jtei/3500https://doaj.org/toc/2162-5603Any expansion of the TEI beyond its traditional user base involves a recognition that there are many differing answers to the traditional question “What is text, really?” We report on some work carried out in the context of the COST Action Distant Reading for European Literary History (CA16204), in particular on the TEI-conformant schemas developed for one of its principal deliverables: the European Literary Text Collection (ELTeC). The ELTeC will contain comparable corpora for each of at least a dozen European languages, each being a balanced sample of one hundred novels from the period 1840 to 1920, together with metadata concerning their production and reception. We hope that it will become a reliable basis for comparative work in data-driven textual analytics. The focus of the ELTeC encoding scheme is not to represent texts in all their original complexity, nor to duplicate the work of scholarly editors. Instead, we aim to facilitate a richer and better-informed distant reading than a transcription of lexical content alone would permit. At the same time, where the TEI encourages diversity, we enforce consistency by permitting representation of only a specific and quite small set of textual features, both structural and analytical. These constraints are expressed by a master TEI ODD, from which we derive three different schemas by ODD chaining, each associated with appropriate documentation.Lou BurnardChristof SchöchCarolin OdebrechtOpenEditionarticledistant readingELTeCODD chainingcorpus designthe European novelliterary studiesComputer engineering. Computer hardwareTK7885-7895DEENESFRITJournal of the Text Encoding Initiative, Vol 14 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
DE EN ES FR IT |
topic |
distant reading ELTeC ODD chaining corpus design the European novel literary studies Computer engineering. Computer hardware TK7885-7895 |
spellingShingle |
distant reading ELTeC ODD chaining corpus design the European novel literary studies Computer engineering. Computer hardware TK7885-7895 Lou Burnard Christof Schöch Carolin Odebrecht In search of comity: TEI for distant reading |
description |
Any expansion of the TEI beyond its traditional user base involves a recognition that there are many differing answers to the traditional question “What is text, really?” We report on some work carried out in the context of the COST Action Distant Reading for European Literary History (CA16204), in particular on the TEI-conformant schemas developed for one of its principal deliverables: the European Literary Text Collection (ELTeC). The ELTeC will contain comparable corpora for each of at least a dozen European languages, each being a balanced sample of one hundred novels from the period 1840 to 1920, together with metadata concerning their production and reception. We hope that it will become a reliable basis for comparative work in data-driven textual analytics. The focus of the ELTeC encoding scheme is not to represent texts in all their original complexity, nor to duplicate the work of scholarly editors. Instead, we aim to facilitate a richer and better-informed distant reading than a transcription of lexical content alone would permit. At the same time, where the TEI encourages diversity, we enforce consistency by permitting representation of only a specific and quite small set of textual features, both structural and analytical. These constraints are expressed by a master TEI ODD, from which we derive three different schemas by ODD chaining, each associated with appropriate documentation. |
format |
article |
author |
Lou Burnard Christof Schöch Carolin Odebrecht |
author_facet |
Lou Burnard Christof Schöch Carolin Odebrecht |
author_sort |
Lou Burnard |
title |
In search of comity: TEI for distant reading |
title_short |
In search of comity: TEI for distant reading |
title_full |
In search of comity: TEI for distant reading |
title_fullStr |
In search of comity: TEI for distant reading |
title_full_unstemmed |
In search of comity: TEI for distant reading |
title_sort |
in search of comity: tei for distant reading |
publisher |
OpenEdition |
publishDate |
2021 |
url |
https://doaj.org/article/703527f0a9f141958993ac8429837a78 |
work_keys_str_mv |
AT louburnard insearchofcomityteifordistantreading AT christofschoch insearchofcomityteifordistantreading AT carolinodebrecht insearchofcomityteifordistantreading |
_version_ |
1718395884102746112 |