Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora

Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language res...


Bibliographic Details
Main Authors: Karlheinz Mörth, Laurent Romary, Gerhard Budin, Daniel Schopper
Format: article
Language: DE, EN, ES, FR, IT
Published: OpenEdition 2015
Subjects: lexicography; language resources; digital corpora; statistics; Computer engineering. Computer hardware; TK7885-7895
Online Access: https://doaj.org/article/417f1cefbdcd4b9da31a2ccf6f97f53b
id oai:doaj.org-article:417f1cefbdcd4b9da31a2ccf6f97f53b
record_format dspace
spelling oai:doaj.org-article:417f1cefbdcd4b9da31a2ccf6f97f53b
2021-12-02T11:30:23Z
Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
2162-5603
10.4000/jtei.1356
https://doaj.org/article/417f1cefbdcd4b9da31a2ccf6f97f53b
2015-12-01T00:00:00Z
http://journals.openedition.org/jtei/1356
https://doaj.org/toc/2162-5603
Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language resources that are needed by dictionaries and dictionary tools. In particular this holds true for the crucial role that statistical data obtained from language resources play in lexicographic workflow—a role that also has to be reflected in the model of the data produced in these workflows. Presenting a range of current projects, the authors address two main questions in this area: How can the relationship between a dictionary and other language resources be conceptualized, irrespective of whether they are used in the production of the dictionary or to enrich existing lexicographic data? And how can this be documented using the TEI Guidelines? Discussing a variety of options, this paper proposes a customization of the TEI dictionary module that tries to respond to the emerging requirements in an environment of increasingly intertwined language resources.
Karlheinz Mörth
Laurent Romary
Gerhard Budin
Daniel Schopper
OpenEdition
article
lexicography
language resources
digital corpora
statistics
Computer engineering. Computer hardware
TK7885-7895
DE
EN
ES
FR
IT
Journal of the Text Encoding Initiative, Vol 8 (2015)
institution DOAJ
collection DOAJ
language DE
EN
ES
FR
IT
topic lexicography
language resources
digital corpora
statistics
Computer engineering. Computer hardware
TK7885-7895
spellingShingle lexicography
language resources
digital corpora
statistics
Computer engineering. Computer hardware
TK7885-7895
Karlheinz Mörth
Laurent Romary
Gerhard Budin
Daniel Schopper
Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
description Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language resources that are needed by dictionaries and dictionary tools. In particular this holds true for the crucial role that statistical data obtained from language resources play in lexicographic workflow—a role that also has to be reflected in the model of the data produced in these workflows. Presenting a range of current projects, the authors address two main questions in this area: How can the relationship between a dictionary and other language resources be conceptualized, irrespective of whether they are used in the production of the dictionary or to enrich existing lexicographic data? And how can this be documented using the TEI Guidelines? Discussing a variety of options, this paper proposes a customization of the TEI dictionary module that tries to respond to the emerging requirements in an environment of increasingly intertwined language resources.
format article
author Karlheinz Mörth
Laurent Romary
Gerhard Budin
Daniel Schopper
author_facet Karlheinz Mörth
Laurent Romary
Gerhard Budin
Daniel Schopper
author_sort Karlheinz Mörth
title Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_short Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_full Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_fullStr Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_full_unstemmed Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
title_sort modeling frequency data: methodological considerations on the relationship between dictionaries and corpora
publisher OpenEdition
publishDate 2015
url https://doaj.org/article/417f1cefbdcd4b9da31a2ccf6f97f53b
work_keys_str_mv AT karlheinzmorth modelingfrequencydatamethodologicalconsiderationsontherelationshipbetweendictionariesandcorpora
AT laurentromary modelingfrequencydatamethodologicalconsiderationsontherelationshipbetweendictionariesandcorpora
AT gerhardbudin modelingfrequencydatamethodologicalconsiderationsontherelationshipbetweendictionariesandcorpora
AT danielschopper modelingfrequencydatamethodologicalconsiderationsontherelationshipbetweendictionariesandcorpora
_version_ 1718395873312899072