The TEI and Current Standards for Structuring Linguistic Data

The TEI has served for many years as a mature annotation format for corpora of different types, including linguistically annotated data. Although it is based on the consensus of a large community, it does not have the legal status of a standard. During the last decade, efforts have been undertaken t...

Descripción completa

Guardado en:

Detalles Bibliográficos
Autor principal:	Maik Stührenberg
Formato:	article
Lenguaje:	DE EN ES FR IT
Publicado:	OpenEdition 2012
Materias:	feature structures ISO/TC 37/SC 4 Linguistic Annotation Framework (LAF) Morpho-Syntactic Annotation Framework (MAF) standards Syntactic Annotation Framework (SynAF) Computer engineering. Computer hardware TK7885-7895
Acceso en línea:	https://doaj.org/article/66d0f592b3c34c6d8381ef8d8197612e
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

id	oai:doaj.org-article:66d0f592b3c34c6d8381ef8d8197612e
record_format	dspace
spelling	oai:doaj.org-article:66d0f592b3c34c6d8381ef8d8197612e2021-12-02T11:29:29ZThe TEI and Current Standards for Structuring Linguistic Data2162-560310.4000/jtei.523https://doaj.org/article/66d0f592b3c34c6d8381ef8d8197612e2012-10-01T00:00:00Zhttp://journals.openedition.org/jtei/523https://doaj.org/toc/2162-5603The TEI has served for many years as a mature annotation format for corpora of different types, including linguistically annotated data. Although it is based on the consensus of a large community, it does not have the legal status of a standard. During the last decade, efforts have been undertaken to develop definitive de jure standards for linguistic data that not only act as a normative basis for the exchange of language corpora but also address recent advancements in technology, such as web-based standards, and the use of large and multiply annotated corpora. In this article we will provide an overview of the process of international standardization and discuss some of the international standards currently being developed under the auspices of ISO/TC 37, a technical committee called “Terminology and other Language and Content Resources”. After that the relationship between the TEI Guidelines and these specifications, according to their formal model, notation format, and annotation model, will be discussed. The conclusion of the paper provides recommendations for dealing with language corpora.Maik StührenbergOpenEditionarticlefeature structuresISO/TC 37/SC 4Linguistic Annotation Framework (LAF)Morpho-Syntactic Annotation Framework (MAF)standardsSyntactic Annotation Framework (SynAF)Computer engineering. Computer hardwareTK7885-7895DEENESFRITJournal of the Text Encoding Initiative, Vol 3 (2012)
institution	DOAJ
collection	DOAJ
language	DE EN ES FR IT
topic	feature structures ISO/TC 37/SC 4 Linguistic Annotation Framework (LAF) Morpho-Syntactic Annotation Framework (MAF) standards Syntactic Annotation Framework (SynAF) Computer engineering. Computer hardware TK7885-7895
spellingShingle	feature structures ISO/TC 37/SC 4 Linguistic Annotation Framework (LAF) Morpho-Syntactic Annotation Framework (MAF) standards Syntactic Annotation Framework (SynAF) Computer engineering. Computer hardware TK7885-7895 Maik Stührenberg The TEI and Current Standards for Structuring Linguistic Data
description	The TEI has served for many years as a mature annotation format for corpora of different types, including linguistically annotated data. Although it is based on the consensus of a large community, it does not have the legal status of a standard. During the last decade, efforts have been undertaken to develop definitive de jure standards for linguistic data that not only act as a normative basis for the exchange of language corpora but also address recent advancements in technology, such as web-based standards, and the use of large and multiply annotated corpora. In this article we will provide an overview of the process of international standardization and discuss some of the international standards currently being developed under the auspices of ISO/TC 37, a technical committee called “Terminology and other Language and Content Resources”. After that the relationship between the TEI Guidelines and these specifications, according to their formal model, notation format, and annotation model, will be discussed. The conclusion of the paper provides recommendations for dealing with language corpora.
format	article
author	Maik Stührenberg
author_facet	Maik Stührenberg
author_sort	Maik Stührenberg
title	The TEI and Current Standards for Structuring Linguistic Data
title_short	The TEI and Current Standards for Structuring Linguistic Data
title_full	The TEI and Current Standards for Structuring Linguistic Data
title_fullStr	The TEI and Current Standards for Structuring Linguistic Data
title_full_unstemmed	The TEI and Current Standards for Structuring Linguistic Data
title_sort	tei and current standards for structuring linguistic data
publisher	OpenEdition
publishDate	2012
url	https://doaj.org/article/66d0f592b3c34c6d8381ef8d8197612e
work_keys_str_mv	AT maikstuhrenberg theteiandcurrentstandardsforstructuringlinguisticdata AT maikstuhrenberg teiandcurrentstandardsforstructuringlinguisticdata
_version_	1718395881983574016

The TEI and Current Standards for Structuring Linguistic Data

Ejemplares similares