A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa

Theoretically the Northern Sotho language is made up of almost 30 dialects while practically it is not so, because the standard language was formed from very few of its dialects. As a result, even today the language has no corpus which is balanced or representative owing to the fact that almost all...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autor principal: V. M. Mojela
Formato: article
Lenguaje:AF
DE
EN
FR
NL
Publicado: Woordeboek van die Afrikaanse Taal-WAT 2013
Materias:
Acceso en línea:https://doaj.org/article/303321310da845f0bab58432aa8d35b3
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:303321310da845f0bab58432aa8d35b3
record_format dspace
spelling oai:doaj.org-article:303321310da845f0bab58432aa8d35b32021-12-03T09:18:49ZA Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa10.5788/23-1-12161684-49042224-0039https://doaj.org/article/303321310da845f0bab58432aa8d35b32013-12-01T00:00:00Zhttps://lexikos.journals.ac.za/pub/article/view/1216https://doaj.org/toc/1684-4904https://doaj.org/toc/2224-0039Theoretically the Northern Sotho language is made up of almost 30 dialects while practically it is not so, because the standard language was formed from very few of its dialects. As a result, even today the language has no corpus which is balanced or representative owing to the fact that almost all of the available corpora are compiled from the written standard language and the written dialects. The majority of the Northern Sotho dialects do not have written orthographies, and the few dialects which had written orthographies prior to standardization came to monopolize the standard language and the Northern Sotho corpora. Therefore, the compilation of a corpus-based dictionary in Northern Sotho is tantamount to a continuation of producing unbalanced and unrepresentative dictionaries, which continue to sideline and to marginalize the majority of the communities and the linguistic varieties which could potentially enrich both the Northern Sotho standard language and the Northern Sotho corpora. The main objective with this research is to analyze, to expose and to suggest ways of correcting these irregularities so that the marginalized Northern Sotho dialects can be accommodated in the standard language. This will obviously increase the size of the Northern Sotho standard language and the corpus by more than 50%.V. M. MojelaWoordeboek van die Afrikaanse Taal-WATarticlecorpusbalanced corpusrepresentative corpusstan­dardi­za­tiondialectorthographymarginalized dialectsprestige dialectsmis­sion­ary activitiesPhilology. LinguisticsP1-1091Languages and literature of Eastern Asia, Africa, OceaniaPL1-8844Germanic languages. Scandinavian languagesPD1-7159AFDEENFRNLLexikos, Vol 23, Pp 286-296 (2013)
institution DOAJ
collection DOAJ
language AF
DE
EN
FR
NL
topic corpus
balanced corpus
representative corpus
stan­dardi­za­tion
dialect
orthography
marginalized dialects
prestige dialects
mis­sion­ary activities
Philology. Linguistics
P1-1091
Languages and literature of Eastern Asia, Africa, Oceania
PL1-8844
Germanic languages. Scandinavian languages
PD1-7159
spellingShingle corpus
balanced corpus
representative corpus
stan­dardi­za­tion
dialect
orthography
marginalized dialects
prestige dialects
mis­sion­ary activities
Philology. Linguistics
P1-1091
Languages and literature of Eastern Asia, Africa, Oceania
PL1-8844
Germanic languages. Scandinavian languages
PD1-7159
V. M. Mojela
A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa
description Theoretically the Northern Sotho language is made up of almost 30 dialects while practically it is not so, because the standard language was formed from very few of its dialects. As a result, even today the language has no corpus which is balanced or representative owing to the fact that almost all of the available corpora are compiled from the written standard language and the written dialects. The majority of the Northern Sotho dialects do not have written orthographies, and the few dialects which had written orthographies prior to standardization came to monopolize the standard language and the Northern Sotho corpora. Therefore, the compilation of a corpus-based dictionary in Northern Sotho is tantamount to a continuation of producing unbalanced and unrepresentative dictionaries, which continue to sideline and to marginalize the majority of the communities and the linguistic varieties which could potentially enrich both the Northern Sotho standard language and the Northern Sotho corpora. The main objective with this research is to analyze, to expose and to suggest ways of correcting these irregularities so that the marginalized Northern Sotho dialects can be accommodated in the standard language. This will obviously increase the size of the Northern Sotho standard language and the corpus by more than 50%.
format article
author V. M. Mojela
author_facet V. M. Mojela
author_sort V. M. Mojela
title A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa
title_short A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa
title_full A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa
title_fullStr A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa
title_full_unstemmed A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa
title_sort balanced and representative corpus: the effects of strict corpus-based dictionary compilation in sesotho sa leboa
publisher Woordeboek van die Afrikaanse Taal-WAT
publishDate 2013
url https://doaj.org/article/303321310da845f0bab58432aa8d35b3
work_keys_str_mv AT vmmojela abalancedandrepresentativecorpustheeffectsofstrictcorpusbaseddictionarycompilationinsesothosaleboa
AT vmmojela balancedandrepresentativecorpustheeffectsofstrictcorpusbaseddictionarycompilationinsesothosaleboa
_version_ 1718373318292144128