A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa
Theoretically the Northern Sotho language is made up of almost 30 dialects while practically it is not so, because the standard language was formed from very few of its dialects. As a result, even today the language has no corpus which is balanced or representative owing to the fact that almost all...
Main author: | V. M. Mojela |
---|---|
Format: | article |
Language: | AF DE EN FR NL |
Published: | Woordeboek van die Afrikaanse Taal-WAT, 2013 |
Subjects: | |
Online access: | https://doaj.org/article/303321310da845f0bab58432aa8d35b3 |
id |
oai:doaj.org-article:303321310da845f0bab58432aa8d35b3 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:303321310da845f0bab58432aa8d35b3
2021-12-03T09:18:49Z
A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa
10.5788/23-1-1216
1684-4904
2224-0039
https://doaj.org/article/303321310da845f0bab58432aa8d35b3
2013-12-01T00:00:00Z
https://lexikos.journals.ac.za/pub/article/view/1216
https://doaj.org/toc/1684-4904
https://doaj.org/toc/2224-0039
V. M. Mojela
Woordeboek van die Afrikaanse Taal-WAT
article
corpus; balanced corpus; representative corpus; standardization; dialect; orthography; marginalized dialects; prestige dialects; missionary activities
Philology. Linguistics; P1-1091
Languages and literature of Eastern Asia, Africa, Oceania; PL1-8844
Germanic languages. Scandinavian languages; PD1-7159
AF DE EN FR NL
Lexikos, Vol 23, Pp 286-296 (2013) |
institution |
DOAJ |
collection |
DOAJ |
language |
AF DE EN FR NL |
topic |
corpus; balanced corpus; representative corpus; standardization; dialect; orthography; marginalized dialects; prestige dialects; missionary activities; Philology. Linguistics; P1-1091; Languages and literature of Eastern Asia, Africa, Oceania; PL1-8844; Germanic languages. Scandinavian languages; PD1-7159 |
description |
In theory, the Northern Sotho language is made up of almost 30 dialects, but in practice it is not so, because the standard language was formed from very few of its dialects. As a result, even today the language has no corpus which is balanced or representative, owing to the fact that almost all of the available corpora are compiled from the written standard language and the written dialects. The majority of the Northern Sotho dialects do not have written orthographies, and the few dialects which had written orthographies prior to standardization came to monopolize the standard language and the Northern Sotho corpora. Therefore, the compilation of a corpus-based dictionary in Northern Sotho is tantamount to a continuation of producing unbalanced and unrepresentative dictionaries, which continue to sideline and to marginalize the majority of the communities and the linguistic varieties which could potentially enrich both the Northern Sotho standard language and the Northern Sotho corpora. The main objective of this research is to analyze, to expose and to suggest ways of correcting these irregularities so that the marginalized Northern Sotho dialects can be accommodated in the standard language. This will obviously increase the size of the Northern Sotho standard language and the corpus by more than 50%. |
format |
article |
author |
V. M. Mojela |
title |
A Balanced and Representative Corpus: The Effects of Strict Corpus-based Dictionary Compilation in Sesotho sa Leboa |
publisher |
Woordeboek van die Afrikaanse Taal-WAT |
publishDate |
2013 |
url |
https://doaj.org/article/303321310da845f0bab58432aa8d35b3 |