Conceptos relacionados con estrella. Lingüística de corpus de astronomía

The PhD. thesis was made within the line of research of the GICEC (Group of Research of Concepts in Science Teaching). The research focus on the improvement and justification of the methodology used for determining frequent vocabularies, specific vocabularies, collocations and relations between lexi...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autor principal: Hansen Ruiz, Cristina Silvia
Otros Autores: Pérez Ceballos, Jesús Miguel (Universidad de La Laguna)
Formato: text (thesis)
Lenguaje:spa
Publicado: Universidad de La Laguna (España) 2011
Materias:
Acceso en línea:https://dialnet.unirioja.es/servlet/oaites?codigo=24261
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
Descripción
Sumario:The PhD. thesis was made within the line of research of the GICEC (Group of Research of Concepts in Science Teaching). The research focus on the improvement and justification of the methodology used for determining frequent vocabularies, specific vocabularies, collocations and relations between lexical units. This requires the use of the software PAFE (Software for Frequency Analysis and Environment Studies) and techniques based on corpus linguistics for detecting specific vocabularies, frequent vocabularies and collocations. The methodology has been tested applying it to a particular text of more of 1000 words: a text of Astronomy made by the Hubble European Space Agency Information Centre (ESA/ESO) for Secondary Education. The frequency data of the lexical units obtained with the PAFE software is used to propose a mathematical way of obtaining frequent vocabularies based on the analysis of the absolute frequencies distribution. Specific vocabularies are found comparing the relative frequencies of a given text with the ones of a general corpus following techniques developed in corpus linguistics. Collocations are found comparing frequencies and relations between lexical units. Frequencies and relations are then used to build the semantic networks enriched with the previously detected information. The research concludes the need of: removing all functional words; unifying synonyms spellings only for words which are in the high and medium intervals; not removing the mathematical language due to its importance in frequencies and semantic networks; an expert in the subject being analyzed to correct the data obtained when finding collocations and specific vocabulary; the mathematical determination of the frequent vocabulary; determining the optimal system analyzing the conservation of relations for each lemma and the conservation of total relations within the system; and enriching semantic networks with the data about specific vocabularies and relation conservation. Moreover possible errors in the methodology are delimited, the limitations that affect the results are analyzed and possible errors in each of the phases of the methodology are quantified.