Text Classification Model Enhanced by Unlabeled Data for LaTeX Formula

Generic language models pretrained on large unspecific domains are currently the foundation of NLP. Labeled data are limited in most model training due to the cost of manual annotation, especially in domains including massive Proper Nouns such as mathematics and biology, where it affects the accurac...

Description complète

Enregistré dans:
Détails bibliographiques
Auteurs principaux: Hua Cheng, Renjie Yu, Yixin Tang, Yiquan Fang, Tao Cheng
Format: article
Langue:EN
Publié: MDPI AG 2021
Sujets:
T
Accès en ligne:https://doaj.org/article/2b40f93513304981a835803b16c74883
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!