$Text Classification Model Enhanced by Unlabeled Data for LaTeX Formula$
QR Code

Text Classification Model Enhanced by Unlabeled Data for LaTeX Formula

Generic language models pretrained on large unspecific domains are currently the foundation of NLP. Labeled data are limited in most model training due to the cost of manual annotation, especially in domains including massive Proper Nouns such as mathematics and biology, where it affects the accurac...

Description complète

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Hua Cheng, Renjie Yu, Yixin Tang, Yiquan Fang, Tao Cheng
Format:	article
Langue:	EN
Publié:	MDPI AG 2021
Sujets:	unlabeled data self-training pretraining BERT LaTeX formula Technology T Engineering (General). Civil engineering (General) TA1-2040 Biology (General) QH301-705.5 Physics QC1-999 Chemistry QD1-999
Accès en ligne:	https://doaj.org/article/2b40f93513304981a835803b16c74883
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

Internet

https://doaj.org/article/2b40f93513304981a835803b16c74883

Text Classification Model Enhanced by Unlabeled Data for LaTeX Formula

Internet

Documents similaires