Text Classification Model Enhanced by Unlabeled Data for LaTeX Formula

Generic language models pretrained on large unspecific domains are currently the foundation of NLP. Labeled data are limited in most model training due to the cost of manual annotation, especially in domains including massive Proper Nouns such as mathematics and biology, where it affects the accurac...

Full description

Saved in:
Bibliographic Details
Main Authors: Hua Cheng, Renjie Yu, Yixin Tang, Yiquan Fang, Tao Cheng
Format: article
Language:EN
Published: MDPI AG 2021
Subjects:
T
Online Access:https://doaj.org/article/2b40f93513304981a835803b16c74883
Tags: Add Tag
No Tags, Be the first to tag this record!