Optimizing Small BERTs Trained for German NER
Currently, the most widespread neural network architecture for training language models is BERT, which has led to improvements in various Natural Language Processing (NLP) tasks. In general, the larger the number of parameters in a BERT model, the better the results obtained in these NLP tasks …
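The record itself contains no code; as a rough sketch of the task the article concerns, the snippet below loads a small German BERT checkpoint and attaches a token-classification head for NER using the Hugging Face transformers library. The checkpoint name (bert-base-german-cased), the CoNLL-style label set, and the example sentence are illustrative assumptions, not details taken from the article.

```python
# Minimal sketch (not the authors' code): a small German BERT with a
# token-classification head for NER. Checkpoint name and labels are assumptions.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

model_name = "bert-base-german-cased"  # assumed German checkpoint for illustration
labels = ["O", "B-PER", "I-PER", "B-LOC", "I-LOC", "B-ORG", "I-ORG", "B-MISC", "I-MISC"]

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(
    model_name,
    num_labels=len(labels),
    id2label=dict(enumerate(labels)),
    label2id={label: i for i, label in enumerate(labels)},
)

# Tokenize one German sentence and run a forward pass; with a freshly initialized
# head the predictions are meaningless until the model is fine-tuned on labeled
# NER data (e.g., a German CoNLL-style corpus).
encoding = tokenizer("Angela Merkel besuchte Berlin.", return_tensors="pt")
with torch.no_grad():
    logits = model(**encoding).logits  # shape: (1, sequence_length, num_labels)
predicted_ids = logits.argmax(dim=-1)[0].tolist()
print([model.config.id2label[i] for i in predicted_ids])
```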
| Main Authors: | , , , |
|---|---|
| Format: | article |
| Language: | EN |
| Published: | MDPI AG, 2021 |
| Subjects: | |
| Online Access: | https://doaj.org/article/c269d264e4014373a18286098ba93af0 |