TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance

Scene text recognition (STR) is an important bridge between images and text, attracting abundant research attention. While convolutional neural networks (CNNS) have achieved remarkable progress in this task, most of the existing works need an extra module (context modeling module) to help CNN to cap...

Description complète

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Yue Tao, Zhiwei Jia, Runze Ma, Shugong Xu
Format:	article
Langue:	EN
Publié:	MDPI AG 2021
Sujets:	scene text recognition transformer self-attention 1-D split initial embedding Electronics TK7800-8360
Accès en ligne:	https://doaj.org/article/5d61a56968d34aa99a6d35d3cd35b304
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

Internet

https://doaj.org/article/5d61a56968d34aa99a6d35d3cd35b304

TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance

Internet

Documents similaires