TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance
Scene text recognition (STR) is an important bridge between images and text, attracting abundant research attention. While convolutional neural networks (CNNS) have achieved remarkable progress in this task, most of the existing works need an extra module (context modeling module) to help CNN to cap...
Enregistré dans:
Auteurs principaux: | Yue Tao, Zhiwei Jia, Runze Ma, Shugong Xu |
---|---|
Format: | article |
Langue: | EN |
Publié: |
MDPI AG
2021
|
Sujets: | |
Accès en ligne: | https://doaj.org/article/5d61a56968d34aa99a6d35d3cd35b304 |
Tags: |
Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
|
Documents similaires
-
Development of Vertical Text Interpreter for Natural Scene Images
par: Ong Yi Ling, et autres
Publié: (2021) -
Environmental Sound Recognition on Embedded Systems: From FPGAs to TPUs
par: Jurgen Vandendriessche, et autres
Publié: (2021) -
The presence of occupational structure in online texts based on word embedding NLP models
par: Zoltán Kmetty, et autres
Publié: (2021) -
Performance Evaluation of Offline Speech Recognition on Edge Devices
par: Santosh Gondi, et autres
Publié: (2021) -
K-Nearest Neighbor for Recognize Handwritten Arabic Character
par: Muhammad Athoillah
Publié: (2019)