TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance
Scene text recognition (STR) is an important bridge between images and text, attracting abundant research attention. While convolutional neural networks (CNNS) have achieved remarkable progress in this task, most of the existing works need an extra module (context modeling module) to help CNN to cap...
Saved in:
Main Authors: | Yue Tao, Zhiwei Jia, Runze Ma, Shugong Xu |
---|---|
Format: | article |
Language: | EN |
Published: |
MDPI AG
2021
|
Subjects: | |
Online Access: | https://doaj.org/article/5d61a56968d34aa99a6d35d3cd35b304 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Development of Vertical Text Interpreter for Natural Scene Images
by: Ong Yi Ling, et al.
Published: (2021) -
Environmental Sound Recognition on Embedded Systems: From FPGAs to TPUs
by: Jurgen Vandendriessche, et al.
Published: (2021) -
The presence of occupational structure in online texts based on word embedding NLP models
by: Zoltán Kmetty, et al.
Published: (2021) -
Performance Evaluation of Offline Speech Recognition on Edge Devices
by: Santosh Gondi, et al.
Published: (2021) -
K-Nearest Neighbor for Recognize Handwritten Arabic Character
by: Muhammad Athoillah
Published: (2019)