TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance
Scene text recognition (STR) is an important bridge between images and text, attracting abundant research attention. While convolutional neural networks (CNNS) have achieved remarkable progress in this task, most of the existing works need an extra module (context modeling module) to help CNN to cap...
Enregistré dans:
Auteurs principaux: | , , , |
---|---|
Format: | article |
Langue: | EN |
Publié: |
MDPI AG
2021
|
Sujets: | |
Accès en ligne: | https://doaj.org/article/5d61a56968d34aa99a6d35d3cd35b304 |
Tags: |
Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
|