TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance

Scene text recognition (STR) is an important bridge between images and text, attracting abundant research attention. While convolutional neural networks (CNNS) have achieved remarkable progress in this task, most of the existing works need an extra module (context modeling module) to help CNN to cap...

Description complète

Enregistré dans:
Détails bibliographiques
Auteurs principaux: Yue Tao, Zhiwei Jia, Runze Ma, Shugong Xu
Format: article
Langue:EN
Publié: MDPI AG 2021
Sujets:
Accès en ligne:https://doaj.org/article/5d61a56968d34aa99a6d35d3cd35b304
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!