FAN-MCCD: Fast and Accurate Network for Multi-Scale Chinese Character Detection
Inaccurate localization due to scale-variation during character detection causes a widespread issue overconfidence in results of the document analysis community, for the most part in historical and handwritten documents. In this work, we explored the performance of a state-of-the-art network with a...
Guardado en:
Autores principales: | , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
MDPI AG
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/8676484a284b49e2abffb4c892ca66d6 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:8676484a284b49e2abffb4c892ca66d6 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:8676484a284b49e2abffb4c892ca66d62021-11-11T19:14:39ZFAN-MCCD: Fast and Accurate Network for Multi-Scale Chinese Character Detection10.3390/s212172891424-8220https://doaj.org/article/8676484a284b49e2abffb4c892ca66d62021-11-01T00:00:00Zhttps://www.mdpi.com/1424-8220/21/21/7289https://doaj.org/toc/1424-8220Inaccurate localization due to scale-variation during character detection causes a widespread issue overconfidence in results of the document analysis community, for the most part in historical and handwritten documents. In this work, we explored the performance of a state-of-the-art network with a simple pipeline that fast and accurately predicts handwritten Chinese characters in old documents. In order to adapt to locations of characters with multi-scale more precisely, excluding pre-processing and in-between steps, we utilized a network with multi-scale feature maps. Then, across each feature map, pre-selected boxes of unalike scales and aspect ratios were employed. The last step was to prune the bounding boxes, sending them to non-maximum suppression to yield the final results. Focusing on a well-designed neural network architecture and loss function that presents well-classified examples, we found our experiments on Caoshu, Character, and Src-images datasets demonstrated that detection performance was enhanced for the detection rate (DT), the false positive per character (FPPC), and the F-score in the order of 98.84%, 0.71, and 97.64%, respectively. In comparison with SSD (single-shot detector), the detection performance of a detection rate (DT), the false positive per character (FPPC), and the F-score were 61.12%, 6.12, and 60.33%, respectively.Manar AlnaasanSungho KimMDPI AGarticlesimple pipelinemulti-scale Chinese character detectionhandwritten in old documentsmultiscale feature networkChemical technologyTP1-1185ENSensors, Vol 21, Iss 7289, p 7289 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
simple pipeline multi-scale Chinese character detection handwritten in old documents multiscale feature network Chemical technology TP1-1185 |
spellingShingle |
simple pipeline multi-scale Chinese character detection handwritten in old documents multiscale feature network Chemical technology TP1-1185 Manar Alnaasan Sungho Kim FAN-MCCD: Fast and Accurate Network for Multi-Scale Chinese Character Detection |
description |
Inaccurate localization due to scale-variation during character detection causes a widespread issue overconfidence in results of the document analysis community, for the most part in historical and handwritten documents. In this work, we explored the performance of a state-of-the-art network with a simple pipeline that fast and accurately predicts handwritten Chinese characters in old documents. In order to adapt to locations of characters with multi-scale more precisely, excluding pre-processing and in-between steps, we utilized a network with multi-scale feature maps. Then, across each feature map, pre-selected boxes of unalike scales and aspect ratios were employed. The last step was to prune the bounding boxes, sending them to non-maximum suppression to yield the final results. Focusing on a well-designed neural network architecture and loss function that presents well-classified examples, we found our experiments on Caoshu, Character, and Src-images datasets demonstrated that detection performance was enhanced for the detection rate (DT), the false positive per character (FPPC), and the F-score in the order of 98.84%, 0.71, and 97.64%, respectively. In comparison with SSD (single-shot detector), the detection performance of a detection rate (DT), the false positive per character (FPPC), and the F-score were 61.12%, 6.12, and 60.33%, respectively. |
format |
article |
author |
Manar Alnaasan Sungho Kim |
author_facet |
Manar Alnaasan Sungho Kim |
author_sort |
Manar Alnaasan |
title |
FAN-MCCD: Fast and Accurate Network for Multi-Scale Chinese Character Detection |
title_short |
FAN-MCCD: Fast and Accurate Network for Multi-Scale Chinese Character Detection |
title_full |
FAN-MCCD: Fast and Accurate Network for Multi-Scale Chinese Character Detection |
title_fullStr |
FAN-MCCD: Fast and Accurate Network for Multi-Scale Chinese Character Detection |
title_full_unstemmed |
FAN-MCCD: Fast and Accurate Network for Multi-Scale Chinese Character Detection |
title_sort |
fan-mccd: fast and accurate network for multi-scale chinese character detection |
publisher |
MDPI AG |
publishDate |
2021 |
url |
https://doaj.org/article/8676484a284b49e2abffb4c892ca66d6 |
work_keys_str_mv |
AT manaralnaasan fanmccdfastandaccuratenetworkformultiscalechinesecharacterdetection AT sunghokim fanmccdfastandaccuratenetworkformultiscalechinesecharacterdetection |
_version_ |
1718431595517444096 |