MAT-AGCA: Multi Augmentation Technique on small dataset for Balinese character recognition using Convolutional Neural Network

The lontar manuscript is an ancient Balinese cultural heritage written using Balinese characters on palm leaves. The recognition of Balinese characters in lontar is challenging because it has noise and limited data availability. To solve these problems, data augmentation is needed to increase the va...

Description complète

Enregistré dans:
Détails bibliographiques
Auteurs principaux: Ni Putu Sutramiani, Nanik Suciati, Daniel Siahaan
Format: article
Langue:EN
Publié: Elsevier 2021
Sujets:
Accès en ligne:https://doaj.org/article/1eb1dcb8d5cc44b2b49090584e2410d1
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
Description
Résumé:The lontar manuscript is an ancient Balinese cultural heritage written using Balinese characters on palm leaves. The recognition of Balinese characters in lontar is challenging because it has noise and limited data availability. To solve these problems, data augmentation is needed to increase the variety and amount of data to improve recognition performance. In this study, we collected Balinese character images from 50 lontar manuscript writers. We proposed MAT-AGCA that combines Adaptive Gaussian Thresholding and Convolutional Autoencoder for data augmentation. Based on experiments using InceptionResnetV2, DenseNet169, ResNet152V2, VGG19, and MobileNetV2, our proposed method achieved the best performance with 96.29% accuracy.