A Convolutional Neural Network-based Ancient Sundanese Character Classifier with Data Augmentation

被引:6
|
作者
Hidayat, Alam Ahmad [1 ]
Purwandari, Kartika [1 ]
Cenggoro, Tjeng Wawan [1 ,2 ]
Pardamean, Bens [1 ,3 ]
机构
[1] Bina Nusantara Univ, Bioinformat & Data Sci Res Ctr, Jakarta 11480, Indonesia
[2] Bina Nusantara Univ, Sch Comp Sci, Comp Sci Dept, Jakarta 11480, Indonesia
[3] Bina Nusantara Univ, Comp Sci Dept, BINUS Grad Program Master Comp Sci, Jakarta 11480, Indonesia
关键词
ancient sundanese characters; convolutional neural network; data augmentation; document image analysis; Otsu's threshold method;
D O I
10.1016/j.procs.2021.12.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With an increasing interest in the digitization effort of ancient manuscripts, ancient character recognition becomes one of the most important areas in the automated document image analysis. In this regard, we propose a Convolutional Neural Network (CNN)-based classifier to recognize the ancient Sundanese characters obtained from a digital collection of Southeast Asian palm leaf manuscripts. In this work, we utilize two different preprocessing techniques for the dataset. The first technique involves the use of geometric transformations, noise background addition, and brightness adjustment to augment the imbalanced samples to be fed into the classifier. The second technique makes use of the Otsu's threshold method to binarize the characters and only uses the usual geometric transformations for the data augmentation. The proposed network with different data augmentation processes is trained on the training set and tested on the testing set. Image binarization from the second technique can outperform the performance of the CNN-based classifier upon the first technique by achieving a testing accuracy of 97.74%. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:195 / 201
页数:7
相关论文
共 50 条
  • [1] IMPLEMENTATION OF CONVOLUTIONAL NEURAL NETWORK FOR SUNDANESE SCRIPT HANDWRITING RECOGNITION WITH DATA AUGMENTATION
    Maliki, Irfan
    Prayoga, Ade Syahlan
    [J]. JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2023, 18 (02): : 1113 - 1123
  • [2] Analyzing Data Augmentation Methods for Convolutional Neural Network-based Brain-Computer Interfaces
    Faria, Gabriel
    de Souza, Gabriel Henrique
    Bernardino, Heder
    Motta, Luciana
    Vieira, Alex
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [3] Bangla Handwritten Character Recognition using Convolutional Neural Network with Data Augmentation
    Chowdhury, Rumman Rashid
    Hossain, Mohammad Shahadat
    Ul Islam, Raihan
    Andersson, Karl
    Hossain, Sazzad
    [J]. 2019 JOINT 8TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2019 3RD INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR) WITH INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING (ABC), 2019, : 318 - 323
  • [4] A Robust Graph Convolutional Neural Network-Based Classifier for Automatic Modulation Recognition
    Ghasemzadeh, Pejman
    Hempel, Michael
    Sharif, Hamid
    [J]. 2022 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2022, : 907 - 912
  • [5] Convolutional neural network-based ensemble methods to recognize Bangla handwritten character
    Shibly, Mir Moynuddin Ahmed
    Tisha, Tahmina Akter
    Tani, Tanzina Akter
    Ripon, Shamim
    [J]. PEERJ COMPUTER SCIENCE, 2021, 7 : 1 - 30
  • [6] Convolutional Neural Network-based Jaywalking Data Generation and Classification
    Park, Jaeseo
    Lee, Yunsoo
    Heo, Jun Ho
    Kang, Suk-Ju
    [J]. 2019 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2019, : 132 - 133
  • [7] Improvement of Ancient Shui Character Recognition Model Based on Convolutional Neural Network
    Zhao, Hongshuai
    Chu, Haozhen
    Zhang, Yuanyuan
    Jia, Yu
    [J]. IEEE ACCESS, 2020, 8 : 33080 - 33087
  • [8] Convolutional neural network-based cross-corpus speech emotion recognition with data augmentation and features fusion
    Rashid Jahangir
    Ying Wah Teh
    Ghulam Mujtaba
    Roobaea Alroobaea
    Zahid Hussain Shaikh
    Ihsan Ali
    [J]. Machine Vision and Applications, 2022, 33
  • [9] Convolutional neural network-based cross-corpus speech emotion recognition with data augmentation and features fusion
    Jahangir, Rashid
    Teh, Ying Wah
    Mujtaba, Ghulam
    Alroobaea, Roobaea
    Shaikh, Zahid Hussain
    Ali, Ihsan
    [J]. MACHINE VISION AND APPLICATIONS, 2022, 33 (03)
  • [10] Data augmentation in EUV lithography simulation based on convolutional neural network
    Tanabe, Hiroyoshi
    Takahashi, Atsushi
    [J]. DTCO AND COMPUTATIONAL PATTERNING, 2022, 12052