Data augmentation for handwritten digit recognition using generative adversarial networks

被引:0
|
作者
Jha, Ganesh [1 ]
Cecotti, Hubert [1 ]
机构
[1] Calif State Univ Fresno Fresno State, Coll Sci & Math, Dept Comp Sci, 2576 E San Ramon MS ST 109, Fresno, CA 93740 USA
关键词
Machine learning; Neural networks; Classification; Generative adversarial networks; CHARACTER-RECOGNITION;
D O I
10.1007/s11042-020-08883-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Supervised learning techniques require labeled examples that can be time consuming to obtain. In particular, deep learning approaches, where all the feature extraction stages are learned within the artificial neural network, require a large number of labeled examples to train the model. Various data augmentation techniques can be performed to overcome this issue by taking advantage of known variations that have no impact on the label of an example. Typical solutions in computer vision and document analysis and recognition are based on geometric transformations (e.g. shift and rotation) and random elastic deformations of the original training examples. In this paper, we consider Generative Adversarial Networks (GAN), a technique that does not require prior knowledge of the possible variabilities that exist across examples to create novel artificial examples. In the case of a training dataset with a low number of labeled examples, which are described in a high dimensional space, the classifier may generalize poorly. Therefore, we aim at enriching databases of images or signals for improving the classifier performance by designing a GAN for creating artificial images. While adding more images through a GAN can help, the extent to which it will help is unknown, and it may degrade the performance if too many artificial images are added. The approach is tested on four datasets on handwritten digits (Latin, Bangla, Devanagri, and Oriya). The accuracy for each dataset shows that the addition of GAN generated images in the training dataset provides an improvement of the accuracy. However, the results suggest that the addition of too many GAN generated images deteriorates the performance.
引用
收藏
页码:35055 / 35068
页数:14
相关论文
共 50 条
  • [1] Data augmentation for handwritten digit recognition using generative adversarial networks
    Ganesh Jha
    Hubert Cecotti
    [J]. Multimedia Tools and Applications, 2020, 79 : 35055 - 35068
  • [2] Data augmentation for handwritten digit recognition using generative adversarial networks
    Jha, Ganesh
    Cecotti, Hubert
    [J]. Multimedia Tools and Applications, 2020, 79 (47-48): : 35055 - 35068
  • [3] Separation and recognition of overlapping handwritten digit images based on generative adversarial networks
    Wei, Jiacheng
    Dong, Ran
    Cai, Chengtao
    Lin, Xiaozhu
    Song, Huijia
    Wang, Xiangyu
    [J]. Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 45 (11): : 2226 - 2234
  • [4] Data augmentation using generative adversarial networks for robust speech recognition
    Qian, Yanmin
    Hu, Hu
    Tan, Tian
    [J]. SPEECH COMMUNICATION, 2019, 114 : 1 - 9
  • [5] Handwritten digit recognition based on conditional generative adversarial network
    Wang Ai-li
    Xue Dong
    Wu Hai-bin
    Wang Min-hui
    [J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2020, 35 (12) : 1284 - 1290
  • [6] Data Augmentation using Conditional Generative Adversarial Networks for Robust Speech Recognition
    Sheng, Peiyao
    Yang, Zhuolin
    Hu, Hu
    Tan, Tian
    Qian, Yanmin
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 121 - 125
  • [7] A Comparative Study of Engraved-Digit Data Augmentation by Generative Adversarial Networks
    Abdulraheem, Abdulkabir
    Jung, Im Y.
    [J]. SUSTAINABILITY, 2022, 14 (19)
  • [8] Using generative models for handwritten digit recognition
    Revow, M
    Williams, CKI
    Hinton, GE
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1996, 18 (06) : 592 - 606
  • [9] Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
    Eltay, Mohamed
    Zidouri, Abdelmalek
    Ahmad, Irfan
    Elarian, Yousef
    [J]. PEERJ COMPUTER SCIENCE, 2022, 8
  • [10] MCMC Based Generative Adversarial Networks for Handwritten Numeral Augmentation
    Zhang, He
    Luo, Chunbo
    Yu, Xingrui
    Ren, Peng
    [J]. COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2019, 463 : 2702 - 2710