Large-Scale Printed Chinese Character Recognition for ID Cards Using Deep Learning and Few Samples Transfer Learning

被引:3
|
作者
Li, Yi-Quan [1 ,2 ]
Chang, Hao-Sen [1 ]
Lin, Daw-Tung [1 ]
机构
[1] Natl Taipei Univ, Dept Comp Sci & Informat Engn, 151 Univ Rd, Taipei 237303, Taiwan
[2] Orbit Technol Inc, 5F,126 Minzhu West Rd, Taipei 10342, Taiwan
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 02期
关键词
large-scale image classification; printed Chinese character recognition; data synthesis; GoogLeNet-GAP; transfer learning; TEXT; NETWORK;
D O I
10.3390/app12020907
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In the field of computer vision, large-scale image classification tasks are both important and highly challenging. With the ongoing advances in deep learning and optical character recognition (OCR) technologies, neural networks designed to perform large-scale classification play an essential role in facilitating OCR systems. In this study, we developed an automatic OCR system designed to identify up to 13,070 large-scale printed Chinese characters by using deep learning neural networks and fine-tuning techniques. The proposed framework comprises four components, including training dataset synthesis and background simulation, image preprocessing and data augmentation, the process of training the model, and transfer learning. The training data synthesis procedure is composed of a character font generation step and a background simulation process. Three background models are proposed to simulate the factors of the background noise patterns on ID cards. To expand the diversity of the synthesized training dataset, rotation and zooming data augmentation are applied. A massive dataset comprising more than 19.6 million images was thus created to accommodate the variations in the input images and improve the learning capacity of the CNN model. Subsequently, we modified the GoogLeNet neural architecture by replacing the fully connected layer with a global average pooling layer to avoid overfitting caused by a massive amount of training data. Consequently, the number of model parameters was reduced. Finally, we employed the transfer learning technique to further refine the CNN model using a small number of real data samples. Experimental results show that the overall recognition performance of the proposed approach is significantly better than that of prior methods and thus demonstrate the effectiveness of proposed framework, which exhibited a recognition accuracy as high as 99.39% on the constructed real ID card dataset.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Large-scale continual learning for ancient Chinese character recognition
    Xu, Yue
    Zhang, Xu-Yao
    Zhang, Zhaoxiang
    Liu, Cheng-Lin
    [J]. PATTERN RECOGNITION, 2024, 150
  • [2] Large-scale Pollen Recognition with Deep Learning
    de Geus, Andre R.
    Barcelos, Celia A. Z.
    Batista, Marcos A.
    da Silva, Sergio F.
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [3] Large-Scale Bisample Learning on ID Versus Spot Face Recognition
    Zhu, Xiangyu
    Liu, Hao
    Lei, Zhen
    Shi, Hailin
    Yang, Fan
    Yi, Dong
    Qi, Guojun
    Li, Stan Z.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (6-7) : 684 - 700
  • [4] Large-Scale Bisample Learning on ID Versus Spot Face Recognition
    Xiangyu Zhu
    Hao Liu
    Zhen Lei
    Hailin Shi
    Fan Yang
    Dong Yi
    Guojun Qi
    Stan Z. Li
    [J]. International Journal of Computer Vision, 2019, 127 : 684 - 700
  • [5] Deep Learning Based Large Scale Handwritten Devanagari Character Recognition
    Acharya, Shailesh
    Pant, Ashok Kumar
    Gyawali, Prashnna Kumar
    [J]. 2015 9TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA), 2015,
  • [6] Large-scale singer recognition using deep metric learning: an experimental study
    Hu, Shichao
    Liang, Beici
    Chen, Zhouxuan
    Lu, Xiao
    Zhao, Ethan
    Lui, Simon
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [7] EMBEDDED LARGE-SCALE HANDWRITTEN CHINESE CHARACTER RECOGNITION
    Chherawala, Youssouf
    Dolfing, Hans J. G. A.
    Dixon, Ryan S.
    Bellegarda, Jerome R.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8169 - 8173
  • [8] Character Recognition of Components Mounted on Printed Circuit Board Using Deep Learning
    Gang, Sumyung
    Fabrice, Ndayishimiye
    Chung, Daewon
    Lee, Joonjae
    [J]. SENSORS, 2021, 21 (09)
  • [9] Large-Scale Few-Shot Learning: Knowledge Transfer With Class Hierarchy
    Li, Aoxue
    Luo, Tiange
    Lu, Zhiwu
    Xiang, Tao
    Wang, Liwei
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7205 - 7213
  • [10] Large-scale Deep Learning at Baidu
    Yu, Kai
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2211 - 2211