High Performance Offline Handwritten Chinese Text Recognition with a New Data Preprocessing and Augmentation Pipeline

被引:13
|
作者
Xie, Canyu [1 ]
Lai, Songxuan [1 ]
Liao, Qianying [1 ]
Jin, Lianwen [1 ,2 ]
机构
[1] South China Univ Technol, Coll Elect & Informat Engn, Guangzhou, Peoples R China
[2] SCUT, Zhuhai Inst Modern Ind Innovat, Zhuhai 519000, Peoples R China
来源
DOCUMENT ANALYSIS SYSTEMS | 2020年 / 12116卷
关键词
Offline Handwritten Text Recognition (HCTR); Data preprocessing; Data augmentation; CNN-ResLSTM; NEURAL-NETWORK; SEQUENCE; ONLINE; MODEL;
D O I
10.1007/978-3-030-57058-3_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Offline handwritten text recognition (HCTR) has been a long-standing research topic. To build robust and high-performance offline HCTR systems, it is natural to develop data preprocessing and augmentation techniques, which, however, have not been fully explored. In this paper, we propose a data preprocessing and augmentation pipeline and a CNN-ResLSTM model for high-performance offline HCTR. The data preprocessing and augmentation pipeline consists of three steps: training text sample generation, text sample preprocessing and text sample synthesis. The CNN-ResLSTM model is derived by introducing residual connections into the RNN part of the CRNN architecture. Experiments show that on the proposed CNN-ResLSTM, the data preprocessing and augmentation pipeline can effectively and robustly improve the system performance: On two standard benchmarks, namely the CASIA-HWDB and the ICDAR-2013 handwriting competition dataset, the proposed approach achieves state-of-the-art results with correct rates of 97.28% and 96.99%, respectively. Furthermore, to make our model more practical, we employ model acceleration and compression techniques to build a fast and compact model without sacrificing the accuracy.
引用
收藏
页码:45 / 59
页数:15
相关论文
共 50 条
  • [21] DenseRAN for Offline Handwritten Chinese Character Recognition
    Wang, Wenchao
    Zhang, Jianshu
    Du, Jun
    Wang, Zi-Rui
    Zhu, Yixing
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 104 - 109
  • [22] Online and offline handwritten Chinese character recognition: Benchmarking on new databases
    Liu, Cheng-Lin
    Yin, Fei
    Wang, Da-Han
    Wang, Qiu-Feng
    PATTERN RECOGNITION, 2013, 46 (01) : 155 - 162
  • [23] Offline recognition of large vocabulary cursive handwritten text
    Vinciarelli, A
    Bengio, S
    Bunke, H
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 1101 - 1105
  • [24] Rejection strategies for offline handwritten text line recognition
    Bertolami, Roman
    Zimmermann, Matthias
    Bunke, Horst
    PATTERN RECOGNITION LETTERS, 2006, 27 (16) : 2005 - 2012
  • [25] Offline Handwritten Quranic Text Recognition: A Research Perspective
    Iqbal, Arshad
    Zafar, Aasim
    PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 125 - 128
  • [26] Writer Code Based Adaptation of Deep Neural Network for Offline Handwritten Chinese Text Recognition
    Wang, Zi-Rui
    Du, Jun
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 548 - 553
  • [27] A Bayesian-based probabilistic model for unconstrained handwritten offline Chinese text line recognition
    Li, Nanxi
    Jin, Lianwen
    IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010, : 3664 - 3668
  • [28] Deep Neural Network based Hidden Markov Model for Offline Handwritten Chinese Text Recognition
    Du, Jun
    Wang, Zi-Rui
    Zhai, Jian-Fang
    Hu, Jin-Shui
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3428 - 3433
  • [29] Offline handwritten Chinese character recognition by radical decomposition
    Shi, Daming
    Damper, Robert I.
    Gunn, Steve R.
    ACM Transactions on Asian Language Information Processing, 2003, 2 (01): : 27 - 48
  • [30] Online and offline handwritten Chinese character recognition: A comprehensive study and new benchmark
    Zhang, Xu-Yao
    Bengio, Yoshua
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2017, 61 : 348 - 360