Handwritten Chinese Text Recognition Using Separable Multi-Dimensional Recurrent Neural Network

被引:38
|
作者
Wu, Yi-Chao [1 ,2 ]
Yin, Fei [1 ]
Chen, Zhuo [1 ,2 ]
Liu, Cheng-Lin [1 ,2 ]
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, 95 Zhongguan East Rd, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
handwritten Chinese text recognition; separable multidimensional recurrent neural network; bidirectional LSTM-RNN; WFST-based decoding; ONLINE;
D O I
10.1109/ICDAR.2017.22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) has been demonstrated successful in handwritten text recognition of Western and Arabic scripts. It is totally segmentation free and can be trained directly from text line images. However, the application of LSTM-RNNs (including Multi-Dimensional LSTM-RNN (MDLSTM-RNN)) to Chinese text recognition has shown limited success, even when training them with large datasets and using pre-training on datasets of other languages. In this paper, we propose a handwritten Chinese text recognition method by using Separable MDLSTM-RNN (SMDLSTM-RNN) modules, which extract contextual information in various directions, and consume much less computation efforts and resources compared with the traditional MDLSTM-RNN. Experimental results on the ICDAR-2013 competition dataset show that the proposed method performs significantly better than the previous LSTM-based methods, and can compete with the state-of-the-art systems.
引用
收藏
页码:79 / 84
页数:6
相关论文
共 50 条
  • [1] Recurrent Neural Network Transducer for Japanese and Chinese Offline Handwritten Text Recognition
    Ngo, Trung Tan
    Nguyen, Hung Tuan
    Ly, Nam Tuan
    Nakagawa, Masaki
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II, 2021, 12917 : 364 - 376
  • [2] Fully Convolutional Recurrent Network for Handwritten Chinese Text Recognition
    Xie, Zecheng
    Sun, Zenghui
    Jin, Lianwen
    Feng, Ziyong
    Zhang, Shuye
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4011 - 4016
  • [3] Segmented Handwritten Text Recognition with Recurrent Neural Network Classifiers
    Sui, Bolan
    Zhang, Xi
    Lui, Shijian
    Tan, Chew Lim
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 386 - 390
  • [4] Arabic Video Text Recognition Based on Multi-Dimensional Recurrent Neural Networks
    Zayene, Oussama
    Amamou, Soumaya Essefi
    BenAmara, Najoua Essoukri
    [J]. 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 725 - 729
  • [5] Urdu Nasta'liq text recognition system based on multi-dimensional recurrent neural network and statistical features
    Naz, Saeeda
    Umar, Arif I.
    Ahmad, Riaz
    Ahmed, Saad B.
    Shirazi, Syed H.
    Razzak, Muhammad I.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2017, 28 (02): : 219 - 231
  • [6] Urdu Nasta’liq text recognition system based on multi-dimensional recurrent neural network and statistical features
    Saeeda Naz
    Arif I. Umar
    Riaz Ahmad
    Saad B. Ahmed
    Syed H. Shirazi
    Muhammad I. Razzak
    [J]. Neural Computing and Applications, 2017, 28 : 219 - 231
  • [7] Evaluation of Neural Network Language Models In Handwritten Chinese Text Recognition
    Wu, Yi-Chao
    Yin, Fei
    Liu, Cheng-Lin
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 166 - 170
  • [8] In-Air Handwritten Chinese Text Recognition with Attention Convolutional Recurrent Network
    Wu, Zhihong
    Qu, Xiwen
    Huang, Jun
    Wu, Xuangou
    [J]. MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 695 - 707
  • [9] In-air handwritten Chinese text recognition with temporal convolutional recurrent network
    Gan, Ji
    Wang, Weiqiang
    Lu, Ke
    [J]. PATTERN RECOGNITION, 2020, 97
  • [10] Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models
    Wu, Yi-Chao
    Yin, Fei
    Liu, Cheng-Lin
    [J]. PATTERN RECOGNITION, 2017, 65 : 251 - 264