Chinese Image Text Recognition with BLSTM-CTC: A Segmentation-Free Method

被引:8
|
作者
Zhai, Chuanlei [1 ]
Chen, Zhineng [1 ]
Li, Jie [1 ]
Xu, Bo [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Interact Digital Media Technol Res Ctr, Beijing 100190, Peoples R China
来源
关键词
Chinese image text recognition; BLSTM; CTC; Segmentation-free;
D O I
10.1007/978-981-10-3005-5_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents BLSTM-CTC (bidirectional LSTM-Connectionist Temporal Classification), a novel scheme to tackle the Chinese image text recognition problem. Different from traditional methods that perform the recognition on the single character level, the input of BLSTM-CTC is an image text composed of a line of characters and the output is a recognized text sequence, where the recognition is carried out on the whole image text level. To train a neural network for this challenging task, we collect over 2 million news titles from which we generate over 1 million noisy image texts, covering almost the vast majority of common Chinese characters. With these training data, a RNN training procedure is conducted to learn the recognizer. We also carry out some adaptations on the neural network to make it suitable for real scenarios. Experiments on text images from 13 TV channels demonstrate the effectiveness of the proposed pipeline. The results all outperform those of a baseline system.
引用
收藏
页码:525 / 536
页数:12
相关论文
共 50 条
  • [1] Comparative study of HMM and BLSTM segmentation-free approaches for the recognition of handwritten text-lines
    Morillot, Olivier
    Likforman-Sulem, Laurence
    Grosicki, Emmanuele
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 783 - 787
  • [2] Faster Segmentation-Free Handwritten Chinese Text Recognition with Character Decompositions
    Bluche, Theodore
    Messina, Ronaldo
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 530 - 535
  • [3] Segmentation-free Handwritten Chinese Text Recognition with LSTM-RNN
    Messina, Ronaldo
    Louradour, Jerome
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 171 - 175
  • [4] A segmentation-free approach to text recognition with application to Arabic text
    Al-Badr B.
    Haralick R.M.
    International Journal on Document Analysis and Recognition, 1998, 1 (3) : 147 - 166
  • [5] A segmentation-free approach to text recognition with application to Arabic text
    Department of Computer Science and Engineering, University of Washington, Mail Stop FR-35, Seattle, WA 98195, United States
    Int. J. Doc. Anal. Recogn., 3 (147-166):
  • [6] Segmentation-free speech text recognition for comic books
    Rigaud, Christophe
    Burie, Jean-Christophe
    Ogier, Jean-Marc
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 3, 2017, : 29 - 34
  • [7] Segmentation-Free Guidance for Text-to-Image Diffusion Models
    Azarian, Kambiz
    Das, Debasmit
    Hou, Qiqi
    Porikli, Fatih
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 7520 - 7529
  • [8] IMAGE CLASSIFICATION BASED ON SEGMENTATION-FREE OBJECT RECOGNITION
    Ma, Jun
    Zheng, Long
    Yaguchi, Yuichi
    Dong, Mianxiong
    Oka, Ryuichi
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2157 - 2160
  • [9] Segmentation-free optical character recognition for printed Urdu text
    Israr Ud Din
    Imran Siddiqi
    Shehzad Khalid
    Tahir Azam
    EURASIP Journal on Image and Video Processing, 2017
  • [10] Segmentation-free optical character recognition for printed Urdu text
    Din, Israr Ud
    Siddiqi, Imran
    Khalid, Shehzad
    Azam, Tahir
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2017,