Chinese Image Text Recognition with BLSTM-CTC: A Segmentation-Free Method

被引:8
|
作者
Zhai, Chuanlei [1 ]
Chen, Zhineng [1 ]
Li, Jie [1 ]
Xu, Bo [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Interact Digital Media Technol Res Ctr, Beijing 100190, Peoples R China
来源
关键词
Chinese image text recognition; BLSTM; CTC; Segmentation-free;
D O I
10.1007/978-981-10-3005-5_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents BLSTM-CTC (bidirectional LSTM-Connectionist Temporal Classification), a novel scheme to tackle the Chinese image text recognition problem. Different from traditional methods that perform the recognition on the single character level, the input of BLSTM-CTC is an image text composed of a line of characters and the output is a recognized text sequence, where the recognition is carried out on the whole image text level. To train a neural network for this challenging task, we collect over 2 million news titles from which we generate over 1 million noisy image texts, covering almost the vast majority of common Chinese characters. With these training data, a RNN training procedure is conducted to learn the recognizer. We also carry out some adaptations on the neural network to make it suitable for real scenarios. Experiments on text images from 13 TV channels demonstrate the effectiveness of the proposed pipeline. The results all outperform those of a baseline system.
引用
收藏
页码:525 / 536
页数:12
相关论文
共 50 条
  • [21] A segmentation-free method for image classification based on pixel-wise matching
    Ma, Jun
    Zheng, Long
    Dong, Mianxiong
    He, Xiangjian
    Guo, Minyi
    Yaguchi, Yuichi
    Oka, Ryuichi
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2013, 79 (02) : 256 - 268
  • [22] A Segmentation-Free Approach for Printed Devanagari Script Recognition
    Karayil, Tushar
    Ul-Hasan, Adnan
    Breuel, Thomas M.
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 946 - 950
  • [23] Segmentation-Free Approaches for Handwritten Numeral String Recognition
    Hochuli, Andre G.
    Oliveira, Luiz S.
    Britto Jr, Alceu de Souza
    Sabourin, Robert
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [24] Segmentation-free composite character recognition (CR) in bilingual handwritten text for Gurumukhi-English scripts
    Kaur, Sukhandeep
    Bawa, Seema
    Kumar, Ravinder
    SOFT COMPUTING, 2023, 27 (21) : 16159 - 16178
  • [25] Text-Conditioned Character Segmentation for CTC-Based Text Recognition
    Tanaka, Ryohei
    Osada, Kunio
    Furuhata, Akio
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 142 - 156
  • [26] A segmentation-free isogeometric extended mortar contact method
    Duong, Thang X.
    De Lorenzis, Laura
    Sauer, Roger A.
    COMPUTATIONAL MECHANICS, 2019, 63 (02) : 383 - 407
  • [27] A segmentation-free isogeometric extended mortar contact method
    Thang X. Duong
    Laura De Lorenzis
    Roger A. Sauer
    Computational Mechanics, 2019, 63 : 383 - 407
  • [28] OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold
    Yousef, Mohamed
    Bishop, Tom E.
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 14698 - 14707
  • [29] DAN: A Segmentation-Free Document Attention Network for Handwritten Document Recognition
    Coquenet, Denis
    Chatelain, Clement
    Paquet, Thierry
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8227 - 8243
  • [30] Recognition of Handwritten Chinese Text by Segmentation: A Segment-Annotation-Free Approach
    Peng, Dezhi
    Jin, Lianwen
    Ma, Weihong
    Xie, Canyu
    Zhang, Hesuo
    Zhu, Shenggao
    Li, Jing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2368 - 2381