Bidirectional extraction and recognition of scene text with layout consistency

被引:0
|
作者
Hinami, Ryota [1 ]
Liu, Xinhao [2 ]
Chiba, Naoki [3 ]
Satoh, Shin'ichi [4 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] Tokyo Inst Technol, Tokyo 152, Japan
[3] Rakuten Inc, Tokyo, Japan
[4] Natl Inst Informat, Tokyo, Japan
关键词
Scene text recognition; Character recognition; Word recognition; Layout consistency; STYLE;
D O I
10.1007/s10032-016-0261-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text recognition in natural scene images is a challenging task that has recently been garnering increased research attention. In this paper, we propose a method for recognizing text by utilizing the layout consistency of a text string. We estimate the layout (four lines of a text string) using initial character extraction and recognition result. On the basis of the layout consistency across a word, we perform character extraction and recognition again using four lines, which is more accurate than the first process. Our layout estimation method is different from previous methods in terms of exploiting character recognition results and its use of a class-conditional layout model. More accurate and robust estimation is achieved, and it can be used to refine character extraction and recognition. We call this two-way process-from extraction and recognition to layout, and from layout to extraction and recognition-"bidirectional" to discriminate it from previous feedback refinement approaches. Experimental results demonstrate that our bidirectional processes provide a boost to the performance of word recognition.
引用
收藏
页码:83 / 98
页数:16
相关论文
共 50 条
  • [1] Bidirectional extraction and recognition of scene text with layout consistency
    Ryota Hinami
    Xinhao Liu
    Naoki Chiba
    Shin’ichi Satoh
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2016, 19 : 83 - 98
  • [2] Bidirectional Scene Text Recognition with a Single Decoder
    Bleeker, Maurits
    de Rijke, Maarten
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2664 - 2671
  • [3] DISTILLING KNOWLEDGE OF BIDIRECTIONAL LANGUAGE MODEL FOR SCENE TEXT RECOGNITION
    Orihashi, Shota
    Yamazaki, Yoshihiro
    Uchida, Mihiro
    Takashima, Akihiko
    Masumura, Ryo
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2165 - 2169
  • [4] Scene Layout in Text-to-Scene Conversion
    Yang, Fuping
    Zhou, Yun
    Luo, Xiaobo
    [J]. 2014 2ND INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2014, : 891 - 895
  • [5] Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network
    Kantipudi, M. V. V. Prasad
    Kumar, Sandeep
    Jha, Ashish Kumar
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [6] Augmented Scene Text Recognition Using Crosswise Feature Extraction
    Cinu C Kiliroor
    S. Shrija
    R. Ajay
    [J]. Wireless Personal Communications, 2022, 123 : 421 - 436
  • [7] Augmented Scene Text Recognition Using Crosswise Feature Extraction
    Kiliroor, Cinu C.
    Shrija, S.
    Ajay, R.
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2022, 123 (01) : 421 - 436
  • [8] Scene text recognition with context-aware autonomous bidirectional iterative models
    Zhao, Xiaoqing
    Xu, Miaomiao
    Li, Yanbing
    Huang, Hao
    Silamu, Wushour
    [J]. Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 8605 - 8616
  • [9] Gate-based Bidirectional Interactive Decoding Network for Scene Text Recognition
    Gao, Yunze
    Chen, Yingying
    Wang, Jinqiao
    Lu, Hanqing
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2273 - 2276
  • [10] Scene Recognition With Prototype-Agnostic Scene Layout
    Chen, Gongwei
    Song, Xinhang
    Zeng, Haitao
    Jiang, Shuqiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5877 - 5888