Bidirectional Scene Text Recognition with a Single Decoder

Cited by: 6
Authors
Bleeker, Maurits [1 ]
de Rijke, Maarten [1 ]
Affiliations
[1] Univ Amsterdam, Amsterdam, Netherlands
Keywords
DOI
10.3233/FAIA200404
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Scene Text Recognition (STR) is the problem of recognizing the correct word or character sequence in a cropped word image. To obtain more robust output sequences, the notion of bidirectional STR has been introduced. So far, bidirectional STR methods have been implemented with two separate decoders: one for left-to-right decoding and one for right-to-left decoding. Having two separate decoders for almost the same task with the same output space is undesirable from a computational and optimization point of view. We introduce the Bidirectional Scene Text Transformer (Bi-STET), a novel bidirectional STR method with a single decoder for bidirectional text decoding. With its single decoder, Bi-STET outperforms methods that apply bidirectional decoding with two separate decoders, while also being more efficient than those methods. Furthermore, Bi-STET matches or outperforms state-of-the-art (SOTA) methods on all STR benchmarks. Finally, we provide analyses of and insights into the performance of Bi-STET.
Pages: 2664-2671
Number of pages: 8