A Machine Learning Approach to Hypothesis Decoding in Scene Text Recognition

Authors
Libovicky, Jindrich [1 ]
Neumann, Lukas [2 ]
Pecina, Pavel [1 ]
Matas, Jiri [2 ]
Affiliations
[1] Charles Univ Prague, Inst Formal & Appl Linguist, Prague 1, Czech Republic
[2] Czech Tech Univ, Ctr Machine Percept, CR-16635 Prague 6, Czech Republic
DOI
10.1007/978-3-319-16631-5_13
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Scene Text Recognition (STR) is the task of localizing and transcribing textual information captured in real-world images. With its increasing accuracy, it is becoming a new source of textual data for standard Natural Language Processing tasks, and it poses new problems because of the specific nature of scene text. In this paper, we learn a string hypothesis decoding procedure in an STR pipeline using structured prediction methods that have proved useful in Automatic Speech Recognition and Machine Translation. The model allows a wide range of typographical and language features to be employed in the decoding process. The proposed method is evaluated on a standard dataset and improves both character and word recognition performance over the baseline.
Pages: 169-180
Number of pages: 12
Related papers
50 in total
  • [1] A novel machine learning approach for scene text extraction
    Ansari, Ghulam Jillani
    Shah, Jamal Hussain
    Yasmin, Mussarat
    Sharif, Muhammad
    Fernandes, Steven Lawrence
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 87 : 328 - 340
  • [2] Curriculum learning for scene text recognition
    Yan, Jingzhe
    Tao, Yuefeng
    Zhang, Wanjun
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (04)
  • [3] Hypothesis Preservation Approach to Scene Text Recognition with Weighted Finite-State Transducer
    Yamazoe, Takafumi
    Etoh, Minoru
    Yoshimura, Takeshi
    Tsujino, Kousuke
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 359 - 363
  • [4] SNFR: salient neighbor decoding and text feature refining for scene text recognition
    Lu, Tongwei
    Fan, Huageng
    Chen, Yuqian
    Shao, Pengyan
    MACHINE VISION AND APPLICATIONS, 2025, 36 (02)
  • [5] Scene Text Recognition with Single-Point Decoding Network
    Chen, Lei
    Qin, Haibo
    Zhang, Shi-Xue
    Yang, Chun
    Yin, Xucheng
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 142 - 153
  • [6] NDOrder: Exploring a novel decoding order for scene text recognition
    Zhong, Dajian
    Zhan, Hongjian
    Lyu, Shujing
    Liu, Cong
    Yin, Bing
    Shivakumara, Palaiahnakote
    Pal, Umapada
    Lu, Yue
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [7] Decoding Functional Brain Data for Emotion Recognition: A Machine Learning Approach
    Tulay, Emine Elif
    Balli, Tugce
    ACM TRANSACTIONS ON APPLIED PERCEPTION, 2024, 21 (03)
  • [8] Primitive Representation Learning for Scene Text Recognition
    Yan, Ruijie
    Peng, Liangrui
    Xiao, Shanyu
    Yao, Gang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 284 - 293
  • [9] Relational Contrastive Learning for Scene Text Recognition
    Zhang, Jinglei
    Lin, Tiancheng
    Xu, Yi
    Chen, Kai
    Zhang, Rui
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5764 - 5775
  • [10] A Feature Learning Method for Scene Text Recognition
    Ho Vu Duong
    Quoc Ngoc Ly
    2012 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2012, : 176 - 180