A Machine Learning Approach to Hypothesis Decoding in Scene Text Recognition

被引:0
|
作者
Libovicky, Jindrich [1 ]
Neumann, Lukas [2 ]
Pecina, Pavel [1 ]
Matas, Jiri [2 ]
机构
[1] Charles Univ Prague, Inst Formal & Appl Linguist, Prague 1, Czech Republic
[2] Czech Tech Univ, Ctr Machine Percept, CR-16635 Prague 6, Czech Republic
关键词
D O I
10.1007/978-3-319-16631-5_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Text Recognition (STR) is a task of localizing and transcribing textual information captured in real-word images. With its increasing accuracy, it becomes a new source of textual data for standard Natural Language Processing tasks and poses new problems because of the specific nature of Scene Text. In this paper, we learn a string hypotheses decoding procedure in an STR pipeline using structured prediction methods that proved to be useful in automatic Speech Recognition and Machine Translation. The model allow to employ a wide range of typographical and language features into the decoding process. The proposed method is evaluated on a standard dataset and improves both character and word recognition performance over the baseline.
引用
收藏
页码:169 / 180
页数:12
相关论文
共 50 条
  • [21] Embedding metric learning into an extreme learning machine for scene recognition
    Wang, Chen
    Peng, Guohua
    De Baets, Bernard
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 203
  • [22] An End-to-End Sequence Learning Approach for Text Extraction and Recognition from Scene Image
    Lalitha, G.
    Lavanya, B.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (07): : 220 - 228
  • [23] Scene text detection and recognition with advances in deep learning: a survey
    Liu, Xiyan
    Meng, Gaofeng
    Pan, Chunhong
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (02) : 143 - 162
  • [24] Scene text detection and recognition with advances in deep learning: a survey
    Xiyan Liu
    Gaofeng Meng
    Chunhong Pan
    International Journal on Document Analysis and Recognition (IJDAR), 2019, 22 : 143 - 162
  • [25] A Novel Scene Text Recognition Method Based on Deep Learning
    Wang, Maosen
    Niu, Shaozhang
    Gao, Zhenguang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 60 (02): : 781 - 794
  • [26] Context-Based Contrastive Learning for Scene Text Recognition
    Zhang, Xinyun
    Zhu, Binwu
    Yao, Xufeng
    Sun, Qi
    Li, Ruiyu
    Yu, Bei
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3353 - 3361
  • [27] Scene Text Recognition Based on Deep Learning: A Brief Survey
    Chen, Yuxin
    Shao, Yunxue
    2019 IEEE 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2019), 2019, : 688 - 693
  • [28] Masked and Permuted Implicit Context Learning for Scene Text Recognition
    Yang, Xiaomeng
    Qiao, Zhi
    Wei, Jin
    Yang, Dongbao
    Zhou, Yu
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 964 - 968
  • [29] Rethinking text rectification for scene text recognition
    Ke, Wenjun
    Wei, Jianguo
    Hou, Qingzhi
    Feng, Hui
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 219
  • [30] Correlation-guided decoding strategy for low-resource Uyghur scene text recognition
    Xu, Miaomiao
    Zhang, Jiang
    Xu, Lianghui
    Silamu, Wushour
    Li, Yanbing
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)