A Machine Learning Approach to Hypothesis Decoding in Scene Text Recognition

Authors
Libovicky, Jindrich [1]
Neumann, Lukas [2]
Pecina, Pavel [1]
Matas, Jiri [2]
Affiliations
[1] Charles Univ Prague, Inst Formal & Appl Linguist, Prague 1, Czech Republic
[2] Czech Tech Univ, Ctr Machine Percept, CR-16635 Prague 6, Czech Republic
DOI
10.1007/978-3-319-16631-5_13
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Scene Text Recognition (STR) is the task of localizing and transcribing textual information captured in real-world images. As its accuracy increases, it becomes a new source of textual data for standard Natural Language Processing tasks, and it poses new problems because of the specific nature of scene text. In this paper, we learn a string hypothesis decoding procedure in an STR pipeline using structured prediction methods that have proved useful in Automatic Speech Recognition and Machine Translation. The model allows a wide range of typographical and language features to be employed in the decoding process. The proposed method is evaluated on a standard dataset and improves both character and word recognition performance over the baseline.
Pages: 169-180
Number of pages: 12