Large-Lexicon Attribute-Consistent Text Recognition in Natural Images

被引:59
|
作者
Novikova, Tatiana [1 ]
Barinova, Olga [1 ]
Kohli, Pushmeet [2 ]
Lempitsky, Victor [3 ]
机构
[1] Moscow MV Lomonosov State Univ, Moscow 117234, Russia
[2] Microsoft Res Cambridge, Cambridge, England
[3] Yandex, Moscow, Russia
来源
关键词
D O I
10.1007/978-3-642-33783-3_54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new model for the task of word recognition in natural images that simultaneously models visual and lexicon consistency of words in a single probabilistic model. Our approach combines local likelihood and pairwise positional consistency priors with higher order priors that enforce consistency of characters (lexicon) and their attributes (font and colour). Unlike traditional stage-based methods, word recognition in our framework is performed by estimating the maximum a posteriori (MAP) solution under the joint posterior distribution of the model. MAP inference in our model is performed through the use of weighted finite-state transducers (WFSTs). We show how the efficiency of certain operations on WFSTs can be utilized to find the most likely word under the model in an efficient manner. We evaluate our method on a range of challenging datasets (ICDAR'03, SVT, ICDAR'11). Experimental results demonstrate that our method outperforms state-of-the-art methods for cropped word recognition.
引用
收藏
页码:752 / 765
页数:14
相关论文
共 32 条
  • [1] Ensemble Attention For Text Recognition In Natural Images
    Gao, Hongchao
    Li, Yujia
    Wang, Xi
    Han, Jizhong
    Li, Ruixuan
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [2] Text Detection and Recognition in Natural Scene Images
    Huang, Xiaoming
    Shen, Tao
    Wang, Run
    Gao, Chenqiang
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ESTIMATION, DETECTION AND INFORMATION FUSION ICEDIF 2015, 2015, : 44 - 49
  • [3] Text Detection and Recognition in Natural Scene Images
    Pise, Amruta
    Ruikar, S. D.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [4] Integrated Text Detection and Recognition in Natural Images
    Roubtsova, Nadejda S.
    Wijnhoven, Rob G. J.
    de With, Peter H. N.
    IMAGE PROCESSING: ALGORITHMS AND SYSTEMS X AND PARALLEL PROCESSING FOR IMAGING APPLICATIONS II, 2012, 8295
  • [5] Ensemble Attention for Text Recognition in Natural Images
    Gao, Hongchao
    Li, Yujia
    Wang, Xi
    Han, Jizhong
    Li, Ruixuan
    Proceedings of the International Joint Conference on Neural Networks, 2019, 2019-July
  • [6] Simultaneous Recognition of Horizontal and Vertical Text in Natural Images
    Choi, Chankyu
    Yoon, Youngmin
    Lee, Junsu
    Kim, Junseok
    COMPUTER VISION - ACCV 2018 WORKSHOPS, 2019, 11367 : 202 - 212
  • [7] Research on the Text Detection and Recognition in Natural Scene Images
    Wei Zi-han
    Du Xiao-ping
    Cao Lei
    ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [8] Research on Text Location and Recognition in Natural Images with Deep Learning
    Zhang, Ping
    Shi, Ziyu
    Gao, Haichang
    2018 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN ARTIFICIAL INTELLIGENCE (ICAAI 2018), 2015, : 1 - 6
  • [9] Focusing Attention: Towards Accurate Text Recognition in Natural Images
    Cheng, Zhanzhan
    Bai, Fan
    Xu, Yunlu
    Zheng, Gang
    Pu, Shiliang
    Zhou, Shuigeng
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5086 - 5094
  • [10] Arabic Cursive Text Recognition from Natural Scene Images
    Bin Ahmed, Saad
    Naz, Saeeda
    Razzak, Muhammad Imran
    Yusof, Rubiyah
    APPLIED SCIENCES-BASEL, 2019, 9 (02):