Large-Lexicon Attribute-Consistent Text Recognition in Natural Images

被引:59
|
作者
Novikova, Tatiana [1 ]
Barinova, Olga [1 ]
Kohli, Pushmeet [2 ]
Lempitsky, Victor [3 ]
机构
[1] Moscow MV Lomonosov State Univ, Moscow 117234, Russia
[2] Microsoft Res Cambridge, Cambridge, England
[3] Yandex, Moscow, Russia
来源
关键词
D O I
10.1007/978-3-642-33783-3_54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new model for the task of word recognition in natural images that simultaneously models visual and lexicon consistency of words in a single probabilistic model. Our approach combines local likelihood and pairwise positional consistency priors with higher order priors that enforce consistency of characters (lexicon) and their attributes (font and colour). Unlike traditional stage-based methods, word recognition in our framework is performed by estimating the maximum a posteriori (MAP) solution under the joint posterior distribution of the model. MAP inference in our model is performed through the use of weighted finite-state transducers (WFSTs). We show how the efficiency of certain operations on WFSTs can be utilized to find the most likely word under the model in an efficient manner. We evaluate our method on a range of challenging datasets (ICDAR'03, SVT, ICDAR'11). Experimental results demonstrate that our method outperforms state-of-the-art methods for cropped word recognition.
引用
收藏
页码:752 / 765
页数:14
相关论文
共 32 条
  • [21] Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images
    Chandio, Asghar Ali
    Asikuzzamana, Md.
    Pickering, Mark
    Leghari, Mehwish
    DATA IN BRIEF, 2020, 31
  • [22] Analyzing the influence of contrast in large-scale recognition of natural images
    Sanchez, Angel
    Belen Moreno, A.
    Velez, Daniel
    Veleza, Jose F.
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2016, 23 (03) : 221 - 235
  • [23] Urdu-Text Detection and Recognition in Natural Scene Images Using Deep Learning
    Arafat, Syed Yasser
    Iqbal, Muhammad Javed
    IEEE ACCESS, 2020, 8 : 96787 - 96803
  • [24] Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks
    Wu, Xianyu
    Luo, Chao
    Zhang, Qian
    Zhou, Jiliu
    Yang, Hao
    Li, Yulian
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 61 (01): : 289 - 300
  • [25] Natural Text-Driven, Multi-Attribute Editing of Facial Images with Robustness in Sparse Latent Space
    Zou, Jianpeng
    Uchida, Kaoru
    Yao, Wenxin
    Yuen, Kashing
    2023 3RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE, ACCTCS, 2023, : 380 - 385
  • [26] CHARACTER/WORD MODELLING: A TWO-STEP FRAMEWORK FOR TEXT RECOGNITION IN NATURAL SCENE IMAGES
    Priya, M. shanmuga
    Pavithra, A.
    Nelson, Leema
    COMPUTER SCIENCE-AGH, 2024, 25 (04):
  • [27] An efficient ROI detection algorithm for Bangla text extraction and recognition from natural scene images
    Islam, Rashedul
    Islam, Md. Rafiqul
    Talukder, Kamrul Hasan
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 6150 - 6164
  • [28] Cursive Text Recognition in Natural Scene Images Using Deep Convolutional Recurrent Neural Network
    Chandio, Asghar Ali
    Asikuzzaman, MD.
    Pickering, Mark R.
    Leghari, Mehwish
    IEEE ACCESS, 2022, 10 : 10062 - 10078
  • [29] Attention-Based CNN-RNN Arabic Text Recognition from Natural Scene Images
    Butt, Hanan
    Raza, Muhammad Raheel
    Ramzan, Muhammad Javed
    Ali, Muhammad Junaid
    Haris, Muhammad
    FORECASTING, 2021, 3 (03): : 520 - 540
  • [30] Performance Evaluation of Efficient and Accurate Text Detection and Recognition in Natural Scenes Images Using EAST and OCR Fusion
    Soni, Vishnu Kant
    Shukla, Vivek
    Tandan, S. R.
    Pimpalkar, Amit
    Nema, Neetesh Kumar
    Naik, Muskan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 445 - 453