Top-Down and Bottom-up Cues for Scene Text Recognition

Cited by: 0
Authors
Mishra, Anand [1 ]
Alahari, Karteek [2 ]
Jawahar, C. V. [1 ]
Affiliations
[1] IIIT Hyderabad, CVIT, Hyderabad, Andhra Pradesh, India
[2] INRIA WILLOW, Ecole Normale Super, Paris, France
Keywords
SEGMENTATION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Scene text recognition has gained significant attention from the computer vision community in recent years. Recognizing such text is a challenging problem, even more so than the recognition of scanned documents. In this work, we focus on the problem of recognizing text extracted from street images. We present a framework that exploits both bottom-up and top-down cues. The bottom-up cues are derived from individual character detections from the image. We build a Conditional Random Field model on these detections to jointly model the strength of the detections and the interactions between them. We impose top-down cues obtained from a lexicon-based prior, i.e. language statistics, on the model. The optimal word represented by the text image is obtained by minimizing the energy function corresponding to the random field model. We show significant improvements in accuracies on two challenging public datasets, namely Street View Text (over 15%) and ICDAR 2003 (nearly 10%).
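The energy-minimization idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not give the exact energy terms, the graph structure, or the inference method, so the sketch assumes a simple chain over character windows, unary costs equal to the negative log of a detection confidence, pairwise costs taken from smoothed bigram counts over a lexicon, and exact dynamic programming for the minimization. The names `bigram_cost` and `decode`, the toy lexicon, and the detection scores are all hypothetical.

```python
import math
from collections import Counter


def bigram_cost(lexicon, smoothing=1.0):
    """Return a pairwise cost function: -log of a smoothed bigram probability.

    Assumption: the top-down prior is approximated by character bigram
    counts over the lexicon with additive smoothing.
    """
    counts = Counter()
    for word in lexicon:
        for a, b in zip(word, word[1:]):
            counts[(a, b)] += 1
    alphabet = {c for w in lexicon for c in w}
    total = sum(counts.values()) + smoothing * len(alphabet) ** 2
    return lambda a, b: -math.log((counts[(a, b)] + smoothing) / total)


def decode(detections, pairwise, weight=1.0):
    """Minimize a chain energy: sum of unary costs + weight * pairwise costs.

    detections: one dict per character window, mapping a candidate label
    to a detection confidence in (0, 1] (bottom-up cue).
    pairwise:   cost function on adjacent labels (top-down cue).
    """
    # best[label] = (lowest energy of a labelling ending in label, its path)
    best = {c: (-math.log(p), [c]) for c, p in detections[0].items()}
    for window in detections[1:]:
        updated = {}
        for c, p in window.items():
            prev_cost, prev_path = min(
                ((cost + weight * pairwise(pc, c), path)
                 for pc, (cost, path) in best.items()),
                key=lambda t: t[0],
            )
            updated[c] = (prev_cost - math.log(p), prev_path + [c])
        best = updated
    cost, path = min(best.values(), key=lambda t: t[0])
    return "".join(path), cost


# Toy data (hypothetical): the detector slightly prefers 'q' over 'p'.
lexicon = ["open", "pen", "opera", "hope", "spend"]
detections = [
    {"o": 0.6, "c": 0.4},
    {"p": 0.45, "q": 0.55},
    {"e": 0.7, "c": 0.3},
    {"n": 0.8, "h": 0.2},
]
pairwise = bigram_cost(lexicon)
word, energy = decode(detections, pairwise)
bottom_up_only, _ = decode(detections, pairwise, weight=0.0)
print(word, bottom_up_only)
```

With `weight=0.0` only the bottom-up detection scores are used and the most confident label wins in each window; with the lexicon prior switched on, the unlikely bigrams involving `q` are penalized and the lower-scoring `p` is selected instead.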
Pages: 2687-2694
Number of pages: 8