Rectification and recognition of text in 3-D scenes

Cited by: 25
Authors
Myers G.K. [1]
Bolles R.C. [1]
Luong Q.-T. [1]
Herson J.A. [1]
Aradhye H.B. [1]
Affiliation
[1] SRI International, Menlo Park, CA 94025
Keywords
Multimedia content analysis; Perspective rectification; Scene text; Video OCR; Videotext recognition
DOI
10.1007/s10032-004-0133-4
Abstract
Real-world text on street signs, nameplates, etc. often lies in an oblique plane and hence cannot be recognized by traditional OCR systems due to perspective distortion. Furthermore, such text often comprises only one or two lines, preventing the use of existing perspective rectification methods that were primarily designed for images of document pages. We propose an approach that reliably rectifies and subsequently recognizes individual lines of text. Our system, which includes novel algorithms for extraction of text from real-world scenery, perspective rectification, and binarization, has been rigorously tested on still imagery as well as on MPEG-2 video clips in real time. © Springer-Verlag 2005.
Pages: 147-158
Page count: 12