Rectification and recognition of text in 3-D scenes

Cited by: 25

Authors:

Myers G.K. [1]

Bolles R.C. [1]

Luong Q.-T. [1]

Herson J.A. [1]

Aradhye H.B. [1]

Affiliation:

[1] SRI International, Menlo Park, CA 94025

Source:

International Journal of Document Analysis and Recognition (IJDAR) | 2005 / Vol. 7 / No. 2-3

Keywords:

Multimedia content analysis; Perspective rectification; Scene text; Video OCR; Videotext recognition

DOI:

10.1007/s10032-004-0133-4

Abstract:

Real-world text on street signs, nameplates, etc. often lies in an oblique plane and hence cannot be recognized by traditional OCR systems due to perspective distortion. Furthermore, such text often comprises only one or two lines, preventing the use of existing perspective rectification methods that were primarily designed for images of document pages. We propose an approach that reliably rectifies and subsequently recognizes individual lines of text. Our system, which includes novel algorithms for extraction of text from real-world scenery, perspective rectification, and binarization, has been rigorously tested on still imagery as well as on MPEG-2 video clips in real time. © Springer-Verlag 2005.

Pages: 147-158

Page count: 11
