Text extraction from scene images by character appearance and structure modeling

被引:80
|
作者
Yi, Chucai
Tian, Yingli [1 ]
机构
[1] CUNY, City Coll New York, New York, NY 10031 USA
关键词
Text detection; Scene image; Character appearance; Structure modeling; Structure difference; Structure component co-occurrence; Character identification; CLASSIFICATION;
D O I
10.1016/j.cviu.2012.11.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel algorithm to detect text information from natural scene images. Scene text classification and detection are still open research topics. Our proposed algorithm is able to model both character appearance and structure to generate representative and discriminative text descriptors. The contributions of this paper include three aspects: (1) a new character appearance model by a structure correlation algorithm which extracts discriminative appearance features from detected interest points of character samples; (2) a new text descriptor based on structons and correlatons, which model character structure by structure differences among character samples and structure component co-occurrence; and (3) a new text region localization method by combining color decomposition, character contour refinement, and string line alignment to localize character candidates and refine detected text regions. We perform three groups of experiments to evaluate the effectiveness of our proposed algorithm, including text classification, text detection, and character identification. The evaluation results on benchmark datasets demonstrate that our algorithm achieves the state-of-the-art performance on scene text classification and detection, and significantly outperforms the existing algorithms for character identification. (c) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:182 / 194
页数:13
相关论文
共 50 条
  • [1] Character Energy and Link Energy-Based Text Extraction in Scene Images
    Zhang, Jing
    Kasturi, Rangachar
    COMPUTER VISION - ACCV 2010, PT II, 2011, 6493 : 308 - 320
  • [2] Devanagari Text Extraction from Natural Scene Images
    Raj, Hrishav
    Ghosh, Rajib
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 513 - 517
  • [3] Video text extraction from images for character recognition
    Amarapur, Basavaraj
    Patil, Nagaraj
    2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 95 - +
  • [4] Scene text extraction in complex images
    Byun, HR
    Roh, MC
    Kim, KC
    Choi, YW
    Lee, SW
    DOCUMENT ANALYSIS SYSTEM V, PROCEEDINGS, 2002, 2423 : 329 - 340
  • [5] Character extraction from natural scene images by hierarchical classifiers
    Yamaguchi, T
    Maruyama, M
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 687 - 690
  • [6] TEXT DETECTION AND CHARACTER EXTRACTION IN NATURAL SCENE IMAGES USING FRACTIONAL POISSON MODEL
    Rajan, Veena
    Raj, Shani
    2017 INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC), 2017, : 1136 - 1141
  • [7] Text Extraction from Scene Images using Statistical Distributions
    Ghoshal, Ranjit
    Roy, Anandarup
    Parui, Swapan K.
    2012 THIRD INTERNATIONAL CONFERENCE ON EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2012, : 187 - 190
  • [8] Character extraction and recognition in natural scene images
    Wang, XW
    Ding, XQ
    Liu, CS
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 1084 - 1088
  • [9] Efficient Character Skew Rectification in Scene Text Images
    Busta, Michal
    Drtina, Tomas
    Helekal, David
    Neumann, Lukas
    Matas, Jiri
    COMPUTER VISION - ACCV 2014 WORKSHOPS, PT II, 2015, 9009 : 134 - 146
  • [10] Character String Extraction from Scene Images by Eliminating Non-character Elements
    Takagi, Noboru
    Chen, Jianjun
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3685 - 3690