Robust text detection in natural scenes using text geometry and visual appearance

被引:0
|
作者
Yan S.-Y. [1 ]
Xu X.-X. [2 ]
Liu Q.-S. [1 ]
机构
[1] School of Information and Control, Nanjing University of Information Science and Technology, Nanjing
[2] School of Computer Engineering, Nanyang Technological University, Singapore
来源
Yan, Sheng-Ye | 1600年 / Chinese Academy of Sciences卷 / 11期
基金
中国国家自然科学基金;
关键词
geometric rule; multiple kernel learning (MKL); stroke width transform (SWT); support vector machine (SVM); Text detection;
D O I
10.1007/s11633-014-0833-2
中图分类号
学科分类号
摘要
This paper proposes a new two-phase approach to robust text detection by integrating the visual appearance and the geometric reasoning rules. In the first phase, geometric rules are used to achieve a higher recall rate. Specifically, a robust stroke width transform (RSWT) feature is proposed to better recover the stroke width by additionally considering the cross of two strokes and the continuousness of the letter border. In the second phase, a classification scheme based on visual appearance features is used to reject the false alarms while keeping the recall rate. To learn a better classifier from multiple visual appearance features, a novel classification method called double soft multiple kernel learning (DS-MKL) is proposed. DS-MKL is motivated by a novel kernel margin perspective for multiple kernel learning and can effectively suppress the influence of noisy base kernels. Comprehensive experiments on the benchmark ICDAR2005 competition dataset demonstrate the effectiveness of the proposed two-phase text detection approach over the state-of-the-art approaches by a performance gain up to 4.4% in terms of F-measure. © 2014, Institute of Automation, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:480 / 488
页数:8
相关论文
共 50 条
  • [31] Accurate and Robust Text Detection: A Step-In for Text Retrieval in Natural Scene Images
    Yin, Xu-Cheng
    Yin, Xuwang
    Huang, Kaizhu
    Hao, Hong-Wei
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 1091 - 1092
  • [32] Text to visual synthesis with appearance models
    Melenehón, I
    de la Torre, F
    Iriondo, I
    Alías, F
    Martínez, E
    Vicent, H
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 1, PROCEEDINGS, 2003, : 237 - 240
  • [33] TEXT SEGMENTATION IN NATURAL SCENES USING TOGGLE-MAPPING
    Fabrizio, J.
    Marcotegui, B.
    Cord, M.
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 2373 - +
  • [34] Text Detection Algorithm for Natural Scenes under Attention Supervision Strategy
    Haorang L.
    Lingchen Y.
    Ronghua L.
    Long C.
    Hao W.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (07): : 1011 - 1019
  • [35] Text Detection and Recognition in Urban Scenes
    Minetto, R.
    Thome, N.
    Cord, M.
    Stolfi, J.
    Precioso, F.
    Guyomard, J.
    Leite, N. J.
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [36] Fast perspective recovery of text in natural scenes
    Merino-Gracia, Carlos
    Mirmehdi, Majid
    Sigut, Jose
    Gonzalez-Mora, Jose L.
    IMAGE AND VISION COMPUTING, 2013, 31 (10) : 714 - 724
  • [37] Recognizing Text with Perspective Distortion in Natural Scenes
    Trung Quy Phan
    Shivakumara, Palaiahnakote
    Tian, Shangxuan
    Tan, Chew Lim
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 569 - 576
  • [38] Text-Background Decomposition for Thai Text Localization and Recognition in Natural Scenes
    Woraratpanya, Kuntpong
    Pasupa, Kitsuchart
    Suttapakti, Ungsumalee
    Boonchukusol, Pimlak
    Titijaroonroj, Taravichet
    Hokking, Rattaphon
    Kuroki, Yoshimitsu
    Kato, Yasushi
    2014 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2014, : 138 - 143
  • [39] A robust algorithm for text region detection in natural scene images
    Park, Jonghyun
    Lee, Gueesang
    CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING-REVUE CANADIENNE DE GENIE ELECTRIQUE ET INFORMATIQUE, 2008, 33 (3-4): : 215 - 222
  • [40] A robust approach for text detection from natural scene images
    Sun, Lei
    Huo, Qiang
    Jia, Wei
    Chen, Kai
    PATTERN RECOGNITION, 2015, 48 (09) : 2906 - 2920