Robust text detection in natural scenes using text geometry and visual appearance

被引:0
|
作者
Yan S.-Y. [1 ]
Xu X.-X. [2 ]
Liu Q.-S. [1 ]
机构
[1] School of Information and Control, Nanjing University of Information Science and Technology, Nanjing
[2] School of Computer Engineering, Nanyang Technological University, Singapore
来源
Yan, Sheng-Ye | 1600年 / Chinese Academy of Sciences卷 / 11期
基金
中国国家自然科学基金;
关键词
geometric rule; multiple kernel learning (MKL); stroke width transform (SWT); support vector machine (SVM); Text detection;
D O I
10.1007/s11633-014-0833-2
中图分类号
学科分类号
摘要
This paper proposes a new two-phase approach to robust text detection by integrating the visual appearance and the geometric reasoning rules. In the first phase, geometric rules are used to achieve a higher recall rate. Specifically, a robust stroke width transform (RSWT) feature is proposed to better recover the stroke width by additionally considering the cross of two strokes and the continuousness of the letter border. In the second phase, a classification scheme based on visual appearance features is used to reject the false alarms while keeping the recall rate. To learn a better classifier from multiple visual appearance features, a novel classification method called double soft multiple kernel learning (DS-MKL) is proposed. DS-MKL is motivated by a novel kernel margin perspective for multiple kernel learning and can effectively suppress the influence of noisy base kernels. Comprehensive experiments on the benchmark ICDAR2005 competition dataset demonstrate the effectiveness of the proposed two-phase text detection approach over the state-of-the-art approaches by a performance gain up to 4.4% in terms of F-measure. © 2014, Institute of Automation, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:480 / 488
页数:8
相关论文
共 50 条
  • [1] Robust Text Detection in Natural Scenes Using Text Geometry and Visual Appearance
    Sheng-Ye Yan
    Xin-Xing Xu
    Qing-Shan Liu
    International Journal of Automation and Computing, 2014, (05) : 480 - 488
  • [2] Robust Text Detection in Natural Scenes Using Text Geometry and Visual Appearance
    ShengYe Yan
    XinXing Xu
    QingShan Liu
    International Journal of Automation & Computing, 2014, 11 (05) : 480 - 488
  • [3] Finding Text in Natural Scenes by Visual Attention
    Mesquita, R. G.
    Mello, C. A. B.
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 4243 - 4247
  • [4] Text Detection in Natural Scenes using Fully Convolutional DenseNets
    Behzadi, Mitra
    Safabakhsh, Reza
    2018 4TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2018, : 11 - 14
  • [5] IBN-STR: A Robust Text Recognizer for Irregular Text in Natural Scenes
    Li, Xiaoqian
    Liu, Jie
    Zhang, Guixuan
    Zhang, Shuwu
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9522 - 9528
  • [6] Text detection in natural scenes: a literature review
    Bai, Zhi-Cheng
    Li, Qing
    Chen, Peng
    Guo, Li-Qing
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2020, 42 (11): : 1433 - 1448
  • [7] A robust approach for recognition of text embedded in natural scenes
    Jing, Z
    Chen, XL
    Hanneman, A
    Jie, Y
    Waibel, A
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 204 - 207
  • [8] A hybrid approach for text detection in natural scenes
    Wang, Runmin
    Sang, Nong
    Wang, Ruolin
    Kuang, Xiaoqin
    MIPPR 2013: PATTERN RECOGNITION AND COMPUTER VISION, 2013, 8919
  • [9] Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering
    Maryam, Hiba
    Fu, Ling
    Song, Jiajun
    Shafayet, Tajrian A. B. M.
    Luo, Qidi
    Bai, Xiang
    Liu, Yuliang
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 279 - 292
  • [10] Arabic Text Detection on Traffic Panels in Natural Scenes
    Turki, Houssem
    Elleuch, Mohamed
    Othman, Kamal
    Kherallah, Monji
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2024, 21 (04) : 571 - 588