Robust text detection in natural scenes using text geometry and visual appearance

Cited by: 0
Authors
Yan S.-Y. [1]
Xu X.-X. [2]
Liu Q.-S. [1]
Affiliations
[1] School of Information and Control, Nanjing University of Information Science and Technology, Nanjing
[2] School of Computer Engineering, Nanyang Technological University, Singapore
Source
Yan, Sheng-Ye | 2014 / Chinese Academy of Sciences / Issue 11
Funding
National Natural Science Foundation of China
Keywords
geometric rule; multiple kernel learning (MKL); stroke width transform (SWT); support vector machine (SVM); text detection
DOI
10.1007/s11633-014-0833-2
Abstract
This paper proposes a new two-phase approach to robust text detection that integrates visual appearance with geometric reasoning rules. In the first phase, geometric rules are used to achieve a higher recall rate. Specifically, a robust stroke width transform (RSWT) feature is proposed to better recover the stroke width by additionally considering the crossing of two strokes and the continuity of the letter border. In the second phase, a classification scheme based on visual appearance features rejects false alarms while preserving the recall rate. To learn a better classifier from multiple visual appearance features, a novel classification method called double soft multiple kernel learning (DS-MKL) is proposed. DS-MKL is motivated by a novel kernel margin perspective on multiple kernel learning and can effectively suppress the influence of noisy base kernels. Comprehensive experiments on the benchmark ICDAR2005 competition dataset demonstrate the effectiveness of the proposed two-phase text detection approach over state-of-the-art approaches, with a performance gain of up to 4.4% in F-measure. © 2014, Institute of Automation, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.
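The abstract's remark about suppressing noisy base kernels can be illustrated with the generic weighted-sum form used in multiple kernel learning, K = sum_m d_m K_m with d_m >= 0 and sum_m d_m = 1. The sketch below is not DS-MKL and is not taken from the paper: the function names `rbf_kernel` and `combine_kernels`, the toy feature values, and the hand-picked weights are all assumptions made for illustration. In a real MKL method the weights would be learned together with the classifier.

```python
import math

def rbf_kernel(xs, gamma):
    """Gram matrix of the RBF kernel exp(-gamma * ||u - v||^2) over the rows of xs."""
    return [
        [math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(u, v))) for v in xs]
        for u in xs
    ]

def combine_kernels(kernels, weights):
    """Convex combination sum_m d_m * K_m of equally sized Gram matrices."""
    if len(kernels) != len(weights):
        raise ValueError("need exactly one weight per base kernel")
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    n = len(kernels[0])
    return [
        [sum(w * k[i][j] for w, k in zip(weights, kernels)) for j in range(n)]
        for i in range(n)
    ]

# Hypothetical toy data: two feature views of the same three candidate regions.
view_a = [[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]]
view_b = [[0.5], [0.4], [3.0]]

k_a = rbf_kernel(view_a, gamma=0.5)
k_b = rbf_kernel(view_b, gamma=0.1)

# Hand-picked weights (an assumption): a small weight down-weights a kernel
# that is believed to be noisy; a weight of 0 removes it entirely.
k_mixed = combine_kernels([k_a, k_b], [0.7, 0.3])
k_only_a = combine_kernels([k_a, k_b], [1.0, 0.0])
```

The combined matrix can be passed to any classifier that accepts a precomputed kernel, such as a support vector machine.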
Pages: 480-488
Page count: 8
Related papers
50 in total
  • [41] A method for detecting text of arbitrary shapes in natural scenes that improves text spotting
    Wang, Qitong
    Zheng, Yi
    Betke, Margrit
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020 : 2296 - 2305
  • [42] A robust arbitrary text detection system for natural scene images
    Risnumawan, Anhar
    Shivakumara, Palaiahankote
    Chan, Chee Seng
    Tan, Chew Lim
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (18) : 8027 - 8048
  • [43] Text extraction in natural scenes using region-based method
    Huang, Zhihu
    Leng, Jinsong
    Huang, Zhihu, 1600, Digital Information Research Foundation (12) : 246 - 254
  • [44] A Deep Learning-Based Text Detection and Recognition Approach for Natural Scenes
    Li, Xuexiang
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (05)
  • [45] CNN and Fuzzy Rules Based Text Detection and Recognition from Natural Scenes
    Mithila, T.
    Arunprakash, R.
    Ramachandran, A.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 42 (03) : 1165 - 1179
  • [46] A Text Detection System for Natural Scenes with Convolutional Feature Learning and Cascaded Classification
    Zhu, Siyu
    Zanibbi, Richard
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016 : 625 - 632
  • [47] Expressive Visual Text-To-Speech Using Active Appearance Models
    Anderson, Robert
    Stenger, Bjoern
    Wan, Vincent
    Cipolla, Roberto
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013 : 3382 - 3389
  • [48] A robust video text detection approach using SVM
    Wei, Yi Cheng
    Lin, Chang Hong
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (12) : 10832 - 10840
  • [49] Distributional semantics of objects in visual scenes in comparison to text
    Lueddecke, Timo
    Agostini, Alejandro
    Fauth, Michael
    Tamosiunaite, Minija
    Woergoetter, Florentin
    ARTIFICIAL INTELLIGENCE, 2019, 274 : 44 - 65
  • [50] Text Detection and Recognition on Traffic Panels From Street-Level Imagery Using Visual Appearance
    Gonzalez, Alvaro
    Bergasa, Luis M.
    Javier Yebes, J.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (01) : 228 - 238