Robust text detection in natural scenes using text geometry and visual appearance

被引：0

作者：

Yan S.-Y. ^{[1
]}

Xu X.-X. ^{[2
]}

Liu Q.-S. ^{[1
]}

机构：

[1] School of Information and Control, Nanjing University of Information Science and Technology, Nanjing

[2] School of Computer Engineering, Nanyang Technological University, Singapore

来源：

Yan, Sheng-Ye | 1600年 / Chinese Academy of Sciences卷 / 11期

基金：

中国国家自然科学基金;

关键词：

geometric rule; multiple kernel learning (MKL); stroke width transform (SWT); support vector machine (SVM); Text detection;

D O I：

10.1007/s11633-014-0833-2

中图分类号：

学科分类号：

摘要：

This paper proposes a new two-phase approach to robust text detection by integrating the visual appearance and the geometric reasoning rules. In the first phase, geometric rules are used to achieve a higher recall rate. Specifically, a robust stroke width transform (RSWT) feature is proposed to better recover the stroke width by additionally considering the cross of two strokes and the continuousness of the letter border. In the second phase, a classification scheme based on visual appearance features is used to reject the false alarms while keeping the recall rate. To learn a better classifier from multiple visual appearance features, a novel classification method called double soft multiple kernel learning (DS-MKL) is proposed. DS-MKL is motivated by a novel kernel margin perspective for multiple kernel learning and can effectively suppress the influence of noisy base kernels. Comprehensive experiments on the benchmark ICDAR2005 competition dataset demonstrate the effectiveness of the proposed two-phase text detection approach over the state-of-the-art approaches by a performance gain up to 4.4% in terms of F-measure. © 2014, Institute of Automation, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.

引用

页码：480 / 488

页数：8

共 50 条

[41] A method for detecting text of arbitrary shapes in natural scenes that improves text spotting
Wang, Qitong
Zheng, Yi
Betke, Margrit
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2296 - 2305
[42] A robust arbitrary text detection system for natural scene images
Risnumawan, Anhar
Shivakumara, Palaiahankote
Chan, Chee Seng
Tan, Chew Lim
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (18) : 8027 - 8048
[43] Text extraction in natural scenes using region-based method
Huang, Zhihu
Leng, Jinsong
Huang, Zhihu, 1600, Digital Information Research Foundation (12): : 246 - 254
[44] A Deep Learning-Based Text Detection and Recognition Approach for Natural Scenes
Li, Xuexiang
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (05)
[45] CNN and Fuzzy Rules Based Text Detection and Recognition from Natural Scenes
Mithila, T.
Arunprakash, R.
Ramachandran, A.
COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 42 (03): : 1165 - 1179
[46] A Text Detection System for Natural Scenes with Convolutional Feature Learning and Cascaded Classification
Zhu, Siyu
Zanibbi, Richard
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 625 - 632
[47] Expressive Visual Text-To-Speech Using Active Appearance Models
Anderson, Robert
Stenger, Bjoern
Wan, Vincent
Cipolla, Roberto
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3382 - 3389
[48] A robust video text detection approach using SVM
Wei, Yi Cheng
Lin, Chang Hong
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (12) : 10832 - 10840
[49] Distributional semantics of objects in visual scenes in comparison to text
Lueddecke, Timo
Agostini, Alejandro
Fauth, Michael
Tamosiunaite, Minija
Woergoetter, Florentin
ARTIFICIAL INTELLIGENCE, 2019, 274 : 44 - 65
[50] Text Detection and Recognition on Traffic Panels From Street-Level Imagery Using Visual Appearance
Gonzalez, Alvaro
Bergasa, Luis M.
Javier Yebes, J.
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (01) : 228 - 238

← 1 2 3 4 5 →