Scene text detection via extremal region based double threshold convolutional network classification

被引:9
|
作者
Zhu, Wei [1 ]
Lou, Jing [1 ]
Chen, Longtao [1 ]
Xia, Qingyuan [1 ]
Ren, Mingwu [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
来源
PLOS ONE | 2017年 / 12卷 / 08期
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
READING TEXT; IMAGES; LOCALIZATION; RECOGNITION; REPRESENTATION; FACE;
D O I
10.1371/journal.pone.0182227
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, we present a robust text detection approach in natural images which is based on region proposal mechanism. A powerful low-level detector named saliency enhanced-MSER extended from the widely-used MSER is proposed by incorporating saliency detection methods, which ensures a high recall rate. Given a natural image, character candidates are extracted from three channels in a perception-based illumination invariant color space by saliency-enhanced MSER algorithm. A discriminative convolutional neural network (CNN) is jointly trained with multi-level information including pixel-level and character-level information as character candidate classifier. Each image patch is classified as strong text, weak text and non-text by double threshold filtering instead of conventional one-step classification, leveraging confident scores obtained via CNN. To further prune non-text regions, we develop a recursive neighborhood search algorithm to track credible texts from weak text set. Finally, characters are grouped into text lines using heuristic features such as spatial location, size, color, and stroke width. We compare our approach with several state-of-the-art methods, and experiments show that our method achieves competitive performance on public datasets ICDAR 2011 and ICDAR 2013.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] AUTOMATIC RADAR-BASED GESTURE DETECTION AND CLASSIFICATION VIA A REGION-BASED DEEP CONVOLUTIONAL NEURAL NETWORK
    Sun, Yuliang
    Fei, Tai
    Gao, Shangyin
    Pohl, Nils
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4300 - 4304
  • [22] Scene text detection method research based on maximally stable extremal regions
    Xu, Lei
    Liu, Yi
    Mou, Lianming
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2022, 15 (02) : 142 - 154
  • [23] Text Detection in Low Resolution Scene Images Using Convolutional Neural Network
    Risnumawan, Anhar
    Sulistijono, Indra Adji
    Abawajy, Jemal
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING, 2017, 549 : 366 - 375
  • [24] A Convolutional Neural Network Based on Grouping Structure for Scene Classification
    Wu, Xuan
    Zhang, Zhijie
    Zhang, Wanchang
    Yi, Yaning
    Zhang, Chuanrong
    Xu, Qiang
    REMOTE SENSING, 2021, 13 (13)
  • [25] Aksara Jawa Text Detection in Scene Images using Convolutional Neural Network
    Afakh, Muhammad Labiyb
    Risnumawan, Anhar
    Anggraeni, Martianda Erste
    Tamara, Mohamad Nasyir
    Ningrum, Endah Suryawati
    2017 INTERNATIONAL ELECTRONICS SYMPOSIUM ON KNOWLEDGE CREATION AND INTELLIGENT COMPUTING (IES-KCIC), 2017, : 77 - 82
  • [26] A New Unsupervised Convolutional Neural Network Model for Chinese Scene Text Detection
    Ren, Xiaohang
    Chen, Kai
    Yang, Xiaokang
    Zhou, Yi
    He, Jianhua
    Sun, Jun
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 428 - 432
  • [27] CRF based text detection for natural scene images using convolutional neural network and context information
    Wang, Yanna
    Shi, Cunzhao
    Xiao, Baihua
    Wang, Chunheng
    Qi, Chengzuo
    NEUROCOMPUTING, 2018, 295 : 46 - 58
  • [28] Conceptual text region network: Cognition-inspired accurate scene text detection
    Cui, Chenwei
    Lu, Liangfu
    Tan, Zhiyuan
    Hussain, Amir
    NEUROCOMPUTING, 2021, 464 : 252 - 264
  • [29] Multiorientation scene text detection via coarse-to-fine supervision-based convolutional networks
    Wang, Xihan
    Xia, Zhaoqiang
    Peng, Jinye
    Feng, Xiaoyi
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (03)
  • [30] FDTA: Fully Convolutional Scene Text Detection With Text Attention
    Cao, Yongcun
    Ma, Shuaisen
    Pan, Haichuan
    IEEE ACCESS, 2020, 8 : 155441 - 155449