Scene text detection via extremal region based double threshold convolutional network classification

被引:9
|
作者
Zhu, Wei [1 ]
Lou, Jing [1 ]
Chen, Longtao [1 ]
Xia, Qingyuan [1 ]
Ren, Mingwu [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
来源
PLOS ONE | 2017年 / 12卷 / 08期
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
READING TEXT; IMAGES; LOCALIZATION; RECOGNITION; REPRESENTATION; FACE;
D O I
10.1371/journal.pone.0182227
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, we present a robust text detection approach in natural images which is based on region proposal mechanism. A powerful low-level detector named saliency enhanced-MSER extended from the widely-used MSER is proposed by incorporating saliency detection methods, which ensures a high recall rate. Given a natural image, character candidates are extracted from three channels in a perception-based illumination invariant color space by saliency-enhanced MSER algorithm. A discriminative convolutional neural network (CNN) is jointly trained with multi-level information including pixel-level and character-level information as character candidate classifier. Each image patch is classified as strong text, weak text and non-text by double threshold filtering instead of conventional one-step classification, leveraging confident scores obtained via CNN. To further prune non-text regions, we develop a recursive neighborhood search algorithm to track credible texts from weak text set. Finally, characters are grouped into text lines using heuristic features such as spatial location, size, color, and stroke width. We compare our approach with several state-of-the-art methods, and experiments show that our method achieves competitive performance on public datasets ICDAR 2011 and ICDAR 2013.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Specific category region proposal network for text detection in natural scene
    Zhong, Yuanhong
    Cheng, Xinyu
    Zhou, Zhaokun
    Zhang, Shun
    Zhang, Jing
    Huang, Guan
    IET IMAGE PROCESSING, 2020, 14 (09) : 1832 - 1839
  • [32] A Convolutional Neural Network-Based Chinese Text Detection Algorithm via Text Structure Modeling
    Ren, Xiaohang
    Zhou, Yi
    He, Jianhua
    Chen, Kai
    Yang, Xiaokang
    Sun, Jun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (03) : 506 - 518
  • [33] PRPN: Progressive region prediction network for natural scene text detection
    Zhong, Yuanhong
    Cheng, Xinyu
    Chen, Tao
    Zhang, Jing
    Zhou, Zhaokun
    Huang, Guan
    KNOWLEDGE-BASED SYSTEMS, 2022, 236
  • [34] Double supervision for scene text detection and recognition based on BMINet
    Wan, Hanyang
    Liu, Ruoyun
    Yu, Li
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 130
  • [35] DeepScene: Scene classification via convolutional neural network with spatial pyramid pooling
    Yee, Pui Sin
    Lim, Kian Ming
    Lee, Chin Poo
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193
  • [36] Text Classification Based on Convolutional Neural Network and Attention Model
    Yang, Shuang
    Tang, Yan
    2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2020), 2020, : 67 - 73
  • [37] News Text Classification Based on an Improved Convolutional Neural Network
    Tao, Wenjing
    Chang, Dan
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (05): : 1400 - 1409
  • [38] An Integration Model Based on Graph Convolutional Network for Text Classification
    Tang, Hengliang
    Mi, Yuan
    Xue, Fei
    Cao, Yang
    IEEE ACCESS, 2020, 8 : 148865 - 148876
  • [39] TextConvoNet: a convolutional neural network based architecture for text classification
    Soni, Sanskar
    Chouhan, Satyendra Singh
    Rathore, Santosh Singh
    APPLIED INTELLIGENCE, 2023, 53 (11) : 14249 - 14268
  • [40] The Study on the Text Classification Based on Graph Convolutional Network and BiLSTM
    Xue, Bingxin
    Zhu, Cui
    Wang, Xuan
    Zhu, Wenjun
    APPLIED SCIENCES-BASEL, 2022, 12 (16):