Natural Scene Text Detection Based on Deep Supervised Fully Convolutional Network

被引:0
|
作者
Zhang, Nan [1 ]
Jin, Xiaoning [1 ]
Li, Xiaowei [1 ]
机构
[1] Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China
关键词
Scene image; Multi-oriented text; Deep supervision;
D O I
10.1007/978-3-030-00764-5_40
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the past few years, text detection in natural scenes has attracted increasing attention due to many real-world applications. Most existing methods only detect horizontal or nearly horizontal texts and have complicated processes. When using the neural network to detect text in the image, some ambiguity and small words are easy to be ignored because of many pooling operations. Therefore, this paper proposes an end-to-end trainable neural network for detecting multi-oriented text lines or words in natural scene images. The network fuses multi-level features and is guided by deep supervision during training. In this way, richer hierarchical representations can be learned automatically. The network makes two kinds of predictions: text/no text classification and location regression, thus we can directly locate multi-oriented words or text lines without other unnecessary intermediate steps. Experimental results on the ICDAR 2015 datasets and MSRA-TD500 datasets have proven that the proposed method outperforms the state-of-the-art methods by a noticeable margin on F-score.
引用
收藏
页码:439 / 448
页数:10
相关论文
共 50 条
  • [1] FDTA: Fully Convolutional Scene Text Detection With Text Attention
    Cao, Yongcun
    Ma, Shuaisen
    Pan, Haichuan
    IEEE ACCESS, 2020, 8 : 155441 - 155449
  • [2] Scene text detection with fully convolutional neural networks
    Zhandong Liu
    Wengang Zhou
    Houqiang Li
    Multimedia Tools and Applications, 2019, 78 : 18205 - 18227
  • [3] Scene text detection with fully convolutional neural networks
    Liu, Zhandong
    Zhou, Wengang
    Li, Houqiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (13) : 18205 - 18227
  • [4] Irregular Scene Text Detection Based on a Graph Convolutional Network
    Zhang, Shiyu
    Zhou, Caiying
    Li, Yonggang
    Zhang, Xianchao
    Ye, Lihua
    Wei, Yuanwang
    SENSORS, 2023, 23 (03)
  • [5] Text Detection in Natural Scene Images Based on Enhanced Receptive Field and Fully Convolution Network
    Li X.-Y.
    Song Y.-H.
    Yu T.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (03): : 797 - 807
  • [6] Multi-lingual Scene Text Detection Based on Fully Convolutional Networks
    Liu, Shaohua
    Shang, Yan
    Han, Jizhong
    Wang, Xi
    Gao, Hongchao
    Liu, Dongqin
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 423 - 432
  • [7] TEXNET: A DEEP CONVOLUTIONAL NEURAL NETWORK MODEL TO RECOGNIZE TEXT IN NATURAL SCENE IMAGES
    KAVITHA, D.
    RADHA, V.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2021, 16 (02): : 1782 - 1799
  • [8] A Novel Scene Text Detection Algorithm Based On Convolutional Neural Network
    Ren, Xiaohang
    Chen, Kai
    Yang, Xiaokang
    Zhou, Yi
    He, Jianhua
    Sun, Jun
    2016 30TH ANNIVERSARY OF VISUAL COMMUNICATION AND IMAGE PROCESSING (VCIP), 2016,
  • [9] CRF based text detection for natural scene images using convolutional neural network and context information
    Wang, Yanna
    Shi, Cunzhao
    Xiao, Baihua
    Wang, Chunheng
    Qi, Chengzuo
    NEUROCOMPUTING, 2018, 295 : 46 - 58
  • [10] Deep Residual Text Detection Network for Scene Text
    Zhu, Xiangyu
    Jiang, Yingying
    Yang, Shuli
    Wang, Xiaobing
    Li, Wei
    Fu, Pei
    Wang, Hua
    Luo, Zhenbo
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 807 - 812