Anchor-free multi-orientation text detection in natural scene images

Cited by: 5
Authors
Lu, Liqiong [1]
Wu, Dong [1]
Wu, Tao [1]
Huang, Faliang [2]
Yi, Yaohua [3]
Affiliations
[1] Lingnan Normal Univ, Sch Informat Engn, Zhanjiang 524048, Peoples R China
[2] Nanning Normal Univ, Sch Comp & Informat Engn, Nanning 530001, Peoples R China
[3] Wuhan Univ, Sch Printing & Packaging, Wuhan 430072, Peoples R China
Keywords
Text detection; Natural scene image; Anchor-free; Convolutional neural network; Localization
DOI
10.1007/s10489-020-01742-z
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Text detection in natural scene images is a key prerequisite for computer vision tasks such as image search, navigation for the blind, autonomous driving, and multi-language translation. Existing text detection methods often detect only part of a large-scale text region and have difficulty detecting small-scale text. To address this problem, an anchor-free multi-orientation text detection method is proposed. First, a Feature Pyramid Network (FPN) combines multiple feature layers of a Convolutional Neural Network (CNN) to predict the geometric properties of text; this expands the receptive field of each pixel and thus helps detect more large-scale text. Second, a new loss function that is independent of text scale is designed, which gives the pixels in small-scale text a larger weight in the loss calculation and thereby facilitates the detection of small-scale text. Finally, the results of pixel-level semantic segmentation are used to filter out obviously unreasonable candidate text boxes, improving both the accuracy and the recall of text detection. Experimental results on ICDAR 2015 and MSRA-TD500 demonstrate the good performance of the method.
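The abstract does not give the form of the scale-independent loss, so the following is only an illustrative sketch of the general idea, not the authors' formulation. It assumes a hypothetical instance map (0 for background, k >= 1 for pixels of text instance k) and assigns per-pixel weights so that every text instance contributes the same total weight, which gives pixels in small text a larger weight than pixels in large text. The function names and the weighted L1 loss are assumptions made for illustration.

```python
from collections import Counter
from typing import List


def scale_normalized_weights(instance_map: List[List[int]]) -> List[List[float]]:
    """Return per-pixel weights such that each text instance sums to the same total.

    instance_map[y][x] is 0 for background, or k >= 1 for pixels of text instance k
    (a hypothetical input layout, assumed for this sketch).
    """
    # Pixel count (area) of every text instance.
    areas = Counter(v for row in instance_map for v in row if v > 0)
    if not areas:
        return [[0.0 for _ in row] for row in instance_map]
    # Each instance receives an equal share of the total number of text pixels.
    share = sum(areas.values()) / len(areas)
    return [
        [share / areas[v] if v > 0 else 0.0 for v in row]
        for row in instance_map
    ]


def weighted_l1_loss(
    pred: List[List[float]],
    target: List[List[float]],
    weights: List[List[float]],
) -> float:
    """Weighted mean absolute error over all pixels with non-zero weight."""
    total_weight = sum(w for row in weights for w in row)
    if total_weight == 0:
        return 0.0
    total = sum(
        w * abs(p - t)
        for p_row, t_row, w_row in zip(pred, target, weights)
        for p, t, w in zip(p_row, t_row, w_row)
    )
    return total / total_weight


# Instance 1 covers 8 pixels, instance 2 covers 2 pixels.
instance_map = [
    [1, 1, 1, 1],
    [1, 1, 1, 1],
    [0, 0, 2, 2],
]
weights = scale_normalized_weights(instance_map)
print(weights[0][0], weights[2][2])  # 0.625 2.5
```

In this toy example each pixel of the small instance is weighted four times as heavily as each pixel of the large one, so both instances contribute a total weight of 5 to the loss.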
Pages: 3623-3637
Page count: 15
Related papers
50 in total
  • [1] Anchor-free multi-orientation text detection in natural scene images
    Liqiong Lu
    Dong Wu
    Tao Wu
    Faliang Huang
    Yaohua Yi
    Applied Intelligence, 2020, 50 : 3623 - 3637
  • [2] Multi-Orientation Scene Text Detection with Adaptive Clustering
    Yin, Xu-Cheng
    Pei, Wei-Yi
    Zhang, Jun
    Hao, Hong-Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) : 1930 - 1937
  • [3] Anchor-Free Braille Character Detection Based on Edge Feature in Natural Scene Images
    Lu, Liqiong
    Wu, Dong
    Xiong, Jianfang
    Liang, Zhou
    Huang, Faliang
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [4] Multi-Orientation Scene Text Detection with Multi-Information Fusion
    Pei, Wei-Yi
    Yang, Chun
    Kau, Lih-Jen
    Yin, Xu-Cheng
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 657 - 662
  • [5] Multi-orientation Scene Text Detection Leveraging Background Suppression
    Wang, Xihan
    Feng, Xiaoyi
    Xia, Zhaoqiang
    Peng, Jinye
    Granger, Eric
    IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 555 - 566
  • [6] Multi-orientation scene text detection with scale-guided regression
    Liang, Min
    Hou, Jie-Bo
    Zhu, Xiaobin
    Yang, Chun
    Qin, Jingyan
    Yin, Xu-Cheng
    NEUROCOMPUTING, 2021, 461 : 310 - 318
  • [8] Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection
    Liu, Yuliang
    He, Tong
    Chen, Hao
    Wang, Xinyu
    Luo, Canjie
    Zhang, Shuaitao
    Shen, Chunhua
    Jin, Lianwen
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (06) : 1972 - 1992
  • [9] Tracking Based Multi-Orientation Scene Text Detection: A Unified Framework With Dynamic Programming
    Yang, Chun
    Yin, Xu-Cheng
    Pei, Wei-Yi
    Tian, Shu
    Zuo, Ze-Yu
    Zhu, Chao
    Yan, Junchi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (07) : 3235 - 3248
  • [10] Multi-Orientation Text Detection by Skeletonization (MOTDS)
    Azadboni, Mohammad Khodadadi
    Samadhiya, Aditi
    Khatri, Pallavi
    PROCEEDINGS OF 2014 2ND INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2014, : 5 - 9