Anchor-free multi-orientation text detection in natural scene images

Cited by: 5
Authors
Lu, Liqiong [1]
Wu, Dong [1]
Wu, Tao [1]
Huang, Faliang [2]
Yi, Yaohua [3]
Affiliations
[1] Lingnan Normal Univ, Sch Informat Engn, Zhanjiang 524048, Peoples R China
[2] Nanning Normal Univ, Sch Comp & Informat Engn, Nanning 530001, Peoples R China
[3] Wuhan Univ, Sch Printing & Packaging, Wuhan 430072, Peoples R China
Keywords
Text detection; Natural scene image; Anchor-free; Convolutional Neural Network; LOCALIZATION;
DOI
10.1007/s10489-020-01742-z
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text detection in natural scene images is a key prerequisite for computer vision tasks such as image search, navigation for the blind, autonomous driving, and multi-language translation. Existing text detection methods often detect only part of a large-scale text region and have difficulty detecting small-scale text. To address this problem, an anchor-free multi-orientation text detection method is proposed. First, a Feature Pyramid Network (FPN) combines multiple feature layers of a Convolutional Neural Network (CNN) to predict the geometric properties of text; this expands the receptive field of each pixel and thus helps to detect more large-scale text. Second, a new loss function that is independent of text scale is designed; it gives the pixels in small-scale text a larger weight in the loss calculation, which facilitates the detection of small-scale text. Finally, the results of pixel-level semantic segmentation are used to filter out obviously unreasonable candidate text boxes, which improves both the accuracy and the recall of text detection. Experimental results on ICDAR 2015 and MSRA-TD500 demonstrate the good performance of the method.
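The second step of the abstract, a loss that does not depend on text scale, can be illustrated with a small sketch. The abstract does not give the formula, so the weighting used here (each pixel weighted by the inverse of its text instance's area, so that every instance contributes equally) is an assumption, as are the function names `scale_balanced_weights` and `weighted_pixel_loss`. It is not the authors' loss, only one common way to obtain the stated effect.

```python
from collections import Counter


def scale_balanced_weights(instance_map):
    """Per-pixel weights such that each text instance sums to a total weight of 1.

    instance_map: 2D list of ints; 0 is background, k > 0 labels text instance k.
    Assumption: weight = 1 / (pixel area of the instance); background gets 0.
    """
    areas = Counter(label for row in instance_map for label in row if label > 0)
    return [[1.0 / areas[label] if label > 0 else 0.0 for label in row]
            for row in instance_map]


def weighted_pixel_loss(per_pixel_loss, instance_map):
    """Average per-pixel losses so that every text instance counts equally."""
    weights = scale_balanced_weights(instance_map)
    n_instances = len({label for row in instance_map for label in row if label > 0})
    if n_instances == 0:
        return 0.0
    total = sum(loss * w
                for loss_row, w_row in zip(per_pixel_loss, weights)
                for loss, w in zip(loss_row, w_row))
    return total / n_instances


# Instance 1 covers four pixels, instance 2 covers a single pixel.
instance_map = [[1, 1, 1, 1],
                [0, 0, 0, 2]]
# Only the one-pixel instance is predicted badly.
per_pixel_loss = [[0.0, 0.0, 0.0, 0.0],
                  [0.0, 0.0, 0.0, 1.0]]
balanced = weighted_pixel_loss(per_pixel_loss, instance_map)
```

In this toy case a plain mean over the five text pixels would give 0.2, while the balanced version gives 0.5, so the error on the small instance is not diluted by the larger one.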
Pages: 3623-3637
Number of pages: 15
Related papers
50 in total
  • [21] Text Detection and Recognition in Natural Scene Images
    Pise, Amruta
    Ruikar, S. D.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [22] Scene Text Detection in Natural Images: A Review
    Cao, Dongping
    Zhong, Yong
    Wang, Lishun
    He, Yilong
    Dang, Jiachen
    SYMMETRY-BASEL, 2020, 12 (12): : 1 - 26
  • [23] Uyghur Text Detection in Natural Scene Images
    Li, Xinming
    Li, Junfang
    Gao, Qiag
    Yu, Xiao
    2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2019, : 1542 - 1547
  • [24] LASDNET: A LIGHTWEIGHT ANCHOR-FREE SHIP DETECTION NETWORK FOR SAR IMAGES
    Zhou, Lifan
    Yu, Hanwen
    Wang, Yong
    Xu, Shaojie
    Gong, Shengrong
    Xing, Mengdao
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 2630 - 2633
  • [25] A cascaded method for text detection in natural scene images
    Zheng, Yang
    Li, Qing
    Liu, Jie
    Liu, Heping
    Li, Gen
    Zhang, Shuwu
    NEUROCOMPUTING, 2017, 238 : 307 - 315
  • [26] Cascade Detector for Text Detection in Natural Scene Images
    Hanif, Shehzad Muhammad
    Prevost, Lionel
    Negri, Pablo Augusto
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1917 - +
  • [27] Fast and Accurate Text Detection in Natural Scene Images
    Xiao, Chengqiu
    Ji, Lixin
    Gao, Chao
    Li, Shaomei
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: IMAGE AND VIDEO DATA ENGINEERING, ISCIDE 2015, PT I, 2015, 9242 : 1 - 10
  • [28] BANet: A Balance Attention Network for Anchor-Free Ship Detection in SAR Images
    Hu, Qi
    Hu, Shaohai
    Liu, Shuaiqi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [29] Integrated Method for Text Detection in Natural Scene Images
    Zheng, Yang
    Liu, Jie
    Liu, Heping
    Li, Qing
    Li, Gen
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (11): : 5583 - 5604
  • [30] Anchor-Free Multi-UAV Detection and Classification Using Spectrogram
    Zhao, Runyi
    Li, Tao
    Li, Yongzhao
    Ruan, Yuhan
    Zhang, Rui
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (03) : 5259 - 5272