FTPN: Scene Text Detection With Feature Pyramid Based Text Proposal Network

被引:23
|
作者
Liu, Fagui [1 ]
Chen, Cheng [1 ]
Gu, Dian [1 ]
Zheng, Jingzhong [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
关键词
Scene text detection; multi-orientation; convolutional neural network; recurrent neural network; residual network; LOCALIZATION; RECOGNITION;
D O I
10.1109/ACCESS.2019.2908933
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene text detection is to detect the position of a text in the natural scene, the quality of which will directly affect the subsequent text recognition. It plays an important role in fields such as image retrieval and autopilot. How to perform multi-scale and multi-oriented text detection in the scene still remains as a problem. This paper proposes an effective scene text detection method that combines the convolutional neural network (CNN) and recurrent neural network (RNN). In order to better adapt to texts in different scales, feature pyramid networks (FPN) have been applied in the CNN part to extract multi-scale features of the image. We then utilize bidirectional long-short-term memory (Bi-LSTM) to encode these features to make full use of the text sequence characteristics with the outputs as a series of text proposals. The generated proposals are finally linked into a text line through a well-designed text connector, which can be flexibly adapted to any oriented texts. The proposed method is evaluated on three public datasets: ICDAR2013, ICDAR2015, and USTB-SV1K. For ICDAR2013 and USTB-1K, we have reached 92.5% and 62.6% F-measure, respectively. Our method has reached 72.8% F-measure on the more challenging ICDAR2015 which demonstrates the effectiveness of our method.
引用
收藏
页码:44219 / 44228
页数:10
相关论文
共 50 条
  • [41] Text detection in natural scene images with feature combination
    Ye, Qixiang
    Jiao, Jianbin
    Huang, Jun
    Yu, Hua
    [J]. PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2007, : 397 - 402
  • [42] Adaptive Segmentation Network for Scene Text Detection
    Zhao, Guiqin
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT V, 2023, 14258 : 511 - 522
  • [43] Scene Text Detection Based on Text Probability and Pruning Algorithm
    Zhou, Gang
    Liu, Yajun
    Shi, Fei
    Hu, Ying
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 726 - 735
  • [44] Refinement Correction Network for Scene Text Detection
    Lian, Zhe
    Yin, Yanjun
    Hu, Wei
    Xu, Qiaozhi
    Zhi, Min
    Lu, Jingfang
    Qi, Xuanhao
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VIII, ICIC 2024, 2024, 14869 : 93 - 105
  • [45] Collaborative Learning Network for Scene Text Detection
    Zhang, Xiaoye
    Yue, Yuanhao
    Yang, Yingyi
    Zhang, Xining
    Wang, Wei
    Zou, Qin
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 6788 - 6793
  • [46] Scene text detection by adaptive feature selection with text scale-aware loss
    Wu, Qin
    Luo, Wenli
    Chai, Zhilei
    Guo, Guodong
    [J]. APPLIED INTELLIGENCE, 2022, 52 (01) : 514 - 529
  • [47] Scene text detection by adaptive feature selection with text scale-aware loss
    Qin Wu
    Wenli Luo
    Zhilei Chai
    Guodong Guo
    [J]. Applied Intelligence, 2022, 52 : 514 - 529
  • [48] Rwin-FPN plus plus : Rwin Transformer with Feature Pyramid Network for Dense Scene Text Spotting
    Zeng, Chengbin
    Liu, Yi
    Song, Chunli
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (17):
  • [49] MFECN: Multi-level Feature Enhanced Cumulative Network for Scene Text Detection
    Liu, Zhandong
    Zhou, Wengang
    Li, Houqiang
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (03)
  • [50] A Text-Specific Domain Adaptive Network for Scene Text Detection in the Wild
    He, Xuan
    Yuan, Jin
    Li, Mengyao
    Wang, Runmin
    Wang, Haidong
    Li, Zhiyong
    [J]. APPLIED INTELLIGENCE, 2023, 53 (22) : 26827 - 26839