FTPN: Scene Text Detection With Feature Pyramid Based Text Proposal Network

被引:23
|
作者
Liu, Fagui [1 ]
Chen, Cheng [1 ]
Gu, Dian [1 ]
Zheng, Jingzhong [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
关键词
Scene text detection; multi-orientation; convolutional neural network; recurrent neural network; residual network; LOCALIZATION; RECOGNITION;
D O I
10.1109/ACCESS.2019.2908933
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene text detection is to detect the position of a text in the natural scene, the quality of which will directly affect the subsequent text recognition. It plays an important role in fields such as image retrieval and autopilot. How to perform multi-scale and multi-oriented text detection in the scene still remains as a problem. This paper proposes an effective scene text detection method that combines the convolutional neural network (CNN) and recurrent neural network (RNN). In order to better adapt to texts in different scales, feature pyramid networks (FPN) have been applied in the CNN part to extract multi-scale features of the image. We then utilize bidirectional long-short-term memory (Bi-LSTM) to encode these features to make full use of the text sequence characteristics with the outputs as a series of text proposals. The generated proposals are finally linked into a text line through a well-designed text connector, which can be flexibly adapted to any oriented texts. The proposed method is evaluated on three public datasets: ICDAR2013, ICDAR2015, and USTB-SV1K. For ICDAR2013 and USTB-1K, we have reached 92.5% and 62.6% F-measure, respectively. Our method has reached 72.8% F-measure on the more challenging ICDAR2015 which demonstrates the effectiveness of our method.
引用
收藏
页码:44219 / 44228
页数:10
相关论文
共 50 条
  • [1] Towards Accurate Scene Text Detection with Bidirectional Feature Pyramid Network
    Cao, Dongping
    Dang, Jiachen
    Zhong, Yong
    [J]. SYMMETRY-BASEL, 2021, 13 (03):
  • [2] Feature Pyramid Based Scene Text Detector
    En, MengYi
    Li, Rong
    Li, JianQiang
    Liu, Bo
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 6, 2017, : 3 - 8
  • [3] Natural scene text detection based on multiscale connectionist text proposal network
    Huang, Min
    Lan, Chaohao
    Huang, Wei
    Tao, Yang
    [J]. JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13): : 326 - 329
  • [4] Feature Fusion Pyramid Network for End-to-End Scene Text Detection
    Wu, Yirui
    Zhang, Lilai
    Li, Hao
    Zhang, Yunfei
    Wan, Shaohua
    [J]. ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, 23 (11)
  • [5] BDFPN: Bi-Direction Feature Pyramid Network for Scene Text Detection
    Shao, Hai-Lin
    Ji, Yi
    Li, Ying
    Liu, Chun-Ping
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [6] Robust Scene Text Detection with Deep Feature Pyramid Network and CNN based NMS Model
    Mohanty, Sabyasachi
    Dutta, Tanima
    Gupta, Hari Prabhat
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3741 - 3746
  • [7] Scene Text Detection with Supervised Pyramid Context Network
    Xie, Enze
    Zang, Yuhang
    Shao, Shuai
    Yu, Gang
    Yao, Cong
    Li, Guangyao
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9038 - 9045
  • [8] Scene text detection via decoupled feature pyramid networks
    Liang, Min
    Hou, Jie-Bo
    Zhu, Xiaobin
    Yang, Chun
    Qin, Jingyan
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (3) : 163 - 175
  • [9] Scene text detection via decoupled feature pyramid networks
    Min Liang
    Jie-Bo Hou
    Xiaobin Zhu
    Chun Yang
    Jingyan Qin
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 163 - 175
  • [10] Max-Pooling based Scene Text Proposal for Scene Text Detection
    Dinh Nguyen Van
    Lu, Shijian
    Bai, Xiang
    Ouarti, Nizar
    Mokhtari, Mounir
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1295 - 1300