Text Detection by Faster R-CNN with Multiple Region Proposal Networks

Cited by: 15
Authors
Nagaoka, Yoshito [1]
Miyazaki, Tomo [1]
Sugaya, Yoshihiro [1]
Omachi, Shinichiro [1]
Affiliations
[1] Tohoku Univ, Grad Sch Engn, Dept Commun Engn, Sendai, Miyagi, Japan
Keywords
Text detection; Faster R-CNN; Region Proposal Network
DOI
10.1109/ICDAR.2017.343
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a text detection method based on Faster R-CNN that is consistently trainable end to end. The original Faster R-CNN is an end-to-end CNN for fast and accurate object detection. Taking the characteristics of text into account, we propose a novel architecture that makes use of its object detection ability. Whereas the original Faster R-CNN generates regions of interest (RoIs) with a single region proposal network (RPN) using the feature map of the last convolutional layer, the proposed method generates RoIs with multiple RPNs using the feature maps of multiple convolutional layers. These multiresolution feature maps allow the method to detect text of various sizes simultaneously. To aggregate the RoIs, we introduce an RoI-merge layer, which effectively selects valid RoIs from the multiple RPNs. In addition, we propose a training strategy that realizes end-to-end training and makes each RPN specialize in a range of text region sizes. Experimental results on the ICDAR2013/2015 RRC test datasets show that the proposed Multi-RPN method improves detection scores while keeping almost the same detection speed, compared with the original Faster R-CNN and recent methods.
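The abstract does not say how the RoI-merge layer selects valid RoIs. As a rough illustration only, the sketch below assumes one plausible scheme: pool the proposals from all RPNs, sort them by objectness score, apply greedy non-maximum suppression, and keep the top-scoring survivors. The function names, the IoU threshold, and the `top_k` limit are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)
Proposal = Tuple[Box, float]             # (box, objectness score)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def merge_rois(per_rpn: List[List[Proposal]],
               iou_thresh: float = 0.7,
               top_k: int = 300) -> List[Proposal]:
    """Hypothetical RoI merge: pool, sort by score, greedy NMS, keep top_k.

    This is an assumed stand-in for the paper's RoI-merge layer,
    not a reproduction of it.
    """
    # Pool the proposals produced by every RPN into one list.
    pooled = [p for proposals in per_rpn for p in proposals]
    # Highest objectness first, so stronger proposals suppress weaker ones.
    pooled.sort(key=lambda p: p[1], reverse=True)

    kept: List[Proposal] = []
    for box, score in pooled:
        # Keep a box only if it does not overlap a kept box too much.
        if all(iou(box, k[0]) <= iou_thresh for k in kept):
            kept.append((box, score))
            if len(kept) == top_k:
                break
    return kept
```

For example, passing one list of proposals per RPN, such as `merge_rois([rois_from_shallow_rpn, rois_from_deep_rpn])`, returns a single score-ordered list that a downstream detection head could consume. Both argument names are placeholders.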
Pages: 15-20
Number of pages: 6
Related papers
50 in total
  • [1] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
    Ren, Shaoqing
    He, Kaiming
    Girshick, Ross
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
  • [2] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
    Ren, Shaoqing
    He, Kaiming
    Girshick, Ross
    Sun, Jian
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [3] Revisiting Faster R-CNN: A Deeper Look at Region Proposal Network
    Han, Guangxing
    Zhang, Xuan
    Li, Chongrong
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 14 - 24
  • [4] On-road vehicle detection in varying weather conditions using faster R-CNN with several region proposal networks
    Ghosh, Rajib
    [J]. Multimedia Tools and Applications, 2021, 80 : 25985 - 25999
  • [5] On-road vehicle detection in varying weather conditions using faster R-CNN with several region proposal networks
    Ghosh, Rajib
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 25985 - 25999
  • [6] Face Detection with the Faster R-CNN
    Jiang, Huaizu
    Learned-Miller, Erik
    [J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017 : 650 - 657
  • [7] A Detection Method for Liver Cancer Region Based on Faster R-CNN
    Furuzuki, Muki
    Lu, Huimin
    Kim, Hyoungseop
    Hirano, Yasushi
    Mabu, Shingo
    Tanabe, Masahiro
    Kido, Shoji
    [J]. 2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019 : 808 - 811
  • [8] Handwriting Text Recognition Based on Faster R-CNN
    Yang, Junqing
    Ren, Peng
    Kong, Xiaoxiao
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019 : 2450 - 2454
  • [9] Improved Localization Accuracy by LocNet for Faster R-CNN Based Text Detection
    Zhong, Zhuoyao
    Sun, Lei
    Huo, Qiang
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017 : 923 - 928
  • [10] Crack Detection and Comparison Study Based on Faster R-CNN and Mask R-CNN
    Xu, Xiangyang
    Zhao, Mian
    Shi, Peixin
    Ren, Ruiqi
    He, Xuhui
    Wei, Xiaojun
    Yang, Hao
    [J]. SENSORS, 2022, 22 (03)