Real-Time Object Detection With Reduced Region Proposal Network via Multi-Feature Concatenation

被引:42
|
作者
Shih, Kuan-Hung [1 ]
Chiu, Ching-Te [1 ]
Lin, Jiou-Ai [1 ]
Bu, Yen-Yu [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30000, Taiwan
关键词
Object detection; Proposals; Convolution; Computer architecture; Feature extraction; Real-time systems; Deep learning; Multi-feature concatenation; object detection; region proposal network (RPN); weight pruning;
D O I
10.1109/TNNLS.2019.2929059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, object detection became more and more important following the successful results from studies in deep learning. Two types of neural network architectures are used for object detection: one-stage and two-stage. In this paper, we analyze a widely used two-stage architecture called Faster R-CNN to improve the inference time and achieve real-time object detection without compromising on accuracy. To increase the computation efficiency, pruning is first adopted to reduce the weights in convolutional and fully connected (FC) layers. However, this reduces the accuracy of detection. To address this loss in accuracy, we propose a reduced region proposal network (RRPN) with dilated convolution and concatenation of multi-scale features. In the assisted multi-feature concatenation, we propose the intra-layer concatenation and proposal refinement to efficiently integrate the feature maps from different convolutional layers; this is then provided as an input to the RRPN. Using the proposed method, the network can find object bounding boxes more accurately, thus compensating for the loss arising from compression. Finally, we test the proposed architecture using ZF-Net and VGG16 as a backbone network on the image sets in PASCAL VOC 2007 or VOC 2012. The results show that we can compress the parameters of the ZF-Net-based network by 81.2% and save 66% of computation. The parameters of VGG16-based network are compressed by 73% and save 77% of computation. Consequently, the inference speed is improved from 27 to 40 frames/s for ZF-Net and 9 to 27 frames/s for VGG16. Despite significant compression rates, the accuracy of ZF-Net is increased from 2.2% to 60.2% mean average precision (mAP) and that of VGG16 is increased from 2.6% to 69.1% mAP.
引用
收藏
页码:2164 / 2173
页数:10
相关论文
共 50 条
  • [1] REAL-TIME OBJECT DETECTION VIA PRUNING AND A CONCATENATED MULTI-FEATURE ASSISTED REGION PROPOSAL NETWORK
    Shih, Kuan-Hung
    Chiu, Ching-Te
    Pu, Yen-Yu
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1398 - 1402
  • [2] Multi-Feature Concatenation Network for Object Detection
    Yang, Aiping
    Lu, Liyu
    Ji, Zhong
    [J]. Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2020, 53 (06): : 647 - 652
  • [3] REAL-TIME OBJECT DETECTION BY A MULTI-FEATURE FULLY CONVOLUTIONAL NETWORK
    Guo, Yajing
    Guo, Xiaoqiang
    Jiang, Zhuqing
    Men, Aidong
    Zhou, Yun
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 670 - 674
  • [4] Multi-feature fusion Siamese Network for Real-Time Object Tracking
    Zhou, Lijun
    Li, Hongyun
    Zhang, Jianlin
    [J]. PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 478 - 481
  • [5] Real-time Object Tracking with Multi-feature Particle Filter
    Meng, Bo
    Han, Guang-liang
    [J]. 2015 INTERNATIONAL CONFERENCE ON APPLIED MECHANICS AND MECHATRONICS ENGINEERING (AMME 2015), 2015, : 147 - 157
  • [6] A real-time hand detection system based on multi-feature
    Mei, Kuizhi
    Xu, Lu
    Li, Boliang
    Lin, Bin
    Wang, Fang
    [J]. NEUROCOMPUTING, 2015, 158 : 184 - 193
  • [7] Multi-feature aggregation network for salient object detection
    Hu Huang
    Ping Liu
    Yanzhao Wang
    Tongchi Zhou
    Boyang Qu
    Aimin Tao
    Hao Zhang
    [J]. Signal, Image and Video Processing, 2023, 17 : 1043 - 1051
  • [8] Multi-feature aggregation network for salient object detection
    Huang, Hu
    Liu, Ping
    Wang, Yanzhao
    Zhou, Tongchi
    Qu, Boyang
    Tao, Aimin
    Zhang, Hao
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1043 - 1051
  • [9] Real-time multi-feature based fire flame detection in video
    Chi, Rui
    Lu, Zhe-Ming
    Ji, Qing-Ge
    [J]. IET IMAGE PROCESSING, 2017, 11 (01) : 31 - 37
  • [10] A Multi-feature Fusion-based Algorithm for Real-time Single Object Tracking
    Yang, Xiaowei
    Huang, Yingting
    [J]. Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2019, 47 (06): : 1 - 9