Multiscale Feature Fusion-Based Object Detection Algorithm

Cited by: 0
Authors
Tao, Zhang [1]
Le, Zhang [1]
Affiliation
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
Keywords
machine vision; convolution neural network; object detection; feature pyramid; feature fusion
DOI
10.3788/LOP202158.0215003
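Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809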
Abstract
The deep-learning-based object detectors RetinaNet and Libra RetinaNet use feature pyramid networks to fuse multiscale features, but their feature fusion is insufficient. This paper proposes a multiscale feature fusion algorithm that extends Libra RetinaNet. Two independent feature fusion modules are constructed by establishing two bottom-up paths, and the outputs of the two modules are fused with the original predicted features to improve the accuracy of the detector. The multiscale feature fusion module is combined with Libra RetinaNet to build an object detector, which is evaluated on different datasets. Experimental results show that, compared with the Libra RetinaNet detector, the detector with the added module improves average accuracy on the PASCAL VOC and MSCOCO datasets by 2.2 and 1.3 percentage points, respectively.
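The fusion idea summarized in the abstract can be illustrated with a small, dependency-free sketch. It is not the authors' implementation: the abstract does not say how the two bottom-up paths differ or how their outputs are merged, so the max-pooling and mean-pooling paths, the element-wise averaging, and all function names below are assumptions chosen only for illustration.

```python
# Illustrative sketch only. The pooling choices, the averaging fusion, and all
# names here are assumptions, not details taken from the paper.

def downsample(fmap, reduce):
    """Halve a 2D feature map by applying `reduce` to each 2x2 block."""
    h, w = len(fmap), len(fmap[0])
    return [
        [reduce([fmap[r][c], fmap[r][c + 1], fmap[r + 1][c], fmap[r + 1][c + 1]])
         for c in range(0, w, 2)]
        for r in range(0, h, 2)
    ]

def add(a, b):
    """Element-wise sum of two equally sized 2D maps."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mean(values):
    return sum(values) / len(values)

def bottom_up_path(pyramid, reduce):
    """Propagate fine-level information upward through the pyramid.

    `pyramid` is ordered fine to coarse, each level half the size of the
    previous one. Each output level is the input level plus the downsampled
    previous output level.
    """
    out = [pyramid[0]]
    for level in pyramid[1:]:
        out.append(add(level, downsample(out[-1], reduce)))
    return out

def fuse(pyramid):
    """Average the original features with two independent bottom-up paths."""
    path_a = bottom_up_path(pyramid, max)   # assumed: max-pooling path
    path_b = bottom_up_path(pyramid, mean)  # assumed: mean-pooling path
    return [
        [[(p + a + b) / 3 for p, a, b in zip(rp, ra, rb)]
         for rp, ra, rb in zip(lp, la, lb)]
        for lp, la, lb in zip(pyramid, path_a, path_b)
    ]

# Toy three-level pyramid (4x4, 2x2, 1x1) with made-up values.
pyramid = [
    [[1.0, 2.0, 3.0, 4.0], [5.0, 6.0, 7.0, 8.0],
     [1.0, 1.0, 1.0, 1.0], [2.0, 2.0, 2.0, 2.0]],
    [[1.0, 1.0], [1.0, 1.0]],
    [[0.5]],
]
fused = fuse(pyramid)
```

In this toy version each fused level keeps the spatial size of the corresponding input level, so it could feed the same prediction heads; a real detector would operate on multi-channel tensors and use learned convolutions rather than fixed pooling.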
Pages: 7