Multi-level refinement enriched feature pyramid network for object detection

被引:15
|
作者
Aziz, Lubna [1 ,2 ]
Salam, Md. Sah Bin Haji F. C. [1 ]
Ayub, Sara [3 ]
机构
[1] Univ Teknol Malaysia, Sch Comp, Div Artificial Intelligence, Fac Engn, Skudai 81310, Johor, Malaysia
[2] FICT BUITEMS, Dept Comp Engn, Lahore, Pakistan
[3] Univ Teknol Malaysia, Dept Elect Engn, Fac Engn, Skudai 81310, Johor, Malaysia
关键词
CNN; Object detection; Chained parallel pooling; Computer vision; Feature pyramid;
D O I
10.1016/j.imavis.2021.104287
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class Imbalance and scales imbalance are common in object detection. A class imbalance occurs due to insufficient inequality between the number of instances with respect to different classes, while an imbalance in scale occurs when object have different scales and a different number of examples of different scales. In order to solve the problem of scale variance (scale imbalance) and class imbalance together, we propose a simple and effective feature enhancement scheme that explicitly uses all information of a multi-level structure to generate a multilevel contextual features pyramid with multiple scales. We also introduce a cascaded refinement scheme that incorporates multi-scale contextual features into the Single Shot Detector (SSD) predictive layers to improve their distinctiveness for multi-scale detection. A stack of multi-scale contextual feature modules is used in a feature enhancement scheme to merge the multi-level and multi-scale features. Then we collect the equivalent scale features over the Multi-layer Feature Fusion (MLFF) unit to construct a feature pyramid in which each feature map is made up of layers from multiple levels. More robustness and contextual information are integrated into the pyramid through chain parallel pooling operation. To improve classification and regression, a cascaded refinement scheme is proposed that effectively captures a large amount of contextual information and refines the anchors to solve the class imbalance problem. The experiments are carried out on two benchmarks datasets: MS COCO and PASCAL VOC 07/12. Our proposed approach achieves state-of-the-art accuracy with an AP of 40.6 in the case of multi-scale inference on MS COCO Test-dev (input size 320 x 320). For 512 x 512 input on the MS COCO Test-dev, our approach leads in an absolute gain in precision of 1.8% compared to the best reported results of single-stage detector (AP: 45.7). (c) 2021 Elsevier B.V. All rights reserved.
引用
下载
收藏
页数:11
相关论文
共 50 条
  • [31] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [32] A multi-level feature weight fusion model for salient object detection
    Zhang, Shanqing
    Chen, Yujie
    Meng, Yiheng
    Lu, Jianfeng
    Li, Li
    Bai, Rui
    MULTIMEDIA SYSTEMS, 2023, 29 (03) : 887 - 895
  • [33] A multi-level feature weight fusion model for salient object detection
    Zhang Shanqing
    Chen Yujie
    Meng Yiheng
    Lu Jianfeng
    Li Li
    Bai Rui
    Multimedia Systems, 2023, 29 : 887 - 895
  • [34] SFPN: Semantic Feature Pyramid Network for Object Detection
    Gan, Yi
    Xu, Wei
    Su, Jianbo
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 795 - 802
  • [35] Bidirectional Matrix Feature Pyramid Network for Object Detection
    Xu, Wei
    Gan, Yi
    Su, Jianbo
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8000 - 8007
  • [36] Bidirectional Parallel Feature Pyramid Network for Object Detection
    Zhang, Zhengning
    Zhang, Lin
    Wang, Yue
    Feng, Pengming
    Sun, Baochen
    IEEE ACCESS, 2022, 10 : 49422 - 49432
  • [37] Attentional feature pyramid network for small object detection
    Min, Kyungseo
    Lee, Gun-Hee
    Lee, Seong-Whan
    NEURAL NETWORKS, 2022, 155 : 439 - 450
  • [38] Adaptively Dense Feature Pyramid Network for Object Detection
    Pan, Haodong
    Chen, Guangfeng
    Jiang, Jue
    IEEE ACCESS, 2019, 7 : 81132 - 81144
  • [39] Extended Feature Pyramid Network for Small Object Detection
    Deng, Chunfang
    Wang, Mengmeng
    Liu, Liang
    Liu, Yong
    Jiang, Yunliang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1968 - 1979
  • [40] GraphFPN: Graph Feature Pyramid Network for Object Detection
    Zhao, Gangming
    Ge, Weifeng
    Yu, Yizhou
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2743 - 2752