DCIFPN: Deformable cross-scale interaction feature pyramid network for object detection

被引:1
|
作者
Xiao, Junrui [1 ,2 ]
Jiang, He [1 ,2 ]
Li, Zhikai [1 ,2 ]
Gu, Qingyi [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Ctr Precis Sensing & Control, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Zhongguancun South 1st Alley, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
feature extraction; object detection;
D O I
10.1049/ipr2.12800
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploiting multi-scale features is one of the most effective methods to recognize objects of different scales in object detection. Since image pyramid is time-consuming, Feature Pyramid Network (FPN) becomes the most popular component used for obtaining pyramidal features. Despite its effectiveness, there still exist some intrinsic defects. In this work, it is attributed to insufficient information flow and a Deformable Cross-scale Interaction Feature Pyramid Network (DCIFPN) is proposed, which aims to promote the information transfer process with content-aware sampling and dynamic aggregation weights. More specifically, Deformable Semantic Enhancement Module (DSEM) is designed that can construct accurate information flow with dynamic aggregation weights. In addition, Deformable Spatial Refinement Module (DSRM) is proposed to enhance high-level features with low-level location details. When DCIFPN is deployed on RetinaNet and FCOS with ResNet-50, the performance is improved by 1.6 AP and 1.1 AP, respectively, on the challenging MS COCO benchmark. Apart from one-stage detectors, DCIFPN is also applicable to two-stage methods such as Faster R-CNN and Mask R-CNN. Further experiments on Pascal VOC and CrowdHuman datasets can verify the effectiveness and generalization of the method.
引用
收藏
页码:2596 / 2610
页数:15
相关论文
共 50 条
  • [1] RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection
    Zong, Zhuofan
    Cao, Qianggang
    Leng, Biao
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5637 - 5645
  • [2] Feature-transferable pyramid network for cross-scale object detection in sar images
    Zhou, Zheng
    Cui, Zongyong
    Cao, Zongjie
    Yang, Jianyu
    [J]. Journal of Radars, 2021, 10 (04) : 544 - 558
  • [3] Cross-scale global attention feature pyramid network for person search
    Li, Yang
    Xu, Huahu
    Bian, Minjie
    Xiao, Junsheng
    [J]. IMAGE AND VISION COMPUTING, 2021, 116
  • [4] Cross-scale resolution consistent network for salient object detection
    Huang, Xiaoyu
    Liu, Wei
    Li, Minghui
    Nie, Hangyu
    [J]. IET IMAGE PROCESSING, 2024, 18 (10) : 2788 - 2799
  • [5] SEFPN: Scale-Equalizing Feature Pyramid Network for Object Detection
    Zhang, Zhiqiang
    Qiu, Xin
    Li, Yongzhou
    [J]. SENSORS, 2021, 21 (21)
  • [6] Cross-Layer Feature Pyramid Network for Salient Object Detection
    Li, Zun
    Lang, Congyan
    Liew, Jun Hao
    Li, Yidong
    Hou, Qibin
    Feng, Jiashi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4587 - 4598
  • [7] CSFFNet: Lightweight cross-scale feature fusion network for salient object detection in remote sensing images
    Wang, Longbao
    Long, Chong
    Li, Xin
    Tang, Xiaodan
    Bai, Zhipeng
    Gao, Hongmin
    [J]. IET IMAGE PROCESSING, 2024, 18 (03) : 602 - 614
  • [8] Cross-Scale Feature Fusion for Object Detection in Optical Remote Sensing Images
    Cheng, Gong
    Si, Yongjie
    Hong, Hailong
    Yao, Xiwen
    Guo, Lei
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (03) : 431 - 435
  • [9] CF2PN: A Cross-Scale Feature Fusion Pyramid Network Based Remote Sensing Target Detection
    Huang, Wei
    Li, Guanyi
    Chen, Qiqiang
    Ju, Ming
    Qu, Jiantao
    [J]. REMOTE SENSING, 2021, 13 (05) : 1 - 23
  • [10] An improved feature pyramid network for object detection
    Zhu, Linxiang
    Lee, Feifei
    Cai, Jiawei
    Yu, Hongliu
    Chen, Qiu
    [J]. NEUROCOMPUTING, 2022, 483 : 127 - 139