Content-augmented feature pyramid network with light linear spatial transformers for object detection

被引:2
|
作者
Gu, Yongxiang [1 ,2 ]
Qin, Xiaolin [1 ,2 ,3 ]
Peng, Yuncong [1 ,2 ]
Li, Lu [4 ]
机构
[1] Chinese Acad Sci, Chengdu Inst Comp Applicat, Chengdu 610041, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China
[3] Civil Aviat Flight Univ China, Sch Sci, Guanghan, Peoples R China
[4] Zenseact, Gothenburg, Sweden
基金
中国国家自然科学基金;
关键词
D O I
10.1049/ipr2.12575
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the prevalent components, feature pyramid network (FPN) is widely used in current object detection models for improving multi-scale object detection performance. However, its feature fusion mode is still in a misaligned and local manner, thus limiting the representation power. To address the inherited defects of FPN, a novel architecture termed content-augmented feature pyramid network (CA-FPN) is proposed in this paper. Firstly, a global content extraction module (GCEM) is proposed to extract multi-scale context information. Secondly, lightweight linear spatial Transformer connections are added in the top-down pathway to augment each feature map with multi-scale features, where a linearized approximate self-attention function is designed for reducing model complexity. By means of the self-attention mechanism in Transformer, it is no longer needed to align feature maps during feature fusion, thus solving the misaligned defect. By setting the query scope to the entire feature map, the local defect can also be solved. Extensive experiments on COCO and PASCAL VOC datasets demonstrated that the CA-FPN outperforms other FPN-based detectors without bells and whistles and is robust in different settings.
引用
收藏
页码:3567 / 3578
页数:12
相关论文
共 50 条
  • [41] SEFPN: Scale-Equalizing Feature Pyramid Network for Object Detection
    Zhang, Zhiqiang
    Qiu, Xin
    Li, Yongzhou
    [J]. SENSORS, 2021, 21 (21)
  • [42] An Efficient Feature Pyramid Network for Object Detection in Remote Sensing Imagery
    Fang Qingyun
    Zhang Lin
    Wang Zhaokui
    [J]. IEEE ACCESS, 2020, 8 : 93058 - 93068
  • [43] Multi-level feature fusion pyramid network for object detection
    Zebin Guo
    Hui Shuai
    Guangcan Liu
    Yisheng Zhu
    Wenqing Wang
    [J]. The Visual Computer, 2023, 39 : 4267 - 4277
  • [44] Discriminative Feature Pyramid Network For Object Detection In Remote Sensing Images
    Zhu, Xiaoqian
    Zhang, Xiangrong
    Zhang, Tianyang
    Zhu, Peng
    Tang, Xu
    Li, Chen
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [45] Dual-bottleneck feature pyramid network for multiscale object detection
    Chen, Suting
    Ma, Wenyan
    Zhang, Liangchen
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
  • [46] Weighted Feature Pyramid Network for One-Stage Object Detection
    Tu, Xiaobo
    Zhan, Yongzhao
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 606 - 617
  • [47] Cross-Layer Feature Pyramid Network for Salient Object Detection
    Li, Zun
    Lang, Congyan
    Liew, Jun Hao
    Li, Yidong
    Hou, Qibin
    Feng, Jiashi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4587 - 4598
  • [48] FI-FPN: Feature-integration feature pyramid network for object detection
    Su, Qichen
    Zhang, Guangjian
    Wu, Shuang
    Yin, Yiming
    [J]. AI COMMUNICATIONS, 2023, 36 (03) : 191 - 203
  • [49] Feature spatial pyramid network for low-light image enhancement
    Song, Xijuan
    Huang, Jijiang
    Cao, Jianzhong
    Song, Dawei
    [J]. VISUAL COMPUTER, 2023, 39 (01): : 489 - 499
  • [50] Feature spatial pyramid network for low-light image enhancement
    Xijuan Song
    Jijiang Huang
    Jianzhong Cao
    Dawei Song
    [J]. The Visual Computer, 2023, 39 : 489 - 499