Content-augmented feature pyramid network with light linear spatial transformers for object detection

被引:2
|
作者
Gu, Yongxiang [1 ,2 ]
Qin, Xiaolin [1 ,2 ,3 ]
Peng, Yuncong [1 ,2 ]
Li, Lu [4 ]
机构
[1] Chinese Acad Sci, Chengdu Inst Comp Applicat, Chengdu 610041, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China
[3] Civil Aviat Flight Univ China, Sch Sci, Guanghan, Peoples R China
[4] Zenseact, Gothenburg, Sweden
基金
中国国家自然科学基金;
关键词
D O I
10.1049/ipr2.12575
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the prevalent components, feature pyramid network (FPN) is widely used in current object detection models for improving multi-scale object detection performance. However, its feature fusion mode is still in a misaligned and local manner, thus limiting the representation power. To address the inherited defects of FPN, a novel architecture termed content-augmented feature pyramid network (CA-FPN) is proposed in this paper. Firstly, a global content extraction module (GCEM) is proposed to extract multi-scale context information. Secondly, lightweight linear spatial Transformer connections are added in the top-down pathway to augment each feature map with multi-scale features, where a linearized approximate self-attention function is designed for reducing model complexity. By means of the self-attention mechanism in Transformer, it is no longer needed to align feature maps during feature fusion, thus solving the misaligned defect. By setting the query scope to the entire feature map, the local defect can also be solved. Extensive experiments on COCO and PASCAL VOC datasets demonstrated that the CA-FPN outperforms other FPN-based detectors without bells and whistles and is robust in different settings.
引用
收藏
页码:3567 / 3578
页数:12
相关论文
共 50 条
  • [1] Augmented weighted bidirectional feature pyramid network for marine object detection
    Gao, Jinxiong
    Geng, Xu
    Zhang, Yonghui
    Wang, Rong
    Shao, Kaixuan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [2] Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers
    Huang, Zhou
    Dai, Hang
    Xiang, Tian-Zhu
    Wang, Shuo
    Chen, Huai-Xin
    Qin, Jie
    Xiong, Huan
    [J]. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023, 2023-June : 5557 - 5566
  • [3] Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers
    Huang, Zhou
    Dai, Hang
    Xiang, Tian-Zhu
    Wang, Shuo
    Chen, Huai-Xin
    Qin, Jie
    Xiong, Huan
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5557 - 5566
  • [4] An improved feature pyramid network for object detection
    Zhu, Linxiang
    Lee, Feifei
    Cai, Jiawei
    Yu, Hongliu
    Chen, Qiu
    [J]. NEUROCOMPUTING, 2022, 483 : 127 - 139
  • [5] Complementary Feature Pyramid Network for Object Detection
    Xie, Jin
    Pang, Yanwei
    Pan, Jing
    Nie, Jing
    Cao, Jiale
    Han, Jungong
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [6] Parallel Feature Pyramid Network for Object Detection
    Kim, Seung-Wook
    Kook, Hyong-Keun
    Sun, Jee-Young
    Kang, Mun-Cheon
    Ko, Sung-Jea
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 239 - 256
  • [7] Latent Feature Pyramid Network for Object Detection
    Xie, Jin
    Pang, Yanwei
    Nie, Jing
    Cao, Jiale
    Han, Jungong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2153 - 2163
  • [8] Gated Feature Pyramid Network for Object Detection
    Xie, Xuemei
    Liao, Quan
    Ma, Lihua
    Jin, Xing
    [J]. PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 199 - 208
  • [9] FocusTR: Focusing on Valuable Feature by Multiple Transformers for Fusing Feature Pyramid on Object Detection
    Xie, Bangquan
    Yang, Liang
    Yang, Zongming
    Wei, Ailin
    Weng, Xiaoxiong
    Li, Bing
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 518 - 525
  • [10] SFPN: Semantic Feature Pyramid Network for Object Detection
    Gan, Yi
    Xu, Wei
    Su, Jianbo
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 795 - 802