Content-augmented feature pyramid network with light linear spatial transformers for object detection

被引:2
|
作者
Gu, Yongxiang [1 ,2 ]
Qin, Xiaolin [1 ,2 ,3 ]
Peng, Yuncong [1 ,2 ]
Li, Lu [4 ]
机构
[1] Chinese Acad Sci, Chengdu Inst Comp Applicat, Chengdu 610041, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China
[3] Civil Aviat Flight Univ China, Sch Sci, Guanghan, Peoples R China
[4] Zenseact, Gothenburg, Sweden
基金
中国国家自然科学基金;
关键词
D O I
10.1049/ipr2.12575
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the prevalent components, feature pyramid network (FPN) is widely used in current object detection models for improving multi-scale object detection performance. However, its feature fusion mode is still in a misaligned and local manner, thus limiting the representation power. To address the inherited defects of FPN, a novel architecture termed content-augmented feature pyramid network (CA-FPN) is proposed in this paper. Firstly, a global content extraction module (GCEM) is proposed to extract multi-scale context information. Secondly, lightweight linear spatial Transformer connections are added in the top-down pathway to augment each feature map with multi-scale features, where a linearized approximate self-attention function is designed for reducing model complexity. By means of the self-attention mechanism in Transformer, it is no longer needed to align feature maps during feature fusion, thus solving the misaligned defect. By setting the query scope to the entire feature map, the local defect can also be solved. Extensive experiments on COCO and PASCAL VOC datasets demonstrated that the CA-FPN outperforms other FPN-based detectors without bells and whistles and is robust in different settings.
引用
收藏
页码:3567 / 3578
页数:12
相关论文
共 50 条
  • [21] Hierarchical Focused Feature Pyramid Network for Small Object Detection
    Wang, Siwei
    Chen, Zhiwei
    Ding, Haoyang
    Cao, Liujuan
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XII, 2024, 14436 : 432 - 444
  • [22] Feature Pyramid Object Detection Network Based on Function Maintenance
    Xu C.
    Hong X.
    [J]. Hong, Xuehai (hxh@ict.ac.cn), 1600, Science Press (33): : 507 - 517
  • [23] SAFPN: a full semantic feature pyramid network for object detection
    Wang, Gaihua
    Li, Qi
    Wang, Nengyuan
    Liu, Hong
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (04) : 1729 - 1739
  • [24] A Novel Pyramid Network with Feature Fusion and Disentanglement for Object Detection
    Yu, Guoyi
    Wu, You
    Xiao, Jing
    Cao, Yang
    [J]. Xiao, Jing (xiaojing@scnu.edu.cn), 1600, Hindawi Limited (2021):
  • [25] Enhancement-fusion feature pyramid network for object detection
    Dong, Shifeng
    Wang, Rujing
    Du, Jianming
    Jiao, Lin
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [26] SSRDet: Small Object Detection Based on Feature Pyramid Network
    Zhang, Lijuan
    Wang, Minhui
    Jiang, Yutong
    Li, Dongming
    Zhou, Yue
    [J]. IEEE ACCESS, 2023, 11 : 96743 - 96752
  • [27] Reverse Densely Connected Feature Pyramid Network for Object Detection
    Xin, Yongjian
    Wang, Shuhui
    Li, Liang
    Zhang, Weigang
    Huang, Qingming
    [J]. COMPUTER VISION - ACCV 2018, PT V, 2019, 11365 : 530 - 545
  • [28] Enhanced semantic feature pyramid network for small object detection
    Chen, Yuqi
    Zhu, Xiangbin
    Li, Yonggang
    Wei, Yuanwang
    Ye, Lihua
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 113
  • [29] A Novel Pyramid Network with Feature Fusion and Disentanglement for Object Detection
    Yu, Guoyi
    Wu, You
    Xiao, Jing
    Cao, Yang
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [30] An Enhanced Feature Pyramid Object Detection Network for Autonomous Driving
    Wu, Yutian
    Tang, Shuming
    Zhang, Shuwei
    Ogai, Harutoshi
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (20):