Centralized Feature Pyramid for Object Detection

被引:52
|
作者
Quan, Yu [1 ]
Zhang, Dong [1 ,2 ]
Zhang, Liyan [3 ]
Tang, Jinhui [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Collaborat Innovat Ctr Novel Software Technol & In, MIIT Key Lab Pattern Anal & Machine Intelligence, Nanjing 211106, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature pyramid; visual center; object detection; attention learning mechanism; long-range dependencies;
D O I
10.1109/TIP.2023.3297408
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The visual feature pyramid has shown its superiority in both effectiveness and efficiency in a variety of applications. However, current methods overly focus on inter-layer feature interactions while disregarding the importance of intra-layer feature regulation. Despite some attempts to learn a compact intra-layer feature representation with the use of attention mechanisms or vision transformers, they overlook the crucial corner regions that are essential for dense prediction tasks. To address this problem, we propose a Centralized Feature Pyramid (CFP) network for object detection, which is based on a globally explicit centralized feature regulation. Specifically, we first propose a spatial explicit visual center scheme, where a lightweight MLP is used to capture the globally long-range dependencies, and a parallel learnable visual center mechanism is used to capture the local corner regions of the input images. Based on this, we then propose a globally centralized regulation for the commonly-used feature pyramid in a top-down fashion, where the explicit visual center information obtained from the deepest intra-layer feature is used to regulate frontal shallow features. Compared to the existing feature pyramids, CFP not only has the ability to capture the global long-range dependencies but also efficiently obtain an all-round yet discriminative feature representation. Experimental results on the challenging MS-COCO validate that our proposed CFP can achieve consistent performance gains on the state-of-the-art YOLOv5 and YOLOX object detection baselines.
引用
收藏
页码:4341 / 4354
页数:14
相关论文
共 50 条
  • [1] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944
  • [2] Adaptive Feature Pyramid Networks for Object Detection
    Wang, Chengyang
    Zhong, Caiming
    [J]. IEEE ACCESS, 2021, 9 : 107024 - 107032
  • [3] An improved feature pyramid network for object detection
    Zhu, Linxiang
    Lee, Feifei
    Cai, Jiawei
    Yu, Hongliu
    Chen, Qiu
    [J]. NEUROCOMPUTING, 2022, 483 : 127 - 139
  • [4] Deep Feature Pyramid Reconfiguration for Object Detection
    Kong, Tao
    Sun, Fuchun
    Huang, Wenbing
    Liu, Huaping
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 172 - 188
  • [5] Complementary Feature Pyramid Network for Object Detection
    Xie, Jin
    Pang, Yanwei
    Pan, Jing
    Nie, Jing
    Cao, Jiale
    Han, Jungong
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [6] Weighted Feature Pyramid Networks for Object Detection
    Li, Xiaohan
    Lai, Taotao
    Wang, Shuaiyu
    Chen, Quan
    Yang, Changcai
    Chen, Riqing
    [J]. 2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1500 - 1504
  • [7] Parallel Feature Pyramid Network for Object Detection
    Kim, Seung-Wook
    Kook, Hyong-Keun
    Sun, Jee-Young
    Kang, Mun-Cheon
    Ko, Sung-Jea
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 239 - 256
  • [8] Adaptive learning feature pyramid for object detection
    Wong, Fukoeng
    Hu, Haifeng
    [J]. IET COMPUTER VISION, 2019, 13 (08) : 742 - 748
  • [9] An Interconnected Feature Pyramid Networks for object detection
    Wang, Qiang
    Zhou, Lukuan
    Yao, Yuncong
    Wang, Yong
    Li, Jun
    Yang, Wankou
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 79
  • [10] Latent Feature Pyramid Network for Object Detection
    Xie, Jin
    Pang, Yanwei
    Nie, Jing
    Cao, Jiale
    Han, Jungong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2153 - 2163