Learning Discriminated Features Based on Feature Pyramid Networks and Attention for Multi-scale Object Detection

被引:0
|
作者
Yunhua Lu
Minghui Su
Yong Wang
Zhi Liu
Tao Peng
机构
[1] Chongqing University of Technology,School of Artificial Intelligence
来源
Cognitive Computation | 2023年 / 15卷
关键词
Object detection; Multi-scale; Feature pyramid; Discriminative learning; Attention mechanism;
D O I
暂无
中图分类号
学科分类号
摘要
As the research scene in object detection becomes increasingly complex, the extracted feature information needs to be further improved. Many multi-scale feature pyramid network methods have been proposed to improve detection accuracy. However, most of them just follow a simple chain aggregation structure, resulting in not considering the distinction between multi-scale objects. Modern cognitive research presents that human cognitive ability is not a simple image-based matching process. It has an inherent process of information decomposition and reconstruction. Inspired by this theory, a new feature pyramid network model denoted as SuFPN based on discriminative learning is proposed to solve the problem of multi-scale object detection. In SuFPN, the correlation between the underlying location information and the deep feature information is fully considered. Firstly, object features are extracted through top-down path and lateral connection. Then deformable convolution is used to extract object discriminant spatial information. Finally, the attention mechanism is introduced to generate a discriminative feature map with enhanced spatial and channel interdependence, which provides excellent location information for the feature pyramid while considering semantic information. The proposed SuFPN is validated on the PASCAL VOC and COCO datasets. The Average Precision (AP) value reaches 80.0 on the PASCAL VOC dataset, which is 1.7 points higher than the feature pyramid networks (FPN), and 39.2 on the COCO dataset, which is 1.8 points higher than the FPN. The result demonstrates that our SuFPN outperforms other advanced methods in the multi-scale detection precision.
引用
收藏
页码:486 / 495
页数:9
相关论文
共 50 条
  • [31] Feature Enhancement for Multi-scale Object Detection
    Zheng, Huicheng
    Chen, Jiajie
    Chen, Lvran
    Li, Ye
    Yan, Zhiwei
    NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1907 - 1919
  • [32] MDFN: Multi-scale deep feature learning network for object detection
    Ma, Wenchi
    Wu, Yuanwei
    Cen, Feng
    Wang, Guanghui
    PATTERN RECOGNITION, 2020, 100
  • [33] FEATURE FUSING OF FEATURE PYRAMID NETWORK FOR MULTI-SCALE PEDESTRIAN DETECTION
    Tesema, Fiseha B.
    Lin, Junpeng
    Ou, Jie
    Wu, Hong
    Zhu, William
    2018 15TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2018, : 10 - 13
  • [34] Animal species detection and classification framework based on modified multi-scale attention mechanism and feature pyramid network
    Ukwuoma, Chiagoziem C.
    Qin, Zhiguang
    Yussif, Sophyani B.
    Happy, Monday N.
    Nneji, Grace U.
    Urama, Gilbert C.
    Ukwuoma, Chibueze D.
    Darkwa, Nimo B.
    Agobah, Harriet
    SCIENTIFIC AFRICAN, 2022, 16
  • [35] Multi-scale global context feature pyramid network for object detector
    Yunhao Li
    Mingwen Shao
    Bingbing Fan
    Wei Zhang
    Signal, Image and Video Processing, 2022, 16 : 705 - 713
  • [36] Salient Object Detection Using Multi-Scale Features with Attention Recurrent Mechanism
    Lu S.
    Guo Q.
    Wang R.
    Zhang C.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (12): : 1926 - 1937
  • [37] FEATURE-TRANSFERABLE PYRAMID NETWORK FOR DENSE MULTI-SCALE OBJECT DETECTION IN SAR IMAGES
    Zhou, Zheng
    Cui, Zongyong
    Cao, Zongjie
    Yang, Jianyu
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 647 - 650
  • [38] Combined with Pyramid Split Attention and Multi-Scale Feature Learning Network for Single Image Deraining
    Chen, Hui
    Zhu, Songhao
    Liang, Zhiwei
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7064 - 7069
  • [39] Multi-scale global context feature pyramid network for object detector
    Li, Yunhao
    Shao, Mingwen
    Fan, Bingbing
    Zhang, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (03) : 705 - 713
  • [40] Multi-scale feature pyramid-based crater detection on lunar surface
    Chaini, Chinmayee
    Jha, Vijay Kumar
    Rajnish, Kumar
    EARTH SCIENCE INFORMATICS, 2025, 18 (03)