IPD-Net: Infrared Pedestrian Detection Network via Adaptive Feature Extraction and Coordinate Information Fusion

被引:7
|
作者
Zhou, Lun [1 ,2 ]
Gao, Song [1 ,2 ]
Wang, Simin [1 ,2 ]
Zhang, Hengsheng [1 ,2 ]
Liu, Ruochen [1 ,2 ]
Liu, Jiaming [1 ,2 ]
机构
[1] Chengdu Univ Technol, Minist Educ, Key Lab Earth Explorat & Informat Tech, Chengdu 610059, Peoples R China
[2] Chengdu Univ Technol, Coll Mech & Elect Engn, Chengdu 610059, Peoples R China
基金
中国国家自然科学基金;
关键词
infrared images; infrared target detection; pedestrian detection; attention mechanism; YOLOv5; VIDEO SURVEILLANCE; ROBUST;
D O I
10.3390/s22228966
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Infrared pedestrian detection has important theoretical research value and a wide range of application scenarios. Because of its special imaging method, infrared images can be used for pedestrian detection at night and in severe weather conditions. However, the lack of pedestrian feature information in infrared images and the small scale of pedestrian objects makes it difficult for detection networks to extract feature information and accurately detect small-scale pedestrians. To address these issues, this paper proposes an infrared pedestrian detection network based on YOLOv5, named IPD-Net. Firstly, an adaptive feature extraction module (AFEM) is designed in the backbone network section, in which a residual structure with stepwise selective kernel was included to enable the model to better extract feature information under different sizes of the receptive field. Secondly, a coordinate attention feature pyramid network (CA-FPN) is designed to enhance the deep feature map with location information through the coordinate attention module, so that the network gains better capability of object localization. Finally, shallow information is introduced into the feature fusion network to improve the detection accuracy of weak and small objects. Experimental results on the large infrared image dataset ZUT show that the mean Average Precision (mAP50) of our model is improved by 3.6% compared to that of YOLOv5s. In addition, IPD-Net shows various degrees of accuracy improvement compared to other excellent methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Adaptive spatial pixel-level feature fusion network for multispectral pedestrian detection
    Fu, Lei
    Gu, Wen-bin
    Ai, Yong-bao
    Li, Wei
    Wang, Dong
    Infrared Physics and Technology, 2021, 116
  • [2] Adaptive spatial pixel-level feature fusion network for multispectral pedestrian detection
    Fu, Lei
    Gu, Wen-bin
    Ai, Yong-bao
    Li, Wei
    Wang, Dong
    INFRARED PHYSICS & TECHNOLOGY, 2021, 116
  • [3] A multispectral feature fusion network for robust pedestrian detection
    Song, Xiaoru
    Gao, Song
    Chen, Chaobo
    ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (01) : 73 - 85
  • [4] Pedestrian Detection via Multi-scale Feature Fusion Convolutional Neural Network
    Guo, Aixin
    Yin, Baoqun
    Zhang, Jing
    Yao, Jinfa
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 1364 - 1368
  • [5] Moderately Dense Adaptive Feature Fusion Network for Infrared Small Target Detection
    Li, Chengyu
    Zhang, Yan
    Shi, Zhiguang
    Zhang, Yu
    Zhang, Yi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 (1-12): : 1 - 12
  • [6] EAFF-Net: Efficient attention feature fusion network for dual-modality pedestrian detection
    Shen, Ying
    Xie, Xiaoyang
    Wu, Jing
    Chen, Liqiong
    Huang, Feng
    INFRARED PHYSICS & TECHNOLOGY, 2025, 145
  • [7] Small Object Detection by DETR via Information Augmentation and Adaptive Feature Fusion
    Huang, Ji
    Li, Tianrui
    PROCEEDINGS OF 2024 ACM ICMR WORKSHOP ON MULTIMODAL VIDEO RETRIEVAL, ICMR-MVR 2024, 2024, : 39 - 44
  • [8] Pedestrian Detection Using Regional Proposal Network with Feature Fusion
    Lv, Xiaogang
    Zhang, Xiaotao
    Jiang, Yinghua
    Zhang, Jianxin
    2018 EIGHTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2018, : 108 - 112
  • [9] MSIA-Net: A Lightweight Infrared Target Detection Network with Efficient Information Fusion
    Yu, Jimin
    Li, Shun
    Zhou, Shangbo
    Wang, Hui
    ENTROPY, 2023, 25 (05)
  • [10] Adaptive Feature-Manipulated Vehicle and Pedestrian Detection in Infrared Images
    Chen, Guangchen
    Zhang, Pengcheng
    Zhang, Yinhui
    He, Zifen
    Shi, Benjie
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (04) : 4579 - 4591