IPD-Net: Infrared Pedestrian Detection Network via Adaptive Feature Extraction and Coordinate Information Fusion

被引:7
|
作者
Zhou, Lun [1 ,2 ]
Gao, Song [1 ,2 ]
Wang, Simin [1 ,2 ]
Zhang, Hengsheng [1 ,2 ]
Liu, Ruochen [1 ,2 ]
Liu, Jiaming [1 ,2 ]
机构
[1] Chengdu Univ Technol, Minist Educ, Key Lab Earth Explorat & Informat Tech, Chengdu 610059, Peoples R China
[2] Chengdu Univ Technol, Coll Mech & Elect Engn, Chengdu 610059, Peoples R China
基金
中国国家自然科学基金;
关键词
infrared images; infrared target detection; pedestrian detection; attention mechanism; YOLOv5; VIDEO SURVEILLANCE; ROBUST;
D O I
10.3390/s22228966
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Infrared pedestrian detection has important theoretical research value and a wide range of application scenarios. Because of its special imaging method, infrared images can be used for pedestrian detection at night and in severe weather conditions. However, the lack of pedestrian feature information in infrared images and the small scale of pedestrian objects makes it difficult for detection networks to extract feature information and accurately detect small-scale pedestrians. To address these issues, this paper proposes an infrared pedestrian detection network based on YOLOv5, named IPD-Net. Firstly, an adaptive feature extraction module (AFEM) is designed in the backbone network section, in which a residual structure with stepwise selective kernel was included to enable the model to better extract feature information under different sizes of the receptive field. Secondly, a coordinate attention feature pyramid network (CA-FPN) is designed to enhance the deep feature map with location information through the coordinate attention module, so that the network gains better capability of object localization. Finally, shallow information is introduced into the feature fusion network to improve the detection accuracy of weak and small objects. Experimental results on the large infrared image dataset ZUT show that the mean Average Precision (mAP50) of our model is improved by 3.6% compared to that of YOLOv5s. In addition, IPD-Net shows various degrees of accuracy improvement compared to other excellent methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] DRA-Net: Medical image segmentation based on adaptive feature extraction and region-level information fusion
    Huang, Zhongmiao
    Wang, Liejun
    Xu, Lianghui
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [32] Thermal-Infrared Pedestrian ROI Extraction through Thermal and Motion Information Fusion
    Fernandez-Caballero, Antonio
    Lopez, Maria T.
    Serrano-Cuerda, Juan
    SENSORS, 2014, 14 (04) : 6666 - 6676
  • [33] Infrared and Visible Image Fusion via Attention-Based Adaptive Feature Fusion
    Wang, Lei
    Hu, Ziming
    Kong, Quan
    Qi, Qian
    Liao, Qing
    ENTROPY, 2023, 25 (03)
  • [34] Convolutional Feature Frequency Adaptive Fusion Object Detection Network
    Mao, Lin
    Li, Xuemeng
    Yang, Dawei
    Zhang, Rubo
    NEURAL PROCESSING LETTERS, 2021, 53 (05) : 3545 - 3560
  • [35] Convolutional Feature Frequency Adaptive Fusion Object Detection Network
    Lin Mao
    Xuemeng Li
    Dawei Yang
    Rubo Zhang
    Neural Processing Letters, 2021, 53 : 3545 - 3560
  • [36] AGF-Net: adaptive global feature fusion network for road extraction from remote-sensing images
    Yajuan Zhang
    Lan Zhang
    Yunhe Wang
    Wenjia Xu
    Complex & Intelligent Systems, 2024, 10 : 4311 - 4328
  • [37] AGF-Net: adaptive global feature fusion network for road extraction from remote-sensing images
    Zhang, Yajuan
    Zhang, Lan
    Wang, Yunhe
    Xu, Wenjia
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 4311 - 4328
  • [38] AHFF-NET: ADAPTIVE HIERARCHICAL FEATURE FUSION NETWORK FOR IMAGE INPAINTING
    Zhang, Jiaqi
    Tang, Sheng
    Zhang, Xu
    Li, Yu
    Zhang, Rui
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 478 - 482
  • [39] Pedestrian detection-driven cascade network for infrared and visible image fusion
    Zheng, Bowen
    Huo, Hongtao
    Liu, Xiaowen
    Pang, Shan
    Li, Jing
    SIGNAL PROCESSING, 2024, 225
  • [40] An Information Retention and Feature Transmission Network for Infrared and Visible Image Fusion
    Liu, Chang
    Yang, Bin
    Li, Yuehua
    Zhang, Xiaozhi
    Pang, Lihui
    IEEE SENSORS JOURNAL, 2021, 21 (13) : 14950 - 14959