IPD-Net: Infrared Pedestrian Detection Network via Adaptive Feature Extraction and Coordinate Information Fusion

Cited by: 7
Authors
Zhou, Lun [1, 2]
Gao, Song [1, 2]
Wang, Simin [1, 2]
Zhang, Hengsheng [1, 2]
Liu, Ruochen [1, 2]
Liu, Jiaming [1, 2]
Affiliations
[1] Chengdu Univ Technol, Minist Educ, Key Lab Earth Explorat & Informat Tech, Chengdu 610059, Peoples R China
[2] Chengdu Univ Technol, Coll Mech & Elect Engn, Chengdu 610059, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
infrared images; infrared target detection; pedestrian detection; attention mechanism; YOLOv5; VIDEO SURVEILLANCE; ROBUST;
DOI
10.3390/s22228966
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Infrared pedestrian detection has important theoretical research value and a wide range of application scenarios. Because of the way they are captured, infrared images can be used for pedestrian detection at night and in severe weather conditions. However, the lack of pedestrian feature information in infrared images and the small scale of pedestrian objects make it difficult for detection networks to extract feature information and accurately detect small-scale pedestrians. To address these issues, this paper proposes an infrared pedestrian detection network based on YOLOv5, named IPD-Net. First, an adaptive feature extraction module (AFEM) is designed in the backbone network, in which a residual structure with a stepwise selective kernel is included so that the model can better extract feature information under receptive fields of different sizes. Second, a coordinate attention feature pyramid network (CA-FPN) is designed to enhance the deep feature map with location information through the coordinate attention module, which improves the network's object localization capability. Finally, shallow information is introduced into the feature fusion network to improve the detection accuracy for weak and small objects. Experimental results on the large infrared image dataset ZUT show that the mean Average Precision (mAP50) of the model is 3.6% higher than that of YOLOv5s. In addition, IPD-Net shows varying degrees of accuracy improvement over the other methods it is compared with.
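The abstract reports results as mAP50. By the usual convention, the "50" means a predicted box counts as a correct detection only when its intersection over union (IoU) with a ground-truth box is at least 0.5. The sketch below illustrates that matching criterion only. The box format `(x_min, y_min, x_max, y_max)` and the function names `iou` and `is_true_positive` are assumptions made for illustration, and this record does not describe the paper's exact evaluation protocol.

```python
from typing import Tuple

# Assumed box format (not taken from the paper): (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    # Width and height of the overlap; clamped to zero when boxes are disjoint.
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred: Box, truth: Box, threshold: float = 0.5) -> bool:
    """Hypothetical helper: the IoU >= 0.5 test implied by the '50' in mAP50."""
    return iou(pred, truth) >= threshold


if __name__ == "__main__":
    truth = (10.0, 10.0, 20.0, 30.0)      # a narrow, pedestrian-like box
    close = (12.0, 10.0, 22.0, 30.0)      # shifted right by 2 pixels
    far = (16.0, 10.0, 26.0, 30.0)        # shifted right by 6 pixels
    print(is_true_positive(close, truth))
    print(is_true_positive(far, truth))
```

For narrow boxes such as small pedestrians, a shift of a few pixels is enough to drop the IoU below 0.5, which is one reason localization quality weighs heavily on this metric.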
Pages: 18