Real-time Infrared Target Detection Algorithm for Embedded System in Complex Scene

被引:1
|
作者
Zhang Penghui [1 ]
Liu Zhi [2 ]
Zheng Jianyong [3 ]
He Boxia [1 ]
Pei Yuhao [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Mech Engn, Nanjing 210094, Peoples R China
[2] Nanjing Plandech Intelligent Technol Co LTD, Nanjing 210014, Peoples R China
[3] Shanghai Univ, Inst Artificial Intelligence, Shanghai 200444, Peoples R China
基金
中国国家自然科学基金;
关键词
Infared image; Visual attention; Transfer learning; Target detection; Embedded platform;
D O I
10.3788/gzxb20225102.0210002
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
In order to solve the problems of low accuracy and recall rate of infrared target detection under complex background conditions, as well as slow inference speed of network model on embedded computing platform, lightweight network YOLOv4-Tiny was taken as the basic architecture of the algorithm, combined with visual attention mechanism and spatial pyramid pooling idea. Two real-time infrared target detection networks for embedded systems are proposed. Among them, there are a lot of background interference information in target detection in infrared complex scenes. Therefore, the visual attention mechanism is used to effectively learn the weight distribution of the feature map, recalibrate the feature map, strengthen the focus on the target, reduce the influence of irrelevant background information and improve the detection and recognition ability of the model. Spatial pyramid pooling can fuse multi-scale features, enrich the information of feature maps and improve the ability of infrared target recognition and location at different scales. Grad-CAM was used to visualize the feature map strengthened by the attention mechanism, showing the attention of the network model to the target region. The training is carried out on a 2080Ti GPU computer platform using the transfer learning strategy, and deployed on the Atlas 200 DK embedded computing platform with Ascend 310 AI chip as the core. The experimental results show that compared with the original network YOLOv4-Tiny, the infrared images with a resolution of 640 pixels x 512 pixels are detected on the computer platform. The average accuracy and recall rate of the proposed YOLOv4-Tiny+SE+SPP network were improved by 13.96% and 20.14%, respectively, and the inference speed reached 212 FPS. The average accuracy and recall rate of the proposed YOLOv4-Tiny+CBAM+SPP network were improved by 15.75% and 22.41%, respectively, and the inference speed reached 202 FPS. On Atlas 200 DK embedded computing platform, infrared images with a resolution of 640 pixelx512 pixel are detected, compared with the original network YOLOv4-Tiny. The average accuracy and recall rate of the proposed YOLOv4-Tiny+SE+SPP network were improved by 12.36% and 18.6%, respectively, and the inference speed reached 78 FPS. The average accuracy and recall rate of the proposed network YOLOv4-Tiny+CBAM+SPP are improved by 15.94% and 22.89%, respectively, and the inference speed reaches 71 FPS, which can meet the needs of real-time detection and tracking of infrared targets in military and security fields.
引用
收藏
页数:10
相关论文
共 18 条
  • [1] Bochkovskiy A., 2020, YOLOv4: Optimal Speed and Accuracy of Object Detection, DOI DOI 10.48550/ARXIV.2004.10934
  • [2] [陈东杰 Chen Dongjie], 2017, [中国科学技术大学学报, Journal of University of Science and Technology of China], V47, P320
  • [3] FLIR, FLIR
  • [4] HaimingTIE FU Guangyuan, 2019, LASER OPTOELECTRON P, V56, P147
  • [5] He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]
  • [6] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
  • [7] JIN Xin, 2020, CHINESE HIGH TECHNOL, V30, P32
  • [8] [鞠默然 Ju Moran], 2021, [光学精密工程, Optics and Precision Engineering], V29, P843
  • [9] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
  • [10] LU Jian, 2019, Acta Photonica Sinica, V48