A Multi-Level Cross-Attention Image Registration Method for Visible and Infrared Small Unmanned Aerial Vehicle Targets via Image Style Transfer

被引:3
|
作者
Jiang, Wen [1 ]
Pan, Hanxin [1 ]
Wang, Yanping [1 ]
Li, Yang [1 ]
Lin, Yun [1 ]
Bi, Fukun [1 ]
机构
[1] North China Univ Technol, Sch Informat Sci & Technol, Radar Monitoring Technol Lab, Beijing 100144, Peoples R China
关键词
image registration; small UAV targets; cross-modality image; image fusion; deep learning; TRANSLATION;
D O I
10.3390/rs16162880
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Small UAV target detection and tracking based on cross-modality image fusion have gained widespread attention. Due to the limited feature information available from small UAVs in images, where they occupy a minimal number of pixels, the precision required for detection and tracking algorithms is particularly high in complex backgrounds. Image fusion techniques can enrich the detailed information for small UAVs, showing significant advantages under extreme lighting conditions. Image registration is a fundamental step preceding image fusion. It is essential to achieve accurate image alignment before proceeding with image fusion to prevent severe ghosting and artifacts. This paper specifically focused on the alignment of small UAV targets within infrared and visible light imagery. To address this issue, this paper proposed a cross-modality image registration network based on deep learning, which includes a structure preservation and style transformation network (SPSTN) and a multi-level cross-attention residual registration network (MCARN). Firstly, the SPSTN is employed for modality transformation, transferring the cross-modality task into a single-modality task to reduce the information discrepancy between modalities. Then, the MCARN is utilized for single-modality image registration, capable of deeply extracting and fusing features from pseudo infrared and visible images to achieve efficient registration. To validate the effectiveness of the proposed method, comprehensive experimental evaluations were conducted on the Anti-UAV dataset. The extensive evaluation results validate the superiority and universality of the cross-modality image registration framework proposed in this paper, which plays a crucial role in subsequent image fusion tasks for more effective target detection.
引用
收藏
页数:19
相关论文
共 10 条
  • [1] Visible and infrared image fusion based on multi-level method and image contrast improvement
    Peng, Yiyue
    He, Weiji
    Gu, Guohua
    Tong, Tao
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2013, 42 (04): : 1095 - 1099
  • [2] Multi-Level Adaptive Attention Fusion Network for Infrared and Visible Image Fusion
    Hu, Ziming
    Kong, Quan
    Liao, Qing
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 366 - 370
  • [3] Image Registration and Fusion of Visible and Infrared Integrated Camera for Medium-Altitude Unmanned Aerial Vehicle Remote Sensing
    Li, Hongguang
    Ding, Wenrui
    Cao, Xianbin
    Liu, Chunlei
    REMOTE SENSING, 2017, 9 (05)
  • [4] Infrared image denoising via adversarial learning with multi-level feature attention network
    Yang, Pengfei
    Wu, Heng
    Cheng, Lianglun
    Luo, Shaojuan
    INFRARED PHYSICS & TECHNOLOGY, 2023, 128
  • [5] DFA-Net: Multi-Scale Dense Feature-Aware Network via Integrated Attention for Unmanned Aerial Vehicle Infrared and Visible Image Fusion
    Shen, Sen
    Li, Di
    Mei, Liye
    Xu, Chuan
    Ye, Zhaoyi
    Zhang, Qi
    Hong, Bo
    Yang, Wei
    Wang, Ying
    DRONES, 2023, 7 (08)
  • [6] A novel infrared and visible image fusion method based on multi-level saliency integration
    Lu, Ruitao
    Gao, Fan
    Yang, Xiaogang
    Fan, Jiwei
    Li, Dalei
    VISUAL COMPUTER, 2023, 39 (06): : 2321 - 2335
  • [7] A novel infrared and visible image fusion method based on multi-level saliency integration
    Ruitao Lu
    Fan Gao
    Xiaogang Yang
    Jiwei Fan
    Dalei Li
    The Visual Computer, 2023, 39 (6) : 2321 - 2335
  • [8] MdedFusion: A multi-level detail enhancement decomposition method for infrared and visible image fusion
    Tang, Haojie
    Liu, Gang
    Tang, Lili
    Bavirisetti, Durga Prasad
    Wang, Jiebang
    INFRARED PHYSICS & TECHNOLOGY, 2022, 127
  • [9] Efficient multi-level cross-modal fusion and detection network for infrared and visible image
    Gao, Hongwei
    Wang, Yutong
    Sun, Jian
    Jiang, Yueqiu
    Gai, Yonggang
    Yu, Jiahui
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 108 : 306 - 318
  • [10] AMLCA: Additive multi-layer convolution-guided cross-attention network for visible and infrared image fusion
    Wang, Dongliang
    Huang, Chuang
    Pan, Hao
    Sun, Yuan
    Dai, Jian
    Li, Yanan
    Ren, Zhenwen
    PATTERN RECOGNITION, 2025, 163