Visible-Infrared Dual-Sensor Tracking Based on Transformer via Progressive Feature Enhancement and Fusion

被引:0
|
作者
Kuai, Yangliu [1 ]
Li, Dongdong [2 ]
Gao, Zhinan [2 ]
Yuan, Mingwei [1 ]
Zhang, Da [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature fusion; progressive feature enhancement; RGBT tracking; sensor data fusion; RGBT TRACKING; NETWORK;
D O I
10.1109/JSEN.2024.3372991
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article investigates how to implement accurate RGB-T tracking by achieving effective feature enhancement of the target and adaptive fusion of the complementary information in RGB and thermal infrared modalities. Inspired by the excellent long-range dependency modeling ability of the transformer, we propose a novel RGBT tracking method based on the transformer via progressive feature enhancement and fusion. The overall flowchart of our proposed tracker consists of a two-branch Siamese network, respectively, an exemplar branch, and a search branch. First, deep features of the RGB and thermal infrared images are extracted by a backbone. Then the features in each branch are enhanced progressively in the channel and spatial dimensions. Specifically, in the channel dimension, the channel attention feature module (CAFM) is designed to adaptively enhance the RGB and thermal infrared features. In the spatial dimension, the transformer self-attention mechanism with the AiA module is integrated to enhance the dual-modality features. Next, the enhanced features from the exemplar and search branches are fused based on the transformer cross-attention mechanism, which can achieve global and deep interaction between the exemplar and search images. Finally, the fused features are fed into a corner predictor head to estimate the target state. Experiments on two widely used public benchmarks (RGBT234 and LasHeR) demonstrate the effectiveness and efficiency of our proposed method when compared to many other state-of-the-art (SOTA) trackers released recently.
引用
收藏
页码:14519 / 14528
页数:10
相关论文
共 50 条
  • [1] Visible-Infrared Dual-Sensor Fusion for Single-Object Tracking
    Liu, Weichun
    Liu, Weibing
    Sun, Yuxin
    [J]. IEEE SENSORS JOURNAL, 2023, 23 (04) : 4118 - 4128
  • [2] Dual Attention Feature Fusion for Visible-Infrared Object Detection
    Hu, Yuxuan
    Shi, Limin
    Yao, Libo
    Weng, Lubin
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 53 - 65
  • [3] Dual-granularity feature fusion in visible-infrared person re-identification
    Cai, Shuang
    Yang, Shanmin
    Hu, Jing
    Wu, Xi
    [J]. IET IMAGE PROCESSING, 2024, 18 (04) : 972 - 980
  • [4] Learning dual attention enhancement feature for visible-infrared person re-identification
    Zhang, Guoqing
    Zhang, Yinyin
    Zhang, Hongwei
    Chen, Yuhao
    Zheng, Yuhui
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 99
  • [5] DATFuse: Infrared and Visible Image Fusion via Dual Attention Transformer
    Tang, Wei
    He, Fazhi
    Liu, Yu
    Duan, Yansong
    Si, Tongzhen
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (07) : 3159 - 3172
  • [6] Person Tracking by Detection Using Dual Visible-Infrared Cameras
    Geng, Xuewen
    Li, Minglei
    Liu, Wenping
    Zhu, Shengkai
    Jiang, Hongbo
    Bian, Jiawen
    Fan, Xuezhi
    Peng, Ruiqing
    Luo, Jun
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (22) : 23241 - 23251
  • [7] Progressive learning in cross-modal cross-scale fusion transformer for visible-infrared video-based person reidentification
    Mukhtar, Hamza
    Mukhtar, Umar Raza
    [J]. Knowledge-Based Systems, 2024, 304
  • [8] Progressive Discriminative Feature Learning for Visible-Infrared Person Re-Identification
    Zhou, Feng
    Cheng, Zhuxuan
    Yang, Haitao
    Song, Yifeng
    Fu, Shengpeng
    [J]. ELECTRONICS, 2024, 13 (14)
  • [9] Fusion of infrared and visible images based on image enhancement and feature extraction
    Luo, Jinzhe
    Rong, Chuanzhen
    Jia, Yongxing
    Yang, Yu
    Zhu, Ying
    [J]. 2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 1, 2019, : 212 - 216
  • [10] Thermal infrared and visible sequences tracking via dual adversarial pixel fusion
    Zheng, Hang
    Yuan, Nangezi
    Ding, Hongwei
    Hu, Peng
    Yang, Zhijun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (40) : 88303 - 88322