Bridging Search Region Interaction with Template for RGB-T Tracking

被引:44
|
作者
Hui, Tianrui [1 ,2 ]
Xun, Zizheng [3 ,5 ]
Peng, Fengguang [3 ,5 ]
Huang, Junshi [4 ]
Wei, Xiaoming [4 ]
Wei, Xiaolin [4 ]
Dai, Jiao [1 ,2 ]
Han, Jizhong [1 ,2 ]
Liu, Si [3 ,5 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Beihang Univ, Inst Artificial Intelligence, Beijing, Peoples R China
[4] Meituan, Beijing, Peoples R China
[5] Beihang Univ, Hangzhou Innovat Inst, Hangzhou, Peoples R China
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RGB-T tracking aims to leverage the mutual enhancement and complement ability of RGB and TIR modalities for improving the tracking process in various scenarios, where cross-modal interaction is the key component. Some previous methods concatenate the RGB and TIR search region features directly to perform a coarse interaction process with redundant background noises introduced. Many other methods sample candidate boxes from search frames and conduct various fusion approaches on isolated pairs of RGB and TIR boxes, which limits the cross-modal interaction within local regions and brings about inadequate context modeling. To alleviate these limitations, we propose a novel Template-Bridged Search region Interaction (TBSI) module which exploits templates as the medium to bridge the cross-modal interaction between RGB and TIR search regions by gathering and distributing target-relevant object and environment contexts. Original templates are also updated with enriched multimodal contexts from the template medium. Our TBSI module is inserted into a ViT backbone for joint feature extraction, search-template matching, and cross-modal interaction. Extensive experiments on three popular RGB-T tracking benchmarks demonstrate our method achieves new state-of-the-art performances. Code is available at https://github.com/RyanHTR/TBSI.
引用
收藏
页码:13630 / 13639
页数:10
相关论文
共 50 条
  • [41] TEFNet: Target-Aware Enhanced Fusion Network for RGB-T Tracking
    Chen, Panfeng
    Gong, Shengrong
    Ying, Wenhao
    Du, Xin
    Zhong, Shan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 432 - 443
  • [42] RGB-T tracking by modality difference reduction and feature re-selection
    Zhang, Qiang
    Liu, Xueru
    Zhang, Tianlu
    IMAGE AND VISION COMPUTING, 2022, 127
  • [43] Weighted Sparse Representation Regularized Graph Learning for RGB-T Object Tracking
    Li, Chenglong
    Zhao, Nan
    Lu, Yijuan
    Zhu, Chengli
    Tang, Jin
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1856 - 1864
  • [44] RGB-T目标跟踪综述
    丁正彤
    徐磊
    张研
    李飘扬
    李阳阳
    罗斌
    涂铮铮
    南京信息工程大学学报(自然科学版), 2019, 11 (06) : 690 - 697
  • [45] Fusing two-stream convolutional neural networks for RGB-T object tracking
    Li, Chenglong
    Wu, Xiaohao
    Zhao, Nan
    Cao, Xiaochun
    Tang, Jin
    NEUROCOMPUTING, 2018, 281 : 78 - 85
  • [46] Unsupervised RGB-T object tracking with attentional multi-modal feature fusion
    Shenglan Li
    Rui Yao
    Yong Zhou
    Hancheng Zhu
    Bing Liu
    Jiaqi Zhao
    Zhiwen Shao
    Multimedia Tools and Applications, 2023, 82 : 23595 - 23613
  • [47] Multi-scale feature extraction and fusion with attention interaction for RGB-T
    Xing, Haijiao
    Wei, Wei
    Zhang, Lei
    Zhang, Yanning
    PATTERN RECOGNITION, 2025, 157
  • [48] An RGB-T Object Tracking Method for Solving Camera Motion Based on Correlation Filter
    Zhao, Zhongxuan
    Li, Weixing
    Pan, Feng
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 3526 - 3531
  • [49] Maximize Peak-to-Sidelobe Ratio for Real-Time RGB-T Tracking
    Zhu, Xu
    Liu, Jun
    Xiong, Xingzhong
    Luo, Zhongqiang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [50] Correlation Filters Based on Strong Spatio-Temporal for Robust RGB-T Tracking
    Luo, Futing
    Zhou, Mingliang
    Fang, Bing
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (03)