Visual and Language Collaborative Learning for RGBT Object Tracking

被引:0
|
作者
Wang, Jiahao [1 ]
Liu, Fang [1 ]
Jiao, Licheng [1 ]
Gao, Yingjia [1 ]
Wang, Hao [1 ]
Li, Shuo [1 ]
Li, Lingling [1 ]
Chen, Puhua [1 ]
Liu, Xu [1 ]
机构
[1] Xidian University, Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education, International Research Center for Intelligent Perception and Computation, Joint International Research Laboratory of Intelligent Perception and Comp
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Benchmarking - Clutter (information theory) - Infrared imaging - Job analysis - Object recognition - Target tracking - Timing circuits - Visual languages;
D O I
10.1109/TCSVT.2024.3436878
中图分类号
学科分类号
摘要
Despite the extensive research on RGBT object tracking, there are still several challenges and issues in practical applications, such as modality differences, lighting variations and disappearance of the target, and changes in viewpoint. Existing methods mostly address these issues by fusing image features, while neglecting a significant amount of target label information. To address these challenges, this paper introduces text to drive the alignment of visible and infrared image features, transforming features from different modalities into the same feature space and fully using complementary features between different modalities. Furthermore, inspired by the success of prompt learning in various tasks, we utilize prior boxes and language as prompts to further guide the model in tracking the target. Extensive experiments demonstrate that the proposed VLCTrack tracker has excellent potential in RGBT object tracking. Compared to previous methods developed for this purpose, our approach achieves state-of-the-art performance on three benchmark datasets. © 1991-2012 IEEE.
引用
收藏
页码:12770 / 12781
相关论文
共 50 条
  • [1] Exploring fusion strategies for accurate RGBT visual object tracking
    Tang, Zhangyong
    Xu, Tianyang
    Li, Hui
    Wu, Xiao-Jun
    Zhu, XueFeng
    Kittler, Josef
    INFORMATION FUSION, 2023, 99
  • [2] Trans-RGBT:RGBT Object Tracking with Transformer
    Wanjun, Liu
    Linlin, Liang
    Haicheng, Qu
    Computer Engineering and Applications, 2024, 60 (11) : 84 - 94
  • [3] Collaborative strategy for visual object tracking
    Yang, Yongquan
    Chen, Ning
    Jiang, Shenlu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (06) : 7283 - 7303
  • [4] Collaborative strategy for visual object tracking
    Yongquan Yang
    Ning Chen
    Shenlu Jiang
    Multimedia Tools and Applications, 2018, 77 : 7283 - 7303
  • [5] Specific and Collaborative Representations Siamese Network for RGBT Tracking
    Liu, Yisong
    Zhou, Dongming
    Cao, Jinde
    Yan, Kaixiang
    Geng, Lizhi
    IEEE SENSORS JOURNAL, 2024, 24 (11) : 18520 - 18534
  • [6] Learning Collaborative Model for Visual Tracking
    Ma, Ding
    Bu, Wei
    Cui, Yuehua
    Xie, Yuying
    Wu, Xiangqian
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2582 - 2587
  • [7] Visual object tracking via collaborative correlation filters
    Lu, Xiaohuan
    Li, Jing
    He, Zhenyu
    Liu, Wei
    You, Lei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2020, 14 (01) : 177 - 185
  • [8] Object tracking with collaborative extreme learning machines
    Kuang, Haipeng
    Xun, Liang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (7-8) : 4965 - 4988
  • [9] Object tracking with collaborative extreme learning machines
    Haipeng Kuang
    Liang Xun
    Multimedia Tools and Applications, 2020, 79 : 4965 - 4988
  • [10] Collaborative Visual Object Tracking via Hierarchical Structure
    Tu, Fangwen
    Ge, Shuzhi Sam
    Suryadi, Henry Pratama
    Tang, Yazhe
    Hang, Chang Chieh
    SOCIAL ROBOTICS, (ICSR 2016), 2016, 9979 : 413 - 421