Visual and Language Collaborative Learning for RGBT Object Tracking

Cited by: 0

Authors:

Wang, Jiahao [1]
Liu, Fang [1]
Jiao, Licheng [1]
Gao, Yingjia [1]
Wang, Hao [1]
Li, Shuo [1]
Li, Lingling [1]
Chen, Puhua [1]
Liu, Xu [1]

Affiliations:

[1] Xidian University, Key Laboratory of Intelligent Perception and Image Understanding, Ministry of Education, International Research Center for Intelligent Perception and Computation, Joint International Research Laboratory of Intelligent Perception and Comp

Source:

IEEE Transactions on Circuits and Systems for Video Technology | 2024 / Vol. 34 / No. 12

Funding:

China Postdoctoral Science Foundation; National Natural Science Foundation of China;

Keywords:

Benchmarking - Clutter (information theory) - Infrared imaging - Job analysis - Object recognition - Target tracking - Timing circuits - Visual languages;

DOI:

10.1109/TCSVT.2024.3436878

Abstract:

Despite the extensive research on RGBT object tracking, there are still several challenges and issues in practical applications, such as modality differences, lighting variations and disappearance of the target, and changes in viewpoint. Existing methods mostly address these issues by fusing image features, while neglecting a significant amount of target label information. To address these challenges, this paper introduces text to drive the alignment of visible and infrared image features, transforming features from different modalities into the same feature space and fully using complementary features between different modalities. Furthermore, inspired by the success of prompt learning in various tasks, we utilize prior boxes and language as prompts to further guide the model in tracking the target. Extensive experiments demonstrate that the proposed VLCTrack tracker has excellent potential in RGBT object tracking. Compared to previous methods developed for this purpose, our approach achieves state-of-the-art performance on three benchmark datasets. © 1991-2012 IEEE.


Pages: 12770-12781
