Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking

被引:98
|
作者
Li, Chenglong [1 ,2 ]
Zhu, Chengli [2 ]
Huang, Yan [1 ]
Tang, Jin [2 ]
Wang, Liang [1 ]
机构
[1] Chinese Acad Sci CASIA, Ctr Res Intelligent Percept & Comp CRIPAC, Inst Automat, Natl Lab Pattern Recognit NLPR, Beijing, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei, Peoples R China
来源
基金
中国国家自然科学基金; 北京市自然科学基金; 中国博士后科学基金;
关键词
Visual tracking; Information fusion; Manifold ranking; Soft cross-modality consistency; Label optimization;
D O I
10.1007/978-3-030-01261-8_49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the complementary benefits of visible (RGB) and thermal infrared (T) data, RGB-T object tracking attracts more and more attention recently for boosting the performance under adverse illumination conditions. Existing RGB-T tracking methods usually localize a target object with a bounding box, in which the trackers or detectors is often affected by the inclusion of background clutter. To address this problem, this paper presents a novel approach to suppress background effects for RGB-T tracking. Our approach relies on a novel cross-modal manifold ranking algorithm. First, we integrate the soft cross-modality consistency into the ranking model which allows the sparse inconsistency to account for the different properties between these two modalities. Second, we propose an optimal query learning method to handle label noises of queries. In particular, we introduce an intermediate variable to represent the optimal labels, and formulate it as a l(1)-optimization based sparse learning problem. Moreover, we propose a single unified optimization algorithm to solve the proposed model with stable and efficient convergence behavior. Finally, the ranking results are incorporated into the patch-based object features to address the background effects, and the structured SVM is then adopted to perform RGB-T tracking. Extensive experiments suggest that the proposed approach performs well against the state-of-the-art methods on large-scale benchmark datasets.
引用
收藏
页码:831 / 847
页数:17
相关论文
共 50 条
  • [1] Learning cross-modal interaction for RGB-T tracking
    Xu, Chunyan
    Cui, Zhen
    Wang, Chaoqun
    Zhou, Chuanwei
    Yang, Jian
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (01)
  • [2] Learning cross-modal interaction for RGB-T tracking
    Chunyan XU
    Zhen CUI
    Chaoqun WANG
    Chuanwei ZHOU
    Jian YANG
    [J]. Science China(Information Sciences), 2023, 66 (01) : 320 - 321
  • [3] Learning cross-modal interaction for RGB-T tracking
    Chunyan Xu
    Zhen Cui
    Chaoqun Wang
    Chuanwei Zhou
    Jian Yang
    [J]. Science China Information Sciences, 2023, 66
  • [4] Cross-Modal Pattern-Propagation for RGB-T Tracking
    Wang, Chaoqun
    Xu, Chunyan
    Cui, Zhen
    Zhou, Ling
    Zhang, Tong
    Zhang, Xiaoya
    Yang, Jian
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7062 - 7071
  • [5] Fast RGB-T Tracking via Cross-Modal Correlation Filters
    Zhai, Sulan
    Shao, Pengpeng
    Liang, Xinyan
    Wang, Xin
    [J]. NEUROCOMPUTING, 2019, 334 : 172 - 181
  • [6] Cross-modal collaborative propagation for RGB-T saliency detection
    Yu, Xiaosheng
    Pang, Yu
    Chi, Jianning
    Qi, Qi
    [J]. VISUAL COMPUTER, 2024, 40 (06): : 4337 - 4354
  • [7] CROSS-MODAL RETRIEVAL WITH NOISY LABELS
    Mandal, Devraj
    Biswas, Soma
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2326 - 2330
  • [8] ROBUST RGB-T TRACKING VIA CONSISTENCY REGULATED SCENE PERCEPTION
    Kang, Bin
    Liu, Liwei
    Zhao, Shihao
    Du, Songlin
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 510 - 514
  • [9] Asymmetric cross-modal activation network for RGB-T salient object detection
    Xu, Chang
    Li, Qingwu
    Zhou, Qingkai
    Jiang, Xiongbiao
    Yu, Dabing
    Zhou, Yaqin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [10] CrowdFusion: Refined Cross-Modal Fusion Network for RGB-T Crowd Counting
    Cai, Jialu
    Wang, Qing
    Jiang, Shengqin
    [J]. BIOMETRIC RECOGNITION, CCBR 2023, 2023, 14463 : 427 - 436