Learning Multi-domain Convolutional Network for RGB-T Visual Tracking

被引:0
|
作者
Zhang, Xingming [1 ]
Zhang, Xuehan [1 ]
Du, Xuedan [1 ]
Zhou, Xiangming [1 ]
Yin, Jun [1 ]
机构
[1] Zhejiang Dahua Technol Co Ltd, Hangzhou, Zhejiang, Peoples R China
关键词
Tracking; Convolutional neural network; Thermal infrared;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Object tracking is one of the challenging problems in the field of computer vision. Affected by the unstructured environments, for example, the occlusion, noise, and light, These factors can affect the appearance of the specific object and result in failures when tracking specific objects. To address this issue, we propose a novel visual tracking method based on multimodal convolutional network learning. Our framework adopts a parallel structure, which consists of two shallow convolutional neural networks. First, the parallel network is used to draw the different features of the RGB-T (RGB and thermal) data separately. Second, this two kind of features are mixed together and finally the mixed feature is sent to domain-specific layers for binary classification and identification of the targets. We perform comprehensive experiments on RGBT234 visual data and the results prove that the proposed visual tracking method improves the effects significantly through the use of multi-modal features, which illustrates that our method is competitive in performances against with the state-of-the-art tracking algorithms.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Learning Multi-Domain Convolutional Neural Networks for Visual Tracking
    Nam, Hyeonseob
    Han, Bohyung
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4293 - 4302
  • [2] Learning a Twofold Siamese Network for RGB-T Object Tracking
    Kuai, Yangliu
    Li, Dongdong
    Qian, Que
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (05)
  • [3] Branch-Activated Multi-Domain Convolutional Neural Network for Visual Tracking
    陈一民
    陆蓉蓉
    邹一波
    张燕辉
    [J]. Journal of Shanghai Jiaotong University(Science), 2018, 23 (03) : 360 - 367
  • [4] Branch-Activated Multi-Domain Convolutional Neural Network for Visual Tracking
    Chen Y.
    Lu R.
    Zou Y.
    Zhang Y.
    [J]. Journal of Shanghai Jiaotong University (Science), 2018, 23 (3) : 360 - 367
  • [5] Region Selective Fusion Network for Robust RGB-T Tracking
    Yu, Zhencheng
    Fan, Huijie
    Wang, Qiang
    Li, Ziwan
    Tang, Yandong
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1357 - 1361
  • [6] Multi-scale Region Proposal Network Trained by Multi-domain Learning for Visual Object Tracking
    Fang, Yang
    Ko, Seunghyun
    Jo, Geun-Sik
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 330 - 339
  • [7] Learning cross-modal interaction for RGB-T tracking
    Chunyan Xu
    Zhen Cui
    Chaoqun Wang
    Chuanwei Zhou
    Jian Yang
    [J]. Science China Information Sciences, 2023, 66
  • [8] Learning cross-modal interaction for RGB-T tracking
    Chunyan XU
    Zhen CUI
    Chaoqun WANG
    Chuanwei ZHOU
    Jian YANG
    [J]. Science China(Information Sciences), 2023, 66 (01) : 320 - 321
  • [9] Learning cross-modal interaction for RGB-T tracking
    Xu, Chunyan
    Cui, Zhen
    Wang, Chaoqun
    Zhou, Chuanwei
    Yang, Jian
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (01)
  • [10] Learning Local-Global Multi-Graph Descriptors for RGB-T Object Tracking
    Li, Chenglong
    Zhu, Chengli
    Zhang, Jian
    Luo, Bin
    Wu, Xiaohao
    Tang, Jin
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 2913 - 2926