Unsupervised Deep Representation Learning for Real-Time Tracking

Cited by: 77
Authors
Wang, Ning [1]
Zhou, Wengang [1, 2]
Song, Yibing [3]
Ma, Chao [4]
Liu, Wei [3]
Li, Houqiang [1, 2]
Affiliations
[1] Univ Sci & Technol China, CAS Key Lab GIPAS, Hefei, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China
[3] Tencent AI Lab, Shenzhen, Peoples R China
[4] Shanghai Jiao Tong Univ, AI Inst, MOE Key Lab Artificial Intelligence, Shanghai, Peoples R China
Keywords
Visual tracking; Unsupervised learning; Correlation filter; Siamese network; Correlation filters; Object tracking
DOI
10.1007/s11263-020-01357-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning models have continuously driven advances in visual tracking. These models are typically trained with supervised learning on expensive labeled data. To reduce the workload of manual annotation and learn to track arbitrary objects, we propose an unsupervised learning method for visual tracking. Our motivation is that a robust tracker should be effective in bidirectional tracking: it should be able to localize a target object forward through successive frames and then backtrace to the target's initial position in the first frame. Based on this motivation, during training we measure the consistency between the forward and backward trajectories to learn a robust tracker from scratch using only unlabeled videos. We build our framework on a Siamese correlation filter network and propose a multi-frame validation scheme and a cost-sensitive loss to facilitate unsupervised learning. Without bells and whistles, the proposed unsupervised tracker achieves the baseline accuracy of classic fully supervised trackers while running in real time. Furthermore, our unsupervised framework shows potential for leveraging more unlabeled or weakly labeled data to further improve tracking accuracy.
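The forward-backward consistency idea described in the abstract can be illustrated with a toy example. This is only a minimal sketch under strong assumptions: it uses a single-channel correlation filter on raw pixels with a circularly shifted patch, not the Siamese network from the paper, and it leaves out the multi-frame validation scheme and the cost-sensitive loss. The helper names (`gaussian_label`, `train_filter`, `respond`, `peak`) and the regularizer `lam` are hypothetical and chosen for illustration.

```python
import numpy as np

def gaussian_label(size, center, sigma=2.0):
    """2-D Gaussian response map peaked at `center` (row, col)."""
    rows, cols = np.mgrid[0:size, 0:size]
    d2 = (rows - center[0]) ** 2 + (cols - center[1]) ** 2
    return np.exp(-d2 / (2.0 * sigma ** 2))

def train_filter(patch, label, lam=1e-2):
    """Closed-form ridge-regression filter in the Fourier domain.

    The filter H is chosen so that H * FFT(patch) approximates FFT(label).
    `lam` is an assumed regularization weight.
    """
    X = np.fft.fft2(patch)
    Y = np.fft.fft2(label)
    return Y * np.conj(X) / (X * np.conj(X) + lam)

def respond(filt, patch):
    """Apply a learned filter to a patch and return the spatial response map."""
    return np.real(np.fft.ifft2(filt * np.fft.fft2(patch)))

def peak(response):
    """Location (row, col) of the maximum response."""
    idx = np.unravel_index(np.argmax(response), response.shape)
    return tuple(int(i) for i in idx)

# Toy data: the "search" frame is the template circularly shifted by (3, -2).
rng = np.random.default_rng(0)
size, center, shift = 32, (16, 16), (3, -2)
template = rng.standard_normal((size, size))
search = np.roll(template, shift, axis=(0, 1))

# Forward step: learn a filter on the template, localize the target in the search frame.
initial_label = gaussian_label(size, center)
forward_filter = train_filter(template, initial_label)
forward_response = respond(forward_filter, search)
forward_peak = peak(forward_response)

# Backward step: treat the forward response as a pseudo-label, learn a filter
# on the search frame, and track back to the template frame.
backward_filter = train_filter(search, forward_response)
backward_response = respond(backward_filter, template)
backward_peak = peak(backward_response)

# Consistency: the backward response should match the initial label.
consistency_loss = float(np.mean((backward_response - initial_label) ** 2))
print(forward_peak, backward_peak, consistency_loss)
```

In this toy setting the backward response should land back on the initial position, so the consistency loss stays near zero. In an unsupervised training loop, a loss of this kind would be backpropagated into the feature extractor, which this sketch does not include.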
Pages: 400-418
Number of pages: 19