Deep Learning in Visual Tracking: A Review

被引:32
|
作者
Jiao, Licheng [1 ,2 ]
Wang, Dan [1 ,2 ]
Bai, Yidong [3 ,4 ]
Chen, Puhua [1 ,2 ]
Liu, Fang [1 ,2 ]
机构
[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China
[2] Xidian Univ, Sch Artificial Intelligence, Joint Int Res Lab Intelligent Percept & Computat, Xian 710071, Peoples R China
[3] Xidian Univ, Sch Artificial Intelligence, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Peoples R China
[4] Waseda Univ, Intelligent Software Lab, Tokyo 1698555, Japan
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Visualization; Target tracking; Task analysis; Feature extraction; Deep learning; Trajectory; Nonhomogeneous media; Deep learning (DL); multiple-object tracking (MOT); single-object tracking (SOT); MULTIPLE OBJECT TRACKING; CORRELATION FILTERS; NEURAL-NETWORKS; ROBUST; MULTITARGET; SYSTEM;
D O I
10.1109/TNNLS.2021.3136907
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning (DL) has made breakthroughs in many computer vision tasks and also in visual tracking. From the beginning of the research on the automatic acquisition of high abstract feature representation, DL has gone deep into all aspects of tracking to date, to name a few, similarity metric, data association, and bounding box estimation. Also, pure DL-based trackers have obtained the state-of-the-art performance after the community's constant research. We believe that it is time to comprehensively review the development of DL research in visual tracking. In this article, we overview the critical improvements brought to the field by DL: deep feature representations, network architecture, and four crucial issues in visual tracking (spatiotemporal information integration, target-specific classification, target information update, and bounding box estimation). The scope of the survey of DL-based tracking covers two primary subtasks for the first time, single-object tracking and multiple-object tracking. Also, we analyze the performance of DL-based approaches and give meaningful conclusions. Finally, we provide several promising directions and tasks in visual tracking and relevant fields.
引用
收藏
页码:5497 / 5516
页数:20
相关论文
共 50 条
  • [31] Visual Vehicle Tracking Based on Deep Representation and Semisupervised Learning
    Cai, Yingfeng
    Wang, Hai
    Sun, Xiao-qiang
    Chen, Long
    JOURNAL OF SENSORS, 2017, 2017
  • [32] Learning deep convolutional descriptor aggregation for efficient visual tracking
    Ke, Xiao
    Li, Yuezhou
    Guo, Wenzhong
    Huang, Yanyan
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (05): : 3745 - 3765
  • [33] A Robust Visual Tracking Method through Deep Learning Features
    Xu, Jia-zhen
    Zuo, Ming-zhang
    Yang, Lin
    Huang, Lei
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: TECHNIQUES AND APPLICATIONS, AITA 2016, 2016, : 159 - 164
  • [34] Method For Learning Deep Features For Correlation Based Visual Tracking
    Gundogdu, Erhan
    Alatan, A. Aydin
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [35] DSNet: Deep and Shallow Feature Learning for Efficient Visual Tracking
    Wu, Qiangqiang
    Yan, Yan
    Liang, Yanjie
    Liu, Yi
    Wang, Hanzi
    COMPUTER VISION - ACCV 2018, PT V, 2019, 11365 : 119 - 134
  • [36] Deep learning of spatio-temporal information for visual tracking
    Choe, Gwangmin
    Son, Ilmyong
    Choe, Chunhwa
    So, Hyoson
    Kim, Hyokchol
    Choe, Gyongnam
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (12) : 17283 - 17302
  • [37] Visual Object Tracking in Drone Images with Deep Reinforcement Learning
    Gozen, Derya
    Ozer, Sedat
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10082 - 10089
  • [38] Robust visual tracking based on scale invariance and deep learning
    Nan Ren
    Junping Du
    Suguo Zhu
    Linghui Li
    Dan Fan
    JangMyung Lee
    Frontiers of Computer Science, 2017, 11 : 230 - 242
  • [39] Visual Tracking by means of Deep Reinforcement Learning and an Expert Demonstrator
    Dunnhofer, Matteo
    Martinel, Niki
    Foresti, Gian Luca
    Micheloni, Christian
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2290 - 2299
  • [40] A Review of Deep Learning-Based Visual Multi-Object Tracking Algorithms for Autonomous Driving
    Guo, Shuman
    Wang, Shichang
    Yang, Zhenzhong
    Wang, Lijun
    Zhang, Huawei
    Guo, Pengyan
    Gao, Yuguo
    Guo, Junkai
    APPLIED SCIENCES-BASEL, 2022, 12 (21):