End-to-end deep metric network for visual tracking

被引:0
|
作者
Shengjing Tian
Shuwei Shen
Guoqiang Tian
Xiuping Liu
Baocai Yin
机构
[1] Dalian University of Technology,School of Mathematical Sciences
[2] Dalian University of Technology,Faculty of Electronic Information and Electrical Engineering
来源
The Visual Computer | 2020年 / 36卷
关键词
Metric learning; Visual tracking; Deep neural networks; One-shot learning;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose an end-to-end deep metric network (DMN) for visual tracking, where any target can be accurately tracked given only a bounding box of the first frame. Our main motivation is to make the network learn to learn a deep distance metric by following the philosophy of one-shot learning. Instead of utilizing a hand-crafted distance metric like Euclidean distance, our DMN focuses on providing a learnable metric, which is more robust to appearance variations. Furthermore, we are the first to properly combine mean square errors and contrastive loss into a joint loss function for back-propagation. During online tracking, DMN firstly applies our instance initialization for obtaining sequence-specific information and then straightforwardly tracks the target without the help of box refinement, occlusion detection and online updating. The final tracking score considers both our DMN scalar output and the constrain of motion smoothness. Ablation analyses are carried out to validate the effectiveness of our proposed method. And experiments on the prevalent benchmarks show that our method can achieve a competitive performance when compared with some representative trackers, especially those existing metric learning-based algorithms.
引用
收藏
页码:1219 / 1232
页数:13
相关论文
共 50 条
  • [41] An End-to-End Deep Generative Network for Low Bitrate Image Coding
    Pei, Yifei
    Liu, Ying
    Ling, Nam
    Ren, Yongxiong
    Liu, Lingzhi
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [42] Low Rank Based End-to-End Deep Neural Network Compression
    Jain, Swayambhoo
    Hamidi-Rad, Shahab
    Racape, Fabien
    2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 233 - 242
  • [43] Absorption Attenuation Compensation Using an End-to-End Deep Neural Network
    Zhou, Chen
    Wang, Shoudong
    Wang, Zixu
    Cheng, Wanli
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [44] An End-To-End Hyperbolic Deep Graph Convolutional Neural Network Framework
    Zhou, Yuchen
    Huo, Hongtao
    Hou, Zhiwen
    Bu, Lingbin
    Wang, Yifan
    Mao, Jingyi
    Lv, Xiaojun
    Bu, Fanliang
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 139 (01): : 537 - 563
  • [45] FOSNet: An End-to-End Trainable Deep Neural Network for Scene Recognition
    Seong, Hongje
    Hyun, Junhyuk
    Kim, Euntai
    IEEE ACCESS, 2020, 8 (08) : 82066 - 82077
  • [46] Deep Reinforcement Learning for End-to-End Network Slicing: Challenges and Solutions
    Liu, Qiang
    Choi, Nakjung
    Han, Tao
    IEEE NETWORK, 2023, 37 (02): : 222 - 228
  • [47] Local-CycleGAN: a general end-to-end network for visual enhancement in complex deep-water environment
    Zong, Xianhui
    Chen, Zhehan
    Wang, Dadong
    APPLIED INTELLIGENCE, 2021, 51 (04) : 1947 - 1958
  • [48] Local-CycleGAN: a general end-to-end network for visual enhancement in complex deep-water environment
    Xianhui Zong
    Zhehan Chen
    Dadong Wang
    Applied Intelligence, 2021, 51 : 1947 - 1958
  • [49] DeconNet: End-to-End Decontaminated Network for Vision-Based Aerial Tracking
    Zuo, Haobo
    Fu, Changhong
    Li, Sihang
    Ye, Junjie
    Zheng, Guangze
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [50] END-TO-END DEEP MULTIMODAL CLUSTERING
    Zhang, Xianchao
    Mu, Jie
    Zong, Linlin
    Yang, Xiaochun
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,