LGTrack: Exploiting Local and Global Properties for Robust Visual Tracking

被引:2
|
作者
Liu, Chang [1 ,2 ]
Zhao, Jie [1 ]
Bo, Chunjuan [2 ,3 ]
Li, Shengming [4 ]
Wang, Dong [1 ,2 ]
Lu, Huchuan [1 ,2 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
[2] Dalian Univ Technol, Ningbo Inst, Ningbo 315016, Peoples R China
[3] Dalian Minzu Univ, Sch Informat & Commun Engn, Dalian 116600, Peoples R China
[4] Dalian Univ Technol, Sch Innovat & Entrepreneurship, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
Object tracking; visual tracking; long-term tracking; re-detection;
D O I
10.1109/TCSVT.2024.3390054
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Re-detection is a necessary capability for long-term tracking. Target candidate proposals in the whole image can provide a chance of tracking reset when tracking fails due to tracking drift or target invisibility. In this paper, we propose a unified local-global tracker based on the same transformer architecture sharing weights, which can not only search in a continuous local region but also provide target candidates of the global image in every frame. The requirements of both long-term and short-term scenarios can be addressed using a unified model. A simple proposal selection scheme is adopted to properly select the candidate proposals of re-detection, to assist tracking and obtain better performance. The scheme performs re-evaluation of all high-quality proposals based on a transformer-based embedding network, once the predicted state of the local tracking is not sufficient to be accurate. To capture appearance variations brought by online updates in minimum risks, a long-term-friendly dynamic template update scheme is also designed. Extensive experiments are conducted to demonstrate the effectiveness of our proposed tracker, including three short-term tracking benchmarks and six long-term benchmarks. Our tracker can achieve results comparable to that of the state-of-the-art. The proposed tracker can also work well in balancing the performance and speed, achieving an average speed of approximately 25 fps tested on LaSOT testing set.
引用
收藏
页码:8161 / 8171
页数:11
相关论文
共 50 条
  • [41] Exploiting structural constraints for visual object tracking
    Bouachir, Wassim
    Bilodeau, Guillaume-Alexandre
    IMAGE AND VISION COMPUTING, 2015, 43 : 39 - 49
  • [42] Exploiting Competition Relationship for Robust Visual Recognition
    Du, Liang
    Ling, Haibin
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2746 - 2752
  • [43] Pose Tracking by Efficiently Exploiting Global Features
    Kumar, Ratnesh
    Batra, Dhruv
    2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,
  • [44] Regressing Local to Global Shape Properties for Online Segmentation and Tracking
    Ren, Carl Yuheng
    Prisacariu, Victor Adrian
    Reid, Ian
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [45] Regressing Local to Global Shape Properties for Online Segmentation and Tracking
    Carl Yuheng Ren
    Victor Prisacariu
    Ian Reid
    International Journal of Computer Vision, 2014, 106 : 269 - 281
  • [46] Regressing Local to Global Shape Properties for Online Segmentation and Tracking
    Ren, Carl Yuheng
    Prisacariu, Victor
    Reid, Ian
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 106 (03) : 269 - 281
  • [47] Leveraging Local and Global Cues for Visual Tracking via Parallel Interaction Network
    Zheng, Yaozong
    Zhong, Bineng
    Liang, Qihua
    Tang, Zhenjun
    Ji, Rongrong
    Li, Xianxian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1671 - 1683
  • [48] Exploiting local STFT properties
    Gaston, KL
    Nelson, DJ
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 97 - 100
  • [49] Online Multi-Scale Classification and Global Feature Modulation for Robust Visual Tracking
    Gao, Qi
    Yin, Mingfeng
    Wu, Xiang
    Liu, Di
    Bo, Yuming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5321 - 5334
  • [50] Robust Visual Tracking Using Local Salient Coding and PCA Sub space Modeling
    Lin, Dajun
    Zheng, Huicheng
    Ma, Donghong
    PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS'13), 2013, : 25 - 30