Learning Localization-Aware Target Confidence for Siamese Visual Tracking

被引:20
|
作者
Nie, Jiahao [1 ]
He, Zhiwei [1 ]
Yang, Yuxiang [2 ]
Gao, Mingyu [1 ]
Dong, Zhekang [3 ]
机构
[1] Hangzhou Dianzi Univ, Sch Elect Informat, Hangzhou 310018, Peoples R China
[2] Univ Sci & Technol China, Sch Control Sci & Engn, Hefei 230052, Peoples R China
[3] Zhejiang Univ, Sch Elect Engn, Hangzhou 310058, Peoples R China
基金
中国国家自然科学基金;
关键词
Target tracking; Task analysis; Feature extraction; Training; Location awareness; Visualization; Smoothing methods; Localization-aware components; Siamese tracking paradigm; task misalignment; OBJECT TRACKING;
D O I
10.1109/TMM.2022.3206668
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Siamese tracking paradigm has achieved great success, providing effective appearance discrimination and size estimation by classification and regression. While such a paradigm typically optimizes the classification and regression independently, leading to task misalignment (accurate prediction boxes have no high target confidence scores). In this paper, to alleviate this misalignment, we propose a novel tracking paradigm, called SiamLA. Within this paradigm, a series of simple, yet effective localization-aware components are introduced to generate localization-aware target confidence scores. Specifically, with the proposed localization-aware dynamic label (LADL) loss and localization-aware label smoothing (LALS) strategy, collaborative optimization between the classification and regression is achieved, enabling classification scores to be aware of location state, not just appearance similarity. Besides, we propose a separate localization-aware quality prediction (LAQP) branch to produce location quality scores to further modify the classification scores. To guide a more reliable modification, a novel localization-aware feature aggregation (LAFA) module is designed and embedded into this branch. Consequently, the resulting target confidence scores are more discriminative for the location state, allowing accurate prediction boxes tend to be predicted as high scores. Extensive experiments are conducted on six challenging benchmarks, including GOT10 k, TrackingNet, LaSOT, TNL2K, OTB100 and VOT2018. Our SiamLA achieves competitive performance in terms of both accuracy and efficiency. Furthermore, a stability analysis reveals that our tracking paradigm is relatively stable, implying that the paradigm is potential for real-world applications.
引用
收藏
页码:6194 / 6206
页数:13
相关论文
共 50 条
  • [41] SiamCross: Siamese Cross Object-Aware Networks for Visual Object Tracking
    Huang W.-H.
    Feng Y.
    Qiang B.-H.
    Pei Y.-X.
    Luo Y.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (10): : 2151 - 2166
  • [42] A location-aware siamese network for high-speed visual tracking
    Zhou, Lifang
    Ding, Xiang
    Li, Weisheng
    Leng, Jiaxu
    Lei, Bangjun
    Yang, Weibin
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4431 - 4447
  • [43] Robust adaptive learning with Siamese network architecture for visual tracking
    Wancheng Zhang
    Yongzhao Du
    Zhi Chen
    Jianhua Deng
    Peizhong Liu
    The Visual Computer, 2021, 37 : 881 - 894
  • [44] Relation-aware Siamese region proposal network for visual object tracking
    Jiaming Zhu
    Guopeng Zhang
    Shibin Zhou
    Kun Li
    Multimedia Tools and Applications, 2021, 80 : 15469 - 15485
  • [45] Relation-aware Siamese region proposal network for visual object tracking
    Zhu, Jiaming
    Zhang, Guopeng
    Zhou, Shibin
    Li, Kun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (10) : 15469 - 15485
  • [46] An IoU-aware Siamese network for real-time visual tracking
    Wei, Bingbing
    Chen, Hongyu
    Cao, Siqi
    Ding, Qinghai
    Luo, Haibo
    NEUROCOMPUTING, 2023, 527 : 13 - 26
  • [47] Siamada: visual tracking based on Siamese adaptive learning network
    Xin Lu
    Fusheng Li
    Wanqi Yang
    Neural Computing and Applications, 2024, 36 : 7639 - 7656
  • [48] Robust adaptive learning with Siamese network architecture for visual tracking
    Zhang, Wancheng
    Du, Yongzhao
    Chen, Zhi
    Deng, Jianhua
    Liu, Peizhong
    VISUAL COMPUTER, 2021, 37 (05): : 881 - 894
  • [49] Siamada: visual tracking based on Siamese adaptive learning network
    Lu, Xin
    Li, Fusheng
    Yang, Wanqi
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (14): : 7639 - 7656
  • [50] LEARNING CASCADED SIAMESE NETWORKS FOR HIGH PERFORMANCE VISUAL TRACKING
    Gao, Peng
    Ma, Yipeng
    Yuan, Ruyue
    Xiao, Liyi
    Wang, Fei
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3078 - 3082