Robust Visual Tracking With a Novel Segmented Fine-grained Regularization

被引:0
|
作者
An Z.-Y. [1 ]
Liang S.-K. [1 ]
Li B. [1 ]
Zhao F. [1 ]
Dou Q.-S. [1 ]
Xiang Z.-L. [1 ]
机构
[1] School of Computer Science and Technology, Shandong Technology and Business University, Yantai
来源
基金
中国国家自然科学基金;
关键词
fine-grained regularization; group lasso; Siamese network; Visual tracking;
D O I
10.16383/j.aas.c220544
中图分类号
学科分类号
摘要
Most of the Siamese network tracking algorithms use L2 regularization in the training stage, while ignoring the hierarchy and characteristic of the network architecture. As a result, such trackers have poor robustness. With this insight, we propose a segmented fine-grained regularization tracking (SFGRT) algorithm, which divides the regularization of Siamese network into three fine-grained levels, namely filter level, channel level and shape level. Then we creatively build a segmented fine-grained regularization model that constructs penalty functions based on group lasso, which combines with different levels of granularity to improve generalization ability and robustness. In addition, aiming at the imbalance of gradient magnitude of each penalty function, our approach constructs a gradient self-balancing optimization function to adaptively optimize the coefficients of each penalty function. Finally, ablation study on VOT2019 show that compared with the baseline algorithm SiamRPN++, our approach achieves relative gains of 7.1% and 1.7% in terms of robustness and expected average overlap (EAO) metrics, respectively. It means that the robustness of our tracker is significantly enhanced over baseline tracker since the smaller the robustness metrics, the better. Extensive experiments based on VOT2018, VOT2019, UAV123 and LaSOT show that the proposed algorithm has better robustness and tracking performance than related state-of-the-art methods. © 2023 Science Press. All rights reserved.
引用
收藏
页码:1116 / 1130
页数:14
相关论文
共 63 条
  • [1] Xing D T, Evangeliou N, Tsoukalas A, Tzes A., Siamese transformer pyramid networks for real-time UAV tracking, Proceedings of IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp. 1898-1907, (2022)
  • [2] Fang L P, Liang N X, Kang W X, Wang Z Y, Feng D D., Real-time hand posture recognition using hand geometric features and fisher vector, Signal Processing: Image Communication, 82, (2020)
  • [3] Ballester I, Fontan A, Civera J, Strobl K H, Triebel R., DOT: Dynamic object tracking for visual SLAM, Proceedings of IEEE International Conference on Robotics and Automation (ICRA), pp. 11705-11711, (2021)
  • [4] Wu Y, Lim J, Yang M H., Object tracking benchmark, IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 9, pp. 1834-1848, (2015)
  • [5] Tang M, Yu B, Zhang F, Wang J Q., High-speed tracking with multi-kernel correlation filters, Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4874-4883, (2018)
  • [6] Sun Y X, Sun C, Wang D, He Y, Lu H C., ROI pooled correlation filters for visual tracking, Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5783-5791, (2019)
  • [7] Qiu Zhu-Ling, Zha Yu-Fei, Wu Min, Wang Qing, Learning attentional regularized correlation filter for visual tracking, Acta Electronica Sinica, 48, 9, pp. 1762-1768, (2020)
  • [8] Zhu Jian-Zhang, Wang Dong, Lu Hu-Chuan, Learning temporal-spatial consistency correlation filter for visual tracking, Scientia Sinica Informationis, 50, 1, pp. 128-150, (2020)
  • [9] Hu H W, Ma B, Shen J B, Shao L., Manifold regularized correlation object tracking, IEEE Transactions on Neural Networks and Learning Systems, 29, 5, pp. 1786-1795, (2018)
  • [10] Xu T Y, Feng Z H, Wu X J, Kittler J., Joint group feature selection and discriminative filter learning for robust visual object tracking, Proceedings of IEEE/CVF International Conference on Computer Vision (ICCV), pp. 7949-7959, (2019)