Frequency and Spatial Domain Filter Network for Visual Object Tracking

被引:0
|
作者
Zhao, Manqi [1 ,2 ,3 ]
Li, Shenyang [1 ,2 ,3 ]
Wang, Han [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Key Lab Space Utilizat, Beijing 100094, Peoples R China
[2] Chinese Acad Sci, Technol & Engn Ctr Space Utilizat, Beijing 100094, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Visual tracking; Frequency filter; Global context;
D O I
10.1007/978-981-99-8537-1_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-correlation serves as the core similarity calculation operation in Siamese-based trackers, and generally produces response maps with high values at the target center. During this process, global context, including boundary and surrounding background of the target, which is conducive to target localization and bounding box regression, has been overlooked. In this work, we propose a Frequency and Spatial domain Filter Network (FSFNet) for visual object tracking, which exploits abundant global context in the frequency domain and enhances target representation in the spatial domain. First, frequency filters generated from template and search patches are applied to the target, capturing and enhancing valuable frequency components. These enhanced frequency components describe the global regions of interest in the spatial domain. Second, spatial domain convolutions are adopted to highlight local details of the target. Compared with mechanisms including depth-wise correlation, pixel-wise correlation, and transformer, our method provides more accurate tracking results. Experiments on five benchmarks show that our tracker obtains competitive results. For example, our tracker achieves an AUC score of 81.2% on TrackingNet, outperforming the state-of-the-art two-stream tracker TrDiMP by 2.8% while running at 50 FPS.
引用
收藏
页码:108 / 120
页数:13
相关论文
共 50 条
  • [1] CFNN: Correlation Filter Neural Network for Visual Object Tracking
    Li, Yang
    Xu, Zhan
    Zhu, Jianke
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2222 - 2229
  • [2] Learning Spatial-Frequency Transformer for Visual Object Tracking
    Tang, Chuanming
    Wang, Xiao
    Bai, Yuanchao
    Wu, Zhe
    Zhang, Jianlin
    Huang, Yongmei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 5102 - 5116
  • [3] Deep Spatial and Temporal Network for Robust Visual Object Tracking
    Teng, Zhu
    Xing, Junliang
    Wang, Qiang
    Zhang, Baopeng
    Fan, Jianping
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1762 - 1775
  • [4] Small dim object tracking using frequency and spatial domain information
    Ahmadi, Kaveh
    Salari, Ezzatollah
    [J]. PATTERN RECOGNITION, 2016, 58 : 227 - 234
  • [5] DomainSiam: Domain-Aware Siamese Network for Visual Object Tracking
    Abdelpakey, Mohamed H.
    Shehata, Mohamed S.
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 45 - 58
  • [6] Visual Object Tracking Using Particle Filter
    Hossain, Kabir
    Lee, Chi-Woo
    [J]. 2012 9TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAL), 2012, : 98 - 102
  • [7] Chaotic particle filter for visual object tracking
    Firouznia, Marjan
    Faez, Karim
    Amindavar, Hamidreza
    Koupaei, Javad Alikhani
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 53 : 1 - 12
  • [8] Channel and spatial attention-based Siamese network for visual object tracking
    Tian, Shishun
    Chen, Zixi
    Chen, Bolin
    Zou, Wenbin
    Li, Xia
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
  • [9] Object Relocation Visual Tracking Based on Histogram Filter and Siamese Network in Intelligent Transportation
    Zhang, Jianlong
    Liu, Yifan
    Li, Qiao
    He, Ci
    Wang, Bin
    Wang, Tianhong
    [J]. SENSORS, 2022, 22 (22)
  • [10] OPTIMIZED SPATIAL MATCHING FOR VISUAL OBJECT TRACKING
    Wang, Fuxiang
    Mei, Qing
    Liu, Xuhui
    Xiao, Yao
    [J]. 2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,