Frequency and Spatial Domain Filter Network for Visual Object Tracking

Authors
Zhao, Manqi [1, 2, 3]
Li, Shenyang [1, 2, 3]
Wang, Han [1, 2, 3]
Affiliations
[1] Chinese Academy of Sciences, Key Laboratory of Space Utilization, Beijing 100094, People's Republic of China
[2] Chinese Academy of Sciences, Technology and Engineering Center for Space Utilization, Beijing 100094, People's Republic of China
[3] University of Chinese Academy of Sciences, Beijing 100049, People's Republic of China
Keywords
Visual tracking; Frequency filter; Global context
DOI
10.1007/978-981-99-8537-1_9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Cross-correlation serves as the core similarity calculation operation in Siamese-based trackers, and generally produces response maps with high values at the target center. During this process, global context, including boundary and surrounding background of the target, which is conducive to target localization and bounding box regression, has been overlooked. In this work, we propose a Frequency and Spatial domain Filter Network (FSFNet) for visual object tracking, which exploits abundant global context in the frequency domain and enhances target representation in the spatial domain. First, frequency filters generated from template and search patches are applied to the target, capturing and enhancing valuable frequency components. These enhanced frequency components describe the global regions of interest in the spatial domain. Second, spatial domain convolutions are adopted to highlight local details of the target. Compared with mechanisms including depth-wise correlation, pixel-wise correlation, and transformer, our method provides more accurate tracking results. Experiments on five benchmarks show that our tracker obtains competitive results. For example, our tracker achieves an AUC score of 81.2% on TrackingNet, outperforming the state-of-the-art two-stream tracker TrDiMP by 2.8% while running at 50 FPS.
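As a rough illustration of the frequency-domain idea in the abstract, the NumPy sketch below filters a feature map by element-wise multiplication of spectra. By the correlation theorem, this corresponds to a circular cross-correlation in the spatial domain, so a filter whose weights are set freely per frequency acts like a kernel as large as the feature map itself. This is not FSFNet: the abstract does not say how the filters are generated, so the function names and the choice to derive the filter from a zero-padded template are assumptions made only for illustration.

```python
import numpy as np

def template_to_weights(template, size):
    # Hypothetical filter generator (an assumption, not the paper's method):
    # zero-pad the (C, h, w) template to `size` and take the conjugate spectrum.
    return np.conj(np.fft.fft2(template, s=size))

def frequency_filter(features, weights):
    # Element-wise product in the frequency domain. With the weights above,
    # this equals a circular cross-correlation over the whole feature map.
    spectrum = np.fft.fft2(features)  # FFT over the last two axes
    return np.fft.ifft2(spectrum * weights).real

def depthwise_correlation(search, template):
    # Direct per-channel sliding-window correlation, kept for comparison.
    c, hs, ws = search.shape
    _, ht, wt = template.shape
    out = np.zeros((c, hs - ht + 1, ws - wt + 1))
    for i in range(out.shape[1]):
        for j in range(out.shape[2]):
            window = search[:, i:i + ht, j:j + wt]
            out[:, i, j] = (window * template).sum(axis=(1, 2))
    return out

# Toy (C, H, W) feature maps standing in for backbone features.
rng = np.random.default_rng(0)
search = rng.standard_normal((2, 8, 8))
template = rng.standard_normal((2, 3, 3))

freq_response = frequency_filter(
    search, template_to_weights(template, search.shape[-2:])
)
direct_response = depthwise_correlation(search, template)
```

In this toy setting the top-left 6 x 6 region of `freq_response` matches `direct_response`, because those positions involve no wrap-around. The remaining positions wrap around the border, which is a property of the circular formulation rather than anything stated in the abstract.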
Pages: 108-120
Page count: 13