Spiking Transformers for Event-based Single Object Tracking

被引:44
|
作者
Zhang, Jiqing [1 ]
Dong, Bo [2 ]
Zhang, Haiwei [1 ]
Ding, Jianchuan [1 ]
Heide, Felix [2 ]
Yin, Baocai [1 ]
Yang, Xin [1 ]
机构
[1] Dalian Univ Technol, Dalian, Peoples R China
[2] Princeton Univ, Princeton, NJ 08544 USA
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.00860
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Event-based cameras bring a unique capability to tracking, being able to fiarction in challenging real-world conditions as a direct result of their high temporal resolution and high dynamic range. These imagers capture events asynchronously that encode rich temporal and spatial information. However, effectively extracting this information from events remains an open challenge. In this work, we propose a spiking transformer network, STNet, for single object tracking. STNet dynamically extracts and fuses information from both temporal and spatial domains. In particular, the proposed architecture features a transformer module to provide global spatial information and a spiking neural network (SNN) module for extracting temporal cues. The spiking threshold of the SNN module is dynamically adjusted based on the statistical cues of the spatial information, which we find essential in providing robust SNN features. We fuse both feature branches dynamically with a novel cross-domain attention fusion algorithm. Extensive experiments on three event-based datasets, FE240hz, EED and VisEvent validate that the proposed STNet outperforms existing state-of-the-art methods in both tracking accuracy and speed with a significant margin. The code and pretrained models are at https: //github.com/Jee-King/CVPR2022_STNet.
引用
收藏
页码:8791 / 8800
页数:10
相关论文
共 50 条
  • [21] A Universal Event-Based Plug-In Module for Visual Object Tracking in Degraded Conditions
    Zhang, Jiqing
    Dong, Bo
    Fu, Yingkai
    Wang, Yuanchen
    Wei, Xiaopeng
    Yin, Baocai
    Yang, Xin
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (05) : 1857 - 1879
  • [22] A Reconfigurable Architecture for Real-time Event-based Multi-Object Tracking
    Gao, Yizhao
    Wang, Song
    So, Hayden Kwok-Hay
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (04)
  • [23] Event-based multimedia object scheduling algorithm
    Yun, MH
    Kim, SJ
    Kim, HN
    [J]. 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS: BROADBAND CONVERGENCE NETWORK INFRASTRUCTURE, 2004, : 735 - 740
  • [24] Maritime Object Detection with Event-Based Cameras
    Sharghi, Elan
    Rodriguez, Jacob
    Mauger, Justin
    Jaszewski, Martin
    Parameswaran, Shibin
    [J]. 2022 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP, AIPR, 2022,
  • [25] Event-Based Trajectory Prediction Using Spiking Neural Networks
    Debat, Guillaume
    Chauhan, Tushar
    Cottereau, Benoit R.
    Masquelier, Timothee
    Paindavoine, Michel
    Baures, Robin
    [J]. FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2021, 15
  • [26] An Event-based Hierarchy Model for Object Recognition
    Nan, Ying
    Xiao, Rong
    Gao, Shaobing
    Yan, Rui
    [J]. 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2342 - 2347
  • [27] Polar Loss for Event-Based Object Detection
    Xu, Huachi
    Shi, Dianxi
    Jing, Luoxi
    Liu, Cong
    [J]. 2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 891 - 895
  • [28] EVENT-BASED DEBUGGING OF OBJECT ACTION PROGRAMS
    LIN, CC
    LEBLANC, RJ
    [J]. SIGPLAN NOTICES, 1989, 24 (01): : 23 - 34
  • [29] EVENT-BASED MULTIMODAL SPIKING NEURAL NETWORK WITH ATTENTION MECHANISM
    Liu, Qianhui
    Xing, Dong
    Feng, Lang
    Tang, Huajin
    Pan, Gang
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8922 - 8926
  • [30] A Markovian event-based framework for stochastic spiking neural networks
    Touboul, Jonathan D.
    Faugeras, Olivier D.
    [J]. JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2011, 31 (03) : 485 - 507