Asynchronous Spatio-Temporal Memory Network for Continuous Event-Based Object Detection

被引:46
|
作者
Li, Jianing [1 ,2 ]
Li, Jia [2 ,3 ]
Zhu, Lin [1 ,2 ]
Xiang, Xijie [2 ,4 ]
Huang, Tiejun [1 ,2 ]
Tian, Yonghong [1 ,2 ]
机构
[1] Peking Univ, Sch Comp Sci, Natl Engn Res Ctr Visual Technol, Beijing 100871, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[3] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[4] Shanghai Jiao Tong Univ, Dept Elect & Comp Engn, Shanghai 200030, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Cameras; Detectors; Task analysis; Streaming media; Recurrent neural networks; Meters; event cameras; event-based vision; deep neural networks; neuromorphic engineering; NEURAL-NETWORKS; VISION; PREDICTION;
D O I
10.1109/TIP.2022.3162962
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Event cameras, offering extremely high temporal resolution and high dynamic range, have brought a new perspective to addressing common object detection challenges (e.g., motion blur and low light). However, how to learn a better spatio-temporal representation and exploit rich temporal cues from asynchronous events for object detection still remains an open issue. To address this problem, we propose a novel asynchronous spatio-temporal memory network (ASTMNet) that directly consumes asynchronous events instead of event images prior to processing, which can well detect objects in a continuous manner. Technically, ASTMNet learns an asynchronous attention embedding from the continuous event stream by adopting an adaptive temporal sampling strategy and a temporal attention convolutional module. Besides, a spatio-temporal memory module is designed to exploit rich temporal cues via a lightweight yet efficient inter-weaved recurrent-convolutional architecture. Empirically, it shows that our approach outperforms the state-of-the-art methods using the feed-forward frame-based detectors on three datasets by a large margin (i.e., 7.6% in the KITTI Simulated Dataset, 10.8% in the Gen1 Automotive Dataset, and 10.5% in the 1Mpx Detection Dataset). The results demonstrate that event cameras can perform robust object detection even in cases where conventional cameras fail, e.g., fast motion and challenging light conditions.
引用
收藏
页码:2975 / 2987
页数:13
相关论文
共 50 条
  • [1] 10.E-detector: Asynchronous Spatio-temporal for Event-based Object Detection in Intelligent Transportation System
    Zhang, Shixiong
    Wang, Wenmin
    Li, Honglei
    Zhang, Shenyong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (02)
  • [2] Event-based incremental updating of spatio-temporal database
    周晓光
    陈军
    蒋捷
    朱建军
    李志林
    Journal of Central South University of Technology(English Edition), 2004, (02) : 192 - 198
  • [3] Event-based incremental updating of spatio-temporal database
    Zhao, XG
    Chen, J
    Jiang, J
    Zhu, JJ
    Li, ZL
    JOURNAL OF CENTRAL SOUTH UNIVERSITY OF TECHNOLOGY, 2004, 11 (02): : 192 - 198
  • [4] Event-based incremental updating of spatio-temporal database
    Zhou X.-G.
    Chen J.
    Jiang J.
    Zhu J.-J.
    Li Z.-L.
    Journal of Central South University of Technology, 2004, 11 (2): : 192 - 198
  • [5] An Event-based Categorization Model Using Spatio-temporal Features in a Spiking Neural Network
    Lu, Junwei
    Dong, Junfei
    Yan, Rui
    Tang, Huajin
    2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2020, : 385 - 390
  • [6] DSTF: Dual-Stream Spatio-Temporal Fusion Network for Event-Based Data
    Gu, Xusheng
    Qiu, Changjie
    Lin, Xiuhong
    Yang, Xinjie
    Zang, Yu
    Wang, Cheng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 111 - 125
  • [7] CSTR: A Compact Spatio-Temporal Representation for Event-Based Vision
    El Shair, Zaid A.
    Hassani, Ali
    Rawashdeh, Samir A.
    IEEE ACCESS, 2023, 11 : 102899 - 102916
  • [8] Event-Based Motion Segmentation With Spatio-Temporal Graph Cuts
    Zhou, Yi
    Gallego, Guillermo
    Lu, Xiuyuan
    Liu, Siqi
    Shen, Shaojie
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4868 - 4880
  • [9] Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation
    Wang, Dongsheng
    Jia, Xu
    Zhang, Yang
    Zhang, Xinyu
    Wang, Yaoyuan
    Zhang, Ziyang
    Wang, Dong
    Lu, Huchuan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2492 - 2500
  • [10] Impact of spiking neurons leakages and network recurrences on event-based spatio-temporal pattern recognition
    Bouanane, Mohamed Sadek
    Cherifi, Dalila
    Chicca, Elisabetta
    Khacef, Lyes
    FRONTIERS IN NEUROSCIENCE, 2023, 17