E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning

Authors
Lin, Xiuhong [1, 2]
Qiu, Changjie [1, 2]
Cai, Zhipeng [3]
Shen, Siqi [1, 2]
Zang, Yu [1, 2]
Liu, Weiquan [1, 2]
Bian, Xuesheng [1, 5]
Mueller, Matthias [4]
Wang, Cheng [1, 2]
Affiliations
[1] Xiamen Univ XMU, Sch Informat, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen, Peoples R China
[2] XMU, Key Lab Multimedia Trusted Percept & Efficient Co, Xiamen, Peoples R China
[3] Intel Labs, Hillsboro, OR USA
[4] Apple Inc, Cupertino, CA USA
[5] Yancheng Inst Technol, Yancheng, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
VISION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Event cameras have emerged as promising vision sensors in recent years due to their unparalleled temporal resolution and dynamic range. While registration of 2D RGB images to 3D point clouds is a long-standing problem in computer vision, no prior work studies 2D-3D registration for event cameras. To this end, we propose E2PNet, the first learning-based method for event-to-point cloud registration. The core of E2PNet is a novel feature representation network called Event-Points-to-Tensor (EP2T), which encodes event data into a 2D grid-shaped feature tensor. This grid-shaped feature enables mature RGB-based frameworks to be easily used for event-to-point cloud registration, without changing hyper-parameters or the training procedure. EP2T treats the event input as spatio-temporal point clouds. Unlike standard 3D learning architectures that treat all dimensions of point clouds equally, the novel sampling and information aggregation modules in EP2T are designed to handle the inhomogeneity of the spatial and temporal dimensions. Experiments on the MVSEC and VECtor datasets demonstrate the superiority of E2PNet over hand-crafted and other learning-based methods. Compared to RGB-based registration, E2PNet is more robust to extreme illumination and fast motion due to the use of event data. Beyond 2D-3D registration, we also show the potential of EP2T for other vision tasks such as flow estimation, event-to-image reconstruction and object recognition. The source code can be found at: E2PNet.
Pages: 14