E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning

被引:0
|
作者
Lin, Xiuhong [1 ,2 ]
Qiu, Changjie [1 ,2 ]
Cai, Zhipeng [3 ]
Shen, Siqi [1 ,2 ]
Zang, Yu [1 ,2 ]
Liu, Weiquan [1 ,2 ]
Bian, Xuesheng [1 ,5 ]
Mueller, Matthias [4 ]
Wang, Cheng [1 ,2 ]
机构
[1] Xiamen Univ XMU, Sch Informat, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen, Peoples R China
[2] XMU, Key Lab Multimedia Trusted Percept & Efficient Co, Xiamen, Peoples R China
[3] Intel Labs, Hillsboro, OR USA
[4] Apple Inc, Cupertino, CA USA
[5] Yancheng Inst Technol, Yancheng, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
VISION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Event cameras have emerged as a promising vision sensor in recent years due to their unparalleled temporal resolution and dynamic range. While registration of 2D RGB images to 3D point clouds is a long-standing problem in computer vision, no prior work studies 2D-3D registration for event cameras. To this end, we propose E2PNet, the first learning-based method for event-to-point cloud registration. The core of E2PNet is a novel feature representation network called Event-Points-to-Tensor (EP2T), which encodes event data into a 2D grid-shaped feature tensor. This grid-shaped feature enables matured RGB-based frameworks to be easily used for event-to-point cloud registration, without changing hyper-parameters and the training procedure. EP2T treats the event input as spatio-temporal point clouds. Unlike standard 3D learning architectures that treat all dimensions of point clouds equally, the novel sampling and information aggregation modules in EP2T are designed to handle the inhomogeneity of the spatial and temporal dimensions. Experiments on the MVSEC and VECtor datasets demonstrate the superiority of E2PNet over hand-crafted and other learning-based methods. Compared to RGB-based registration, E2PNet is more robust to extreme illumination or fast motion due to the use of event data. Beyond 2D-3D registration, we also show the potential of EP2T for other vision tasks such as flow estimation, event-to-image reconstruction and object recognition. The source code can be found at: E2PNet.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Spatio-Temporal Fusion: A Fusion Approach for Point Cloud Sparsity Problem
    Zhao, Chongjun
    Xu, Haoran
    Xu, Hua
    Lai, Kexue
    Cen, Ming
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4836 - 4841
  • [32] Real-Time Spatio-Temporal LiDAR Point Cloud Compression
    Feng, Yu
    Liu, Shaoshan
    Zhu, Yuhao
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10766 - 10773
  • [33] Fast Motion State Estimation Based on Point Cloud by Combing Deep Learning and Spatio-Temporal Constraints
    Wu, Sidong
    Ren, Liuquan
    Zhu, Enzhi
    APPLIED SCIENCES-BASEL, 2024, 14 (19):
  • [34] Video2Vec: Learning Semantic Spatio-Temporal Embeddings for Video Representation
    Hu, Sheng-Hung
    Li, Yikang
    Li, Baoxin
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 811 - 816
  • [35] Spatio-Temporal EEG Representation Learning on Riemannian Manifold and Euclidean Space
    Zhang, Guangyi
    Etemad, Ali
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (02): : 1469 - 1483
  • [36] Spatio-Temporal Representation Learning with Social Tie for Personalized POI Recommendation
    Dai, Shaojie
    Yu, Yanwei
    Fan, Hao
    Dong, Junyu
    DATA SCIENCE AND ENGINEERING, 2022, 7 (01) : 44 - 56
  • [37] Machine Learning Based Representative Spatio-Temporal Event Documents Classification
    Kim, Byoungwook
    Yang, Yeongwook
    Park, Ji Su
    Jang, Hong-Jun
    APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [38] Hierarchical Representation Learning based spatio-temporal data redundancy reduction
    Wang, Min
    Yang, Shuyuan
    Wu, Bin
    NEUROCOMPUTING, 2016, 173 : 298 - 305
  • [39] Spatio-Temporal Representation Learning with Social Tie for Personalized POI Recommendation
    Shaojie Dai
    Yanwei Yu
    Hao Fan
    Junyu Dong
    Data Science and Engineering, 2022, 7 : 44 - 56
  • [40] Urban mobility structure detection via spatio-temporal representation learning
    Duan, Xiaoqi
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2024, 53 (08):