E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning

被引：0

作者：

Lin, Xiuhong ^{[1
,2
]}

Qiu, Changjie ^{[1
,2
]}

Cai, Zhipeng ^{[3
]}

Shen, Siqi ^{[1
,2
]}

Zang, Yu ^{[1
,2
]}

Liu, Weiquan ^{[1
,2
]}

Bian, Xuesheng ^{[1
,5
]}

Mueller, Matthias ^{[4
]}

Wang, Cheng ^{[1
,2
]}

机构：

[1] Xiamen Univ XMU, Sch Informat, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen, Peoples R China

[2] XMU, Key Lab Multimedia Trusted Percept & Efficient Co, Xiamen, Peoples R China

[3] Intel Labs, Hillsboro, OR USA

[4] Apple Inc, Cupertino, CA USA

[5] Yancheng Inst Technol, Yancheng, Peoples R China

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

VISION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Event cameras have emerged as a promising vision sensor in recent years due to their unparalleled temporal resolution and dynamic range. While registration of 2D RGB images to 3D point clouds is a long-standing problem in computer vision, no prior work studies 2D-3D registration for event cameras. To this end, we propose E2PNet, the first learning-based method for event-to-point cloud registration. The core of E2PNet is a novel feature representation network called Event-Points-to-Tensor (EP2T), which encodes event data into a 2D grid-shaped feature tensor. This grid-shaped feature enables matured RGB-based frameworks to be easily used for event-to-point cloud registration, without changing hyper-parameters and the training procedure. EP2T treats the event input as spatio-temporal point clouds. Unlike standard 3D learning architectures that treat all dimensions of point clouds equally, the novel sampling and information aggregation modules in EP2T are designed to handle the inhomogeneity of the spatial and temporal dimensions. Experiments on the MVSEC and VECtor datasets demonstrate the superiority of E2PNet over hand-crafted and other learning-based methods. Compared to RGB-based registration, E2PNet is more robust to extreme illumination or fast motion due to the use of event data. Beyond 2D-3D registration, we also show the potential of EP2T for other vision tasks such as flow estimation, event-to-image reconstruction and object recognition. The source code can be found at: E2PNet.

引用

页数：14

共 50 条

[31] Spatio-Temporal Fusion: A Fusion Approach for Point Cloud Sparsity Problem
Zhao, Chongjun
Xu, Haoran
Xu, Hua
Lai, Kexue
Cen, Ming
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4836 - 4841
[32] Real-Time Spatio-Temporal LiDAR Point Cloud Compression
Feng, Yu
Liu, Shaoshan
Zhu, Yuhao
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10766 - 10773
[33] Fast Motion State Estimation Based on Point Cloud by Combing Deep Learning and Spatio-Temporal Constraints
Wu, Sidong
Ren, Liuquan
Zhu, Enzhi
APPLIED SCIENCES-BASEL, 2024, 14 (19):
[34] Video2Vec: Learning Semantic Spatio-Temporal Embeddings for Video Representation
Hu, Sheng-Hung
Li, Yikang
Li, Baoxin
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 811 - 816
[35] Spatio-Temporal EEG Representation Learning on Riemannian Manifold and Euclidean Space
Zhang, Guangyi
Etemad, Ali
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (02): : 1469 - 1483
[36] Spatio-Temporal Representation Learning with Social Tie for Personalized POI Recommendation
Dai, Shaojie
Yu, Yanwei
Fan, Hao
Dong, Junyu
DATA SCIENCE AND ENGINEERING, 2022, 7 (01) : 44 - 56
[37] Machine Learning Based Representative Spatio-Temporal Event Documents Classification
Kim, Byoungwook
Yang, Yeongwook
Park, Ji Su
Jang, Hong-Jun
APPLIED SCIENCES-BASEL, 2023, 13 (07):
[38] Hierarchical Representation Learning based spatio-temporal data redundancy reduction
Wang, Min
Yang, Shuyuan
Wu, Bin
NEUROCOMPUTING, 2016, 173 : 298 - 305
[39] Spatio-Temporal Representation Learning with Social Tie for Personalized POI Recommendation
Shaojie Dai
Yanwei Yu
Hao Fan
Junyu Dong
Data Science and Engineering, 2022, 7 : 44 - 56
[40] Urban mobility structure detection via spatio-temporal representation learning
Duan, Xiaoqi
Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2024, 53 (08):

← 1 2 3 4 5 →