Unsupervised 3D Point Cloud Representation Learning by Triangle Constrained Contrast for Autonomous Driving

Cited by: 4

Authors:

Pang, Bo [1]

Xia, Hongchi [1]

Lu, Cewu [1,2,3,4]

Affiliations:

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[2] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai, Peoples R China

[3] Shanghai Jiao Tong Univ, MoE Key Lab Artificial Intelligence, AI Inst, Shanghai, Peoples R China

[4] Shanghai Qi Zhi Inst, Shanghai, Peoples R China

Source:

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023

Keywords:

DOI:

10.1109/CVPR52729.2023.00506

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Due to the difficulty of annotating the 3D LiDAR data of autonomous driving, an efficient unsupervised 3D representation learning method is important. In this paper, we design the Triangle Constrained Contrast (TriCC) framework tailored for autonomous driving scenes, which learns unsupervised 3D representations from both multimodal information and the dynamics of temporal sequences. We treat one camera image and two LiDAR point clouds with different timestamps as a triplet. Our key design is the consistent constraint that automatically finds matching relationships among the triplet through "self-cycle" and learns representations from it. With the matching relations across the temporal dimension and modalities, we can further conduct a triplet contrast to improve learning efficiency. To the best of our knowledge, TriCC is the first framework that unifies both the temporal and multimodal semantics, which means it utilizes almost all the information in autonomous driving scenes. Compared with previous contrastive methods, it can automatically dig out contrasting pairs with higher difficulty, instead of relying on handcrafted ones. Extensive experiments are conducted with Minkowski-UNet and VoxelNet on several semantic segmentation and 3D detection datasets. Results show that TriCC learns effective representations with far fewer training iterations and greatly improves the SOTA results on all the downstream tasks. Code and models can be found at https://bopang1996.github.io/.


Pages: 5229-5239

Page count: 11

50 items in total

[21] Iterative BTreeNet: Unsupervised learning for large and dense 3D point cloud registration
Xi, Long
Tang, Wen
Xue, Tao
Wan, TaoRuan
NEUROCOMPUTING, 2022, 506 : 336 - 354
[22] Adversarial point cloud perturbations against 3D object detection in autonomous driving systems
Wang, Xupeng
Cai, Mumuxin
Sohel, Ferdous
Sang, Nan
Chang, Zhengwei
NEUROCOMPUTING, 2021, 466 : 27 - 36
[23] The Research of 3D Point Cloud Data Clustering Based on MEMS Lidar for Autonomous Driving
Yang, Weikang
Dong, Siwei
Li, Dagang
INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2024, 25 (05) : 1251 - 1262
[24] Ground-distance segmentation of 3D LiDAR point cloud toward autonomous driving
Wu, Jian
Yang, Qingxiong
APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2020, 9 (01)
[25] Beyond Pattern Variance: Unsupervised 3-D Action Representation Learning With Point Cloud Sequence
Tan, Bo
Xiao, Yang
Wang, Yancheng
Li, Shuai
Yang, Jianyu
Cao, Zhiguo
Zhou, Joey Tianyi
Yuan, Junsong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 18186 - 18199
[26] MSL3D: 3D object detection from monocular, stereo and point cloud for autonomous driving
Chen, Wenyu
Li, Peixuan
Zhao, Huaici
NEUROCOMPUTING, 2022, 494 : 23 - 32
[27] Learning-Based Underwater Autonomous Grasping via 3D Point Cloud
Wang, Cong
Zhang, Qifeng
Li, Shuo
Wang, Xiaohui
Lane, David
Petillot, Yvan
Wang, Sen
OCEANS 2021: SAN DIEGO - PORTO, 2021,
[28] DC3DCD: Unsupervised learning for multiclass 3D point cloud change detection
de Gelis, Iris
Lefevre, Sebastien
Corpetti, Thomas
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 206 : 168 - 183
[29] 3D Point Cloud Registration with Multi-Scale Architecture and Unsupervised Transfer Learning
Horache, Sofiane
Deschaud, Jean-Emmanuel
Goulette, Francois
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 1351 - 1361
[30] Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds
Rao, Yongming
Lu, Jiwen
Zhou, Jie
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5375 - 5384
