Unsupervised 3D Point Cloud Representation Learning by Triangle Constrained Contrast for Autonomous Driving

被引:4
|
作者
Pang, Bo [1 ]
Xia, Hongchi [1 ]
Lu, Cewu [1 ,2 ,3 ,4 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, MoE Key Lab Artificial Intelligence, AI Inst, Shanghai, Peoples R China
[4] Shanghai Qi Zhi Inst, Shanghai, Peoples R China
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年
关键词
D O I
10.1109/CVPR52729.2023.00506
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the difficulty of annotating the 3D LiDAR data of autonomous driving, an efficient unsupervised 3D representation learning method is important. In this paper, we design the Triangle Constrained Contrast (TriCC) framework tailored for autonomous driving scenes which learns 3D unsupervised representations through both the multimodal information and dynamic of temporal sequences. We treat one camera image and two LiDAR point clouds with different timestamps as a triplet. And our key design is the consistent constraint that automatically finds matching relationships among the triplet through "self-cycle" and learns representations from it. With the matching relations across the temporal dimension and modalities, we can further conduct a triplet contrast to improve learning efficiency. To the best of our knowledge, TriCC is the first framework that unifies both the temporal and multimodal semantics, which means it utilizes almost all the information in autonomous driving scenes. And compared with previous contrastive methods, it can automatically dig out contrasting pairs with higher difficulty, instead of relying on handcrafted ones. Extensive experiments are conducted with Minkowski-UNet and VoxelNet on several semantic segmentation and 3D detection datasets. Results show that TriCC learns effective representations with much fewer training iterations and improves the SOTA results greatly on all the downstream tasks. Code and models can be found at https://bopang1996.github.io/.
引用
收藏
页码:5229 / 5239
页数:11
相关论文
共 50 条
  • [21] Iterative BTreeNet: Unsupervised learning for large and dense 3D point cloud registration
    Xi, Long
    Tang, Wen
    Xue, Tao
    Wan, TaoRuan
    NEUROCOMPUTING, 2022, 506 : 336 - 354
  • [22] Adversarial point cloud perturbations against 3D object detection in autonomous driving systems
    Wang, Xupeng
    Cai, Mumuxin
    Sohel, Ferdous
    Sang, Nan
    Chang, Zhengwei
    NEUROCOMPUTING, 2021, 466 : 27 - 36
  • [23] The Research of 3D Point Cloud Data Clustering Based on MEMS Lidar for Autonomous Driving
    Yang, Weikang
    Dong, Siwei
    Li, Dagang
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2024, 25 (05) : 1251 - 1262
  • [24] Ground-distance segmentation of 3D LiDAR point cloud toward autonomous driving
    Wu, Jian
    Yang, Qingxiong
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2020, 9 (01)
  • [25] Beyond Pattern Variance: Unsupervised 3-D Action Representation Learning With Point Cloud Sequence
    Tan, Bo
    Xiao, Yang
    Wang, Yancheng
    Li, Shuai
    Yang, Jianyu
    Cao, Zhiguo
    Zhou, Joey Tianyi
    Yuan, Junsong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 18186 - 18199
  • [26] MSL3D: 3D object detection from monocular, stereo and point cloud for autonomous driving
    Chen, Wenyu
    Li, Peixuan
    Zhao, Huaici
    NEUROCOMPUTING, 2022, 494 : 23 - 32
  • [27] Learning-Based Underwater Autonomous Grasping via 3D Point Cloud
    Wang, Cong
    Zhang, Qifeng
    Li, Shuo
    Wang, Xiaohui
    Lane, David
    Petillot, Yvan
    Wang, Sen
    OCEANS 2021: SAN DIEGO - PORTO, 2021,
  • [28] DC3DCD: Unsupervised learning for multiclass 3D point cloud change detection
    de Gelis, Iris
    Lefevre, Sebastien
    Corpetti, Thomas
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 206 : 168 - 183
  • [29] 3D Point Cloud Registration with Multi-Scale Architecture and Unsupervised Transfer Learning
    Horache, Sofiane
    Deschaud, Jean-Emmanuel
    Goulette, Francois
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 1351 - 1361
  • [30] Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds
    Rao, Yongming
    Lu, Jiwen
    Zhou, Jie
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5375 - 5384