Self-Supervised Learning on 3D Point Clouds by Learning Discrete Generative Models

Cited by: 36
Authors
Eckart, Benjamin [1]
Yuan, Wentao [2]
Liu, Chao [1]
Kautz, Jan [1]
Affiliations
[1] NVIDIA, Santa Clara, CA 95051 USA
[2] Univ Washington, Seattle, WA 98195 USA
DOI
10.1109/CVPR46437.2021.00815
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While recent pre-training tasks on 2D images have proven very successful for transfer learning, pre-training for 3D data remains challenging. In this work, we introduce a general method for 3D self-supervised representation learning that 1) remains agnostic to the underlying neural network architecture, and 2) specifically leverages the geometric nature of 3D point cloud data. The proposed task softly segments 3D points into a discrete number of geometric partitions. A self-supervised loss is formed under the interpretation that these soft partitions implicitly parameterize a latent Gaussian Mixture Model (GMM), and that this generative model establishes a data likelihood function. Our pretext task can therefore be viewed in terms of an encoder-decoder paradigm that squeezes learned representations through an implicitly defined parametric discrete generative model bottleneck. We show that any existing neural network architecture designed for supervised point cloud segmentation can be repurposed for the proposed unsupervised pretext task. By maximizing data likelihood with respect to the soft partitions formed by the unsupervised point-wise segmentation network, learned representations are encouraged to contain compositionally rich geometric information. In tests, we show that our method naturally induces semantic separation in feature space, resulting in state-of-the-art performance on downstream applications like model classification and semantic segmentation.
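The likelihood idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes the soft partitions arrive as an (N, J) matrix `gamma` of per-point membership weights (for example, a softmax over a segmentation network's output), and the function names, the full-covariance choice, and the `eps` regularizer are all hypothetical. The sketch derives GMM parameters from the soft partitions in closed form and then scores the points under that mixture; the resulting negative log-likelihood is the kind of quantity a self-supervised loss could minimize.

```python
import numpy as np

def gmm_from_soft_partitions(points, gamma, eps=1e-6):
    """Closed-form GMM parameters implied by soft assignments.

    points: (N, D) point cloud; gamma: (N, J) soft memberships (rows sum to 1).
    Assumption: full covariances, regularized by eps for numerical stability.
    """
    n_j = gamma.sum(axis=0) + eps                  # (J,) soft point counts
    pi = n_j / n_j.sum()                           # (J,) mixing weights
    mu = (gamma.T @ points) / n_j[:, None]         # (J, D) weighted means
    diff = points[:, None, :] - mu[None, :, :]     # (N, J, D)
    sigma = np.einsum("nj,nja,njb->jab", gamma, diff, diff) / n_j[:, None, None]
    sigma = sigma + eps * np.eye(points.shape[1])  # keep covariances invertible
    return pi, mu, sigma

def negative_log_likelihood(points, pi, mu, sigma):
    """Mean negative log-likelihood of the points under the mixture."""
    d = points.shape[1]
    diff = points[:, None, :] - mu[None, :, :]     # (N, J, D)
    inv = np.linalg.inv(sigma)                     # (J, D, D)
    maha = np.einsum("nja,jab,njb->nj", diff, inv, diff)
    logdet = np.linalg.slogdet(sigma)[1]           # (J,)
    log_prob = np.log(pi) - 0.5 * (d * np.log(2.0 * np.pi) + logdet + maha)
    # Numerically stable log-sum-exp over the J components.
    m = log_prob.max(axis=1, keepdims=True)
    log_lik = m[:, 0] + np.log(np.exp(log_prob - m).sum(axis=1))
    return -log_lik.mean()
```

In this sketch, a partition that follows the geometry of the data (each component covering one compact region) yields a lower negative log-likelihood than an uninformative partition that assigns every point equally to all components, which is the pressure the abstract describes toward geometrically meaningful segments.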
Pages: 8244-8253
Number of pages: 10