Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering

Authors
Mei, Guofeng [1, 3]
Saltori, Cristiano [2]
Ricci, Elisa [2, 3]
Sebe, Nicu [2 ]
Wu, Qiang [1 ]
Zhang, Jian [1 ]
Poiesi, Fabio [3 ]
Affiliations
[1] Univ Technol Sydney, Fac Engn & IT, Ultimo, NSW 2007, Australia
[2] Univ Trento, Dept Informat Engn & Comp Sci DISI, Via Sommar 9, I-38123 Trento, Italy
[3] Fdn Bruno Kessler, Via Sommar 18, I-38123 Trento, Italy
Keywords
Unsupervised learning; Point cloud; Data-augmentation; Clustering; Neural rendering;
DOI
10.1007/s11263-024-02027-5
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Data augmentation has contributed to the rapid advancement of unsupervised learning on 3D point clouds. However, we argue that data augmentation is not ideal, as it requires a careful application-dependent selection of the types of augmentations to be performed, thus potentially biasing the information learned by the network during self-training. Moreover, several unsupervised methods only focus on uni-modal information, thus potentially introducing challenges in the case of sparse and textureless point clouds. To address these issues, we propose an augmentation-free unsupervised approach for point clouds, named CluRender, to learn transferable point-level features by leveraging uni-modal information for soft clustering and cross-modal information for neural rendering. Soft clustering enables self-training through a pseudo-label prediction task, where the affiliation of points to their clusters is used as a proxy under the constraint that these pseudo-labels divide the point cloud into approximately equal partitions. This allows us to formulate a clustering loss that minimizes the standard cross-entropy between pseudo and predicted labels. Neural rendering generates photorealistic renderings from various viewpoints to transfer photometric cues from 2D images to the features. The consistency between rendered and real images is then measured to form a fitting loss, which is combined with the cross-entropy loss to self-train the networks. Experiments on downstream applications, including 3D object detection, semantic segmentation, classification, part segmentation, and few-shot learning, demonstrate the effectiveness of our framework in outperforming state-of-the-art techniques.
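The two loss terms described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The abstract does not name the solver used for the equal-partition constraint or the image consistency measure, so Sinkhorn-Knopp scaling and mean squared error are used here as stand-ins. All function names, the temperature `eps`, and the loss `weight` are hypothetical.

```python
import math


def log_softmax(row):
    """Numerically stable log-softmax of one list of logits."""
    m = max(row)
    log_total = math.log(sum(math.exp(v - m) for v in row))
    return [v - m - log_total for v in row]


def balanced_pseudo_labels(logits, eps=0.5, n_iters=100):
    """Soft pseudo-labels whose clusters hold roughly N / K points each.

    Assumption: Sinkhorn-Knopp scaling stands in for the equal-partition
    constraint; the abstract does not say which solver is used.
    """
    n, k = len(logits), len(logits[0])
    # Kernel exp(logit / eps), shifted per row for numerical stability.
    q = [[math.exp((v - max(row)) / eps) for v in row] for row in logits]
    for _ in range(n_iters):
        # Scale columns so every cluster receives a total mass of N / K.
        col = [sum(q[i][j] for i in range(n)) for j in range(k)]
        q = [[q[i][j] * (n / k) / col[j] for j in range(k)] for i in range(n)]
        # Scale rows so every point distributes a total mass of 1.
        q = [[v / sum(row) for v in row] for row in q]
    return q


def clustering_loss(pseudo, logits):
    """Mean cross-entropy between pseudo-labels and predicted labels."""
    total = 0.0
    for p_row, l_row in zip(pseudo, logits):
        total -= sum(p * lp for p, lp in zip(p_row, log_softmax(l_row)))
    return total / len(logits)


def fitting_loss(rendered, real):
    """Mean squared error between rendered and real pixels (assumed metric)."""
    return sum((a - b) ** 2 for a, b in zip(rendered, real)) / len(real)


def total_loss(pseudo, logits, rendered, real, weight=1.0):
    """Clustering loss plus a weighted fitting loss (weight is hypothetical)."""
    return clustering_loss(pseudo, logits) + weight * fitting_loss(rendered, real)


# Toy example: 4 points, 2 clusters; three points initially favour cluster 0.
logits = [[2.0, 0.1], [1.5, 0.3], [1.8, 0.2], [0.2, 1.0]]
pseudo = balanced_pseudo_labels(logits)
cluster_mass = [sum(row[j] for row in pseudo) for j in range(2)]
print(cluster_mass)  # each entry is close to N / K = 2
```

In the toy example the raw logits would put three of the four points in the first cluster, while the balanced pseudo-labels spread the total mass evenly over both clusters, which is the kind of constraint the abstract describes.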
Pages: 3251-3269
Number of pages: 19