Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering

被引:0
|
作者
Mei, Guofeng [1 ,3 ]
Saltori, Cristiano [2 ]
Ricci, Elisa [2 ,3 ]
Sebe, Nicu [2 ]
Wu, Qiang [1 ]
Zhang, Jian [1 ]
Poiesi, Fabio [3 ]
机构
[1] Univ Technol Sydney, Fac Engn & IT, Ultimo, NSW 2007, Australia
[2] Univ Trento, Dept Informat Engn & Comp Sci DISI, Via Sommar 9, I-38123 Trento, Italy
[3] Fdn Bruno Kessler, Via Sommar 18, I-38123 Trento, Italy
关键词
Unsupervised learning; Point cloud; Data-augmentation; Clustering; Neural rendering;
D O I
10.1007/s11263-024-02027-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data augmentation has contributed to the rapid advancement of unsupervised learning on 3D point clouds. However, we argue that data augmentation is not ideal, as it requires a careful application-dependent selection of the types of augmentations to be performed, thus potentially biasing the information learned by the network during self-training. Moreover, several unsupervised methods only focus on uni-modal information, thus potentially introducing challenges in the case of sparse and textureless point clouds. To address these issues, we propose an augmentation-free unsupervised approach for point clouds, named CluRender, to learn transferable point-level features by leveraging uni-modal information for soft clustering and cross-modal information for neural rendering. Soft clustering enables self-training through a pseudo-label prediction task, where the affiliation of points to their clusters is used as a proxy under the constraint that these pseudo-labels divide the point cloud into approximate equal partitions. This allows us to formulate a clustering loss to minimize the standard cross-entropy between pseudo and predicted labels. Neural rendering generates photorealistic renderings from various viewpoints to transfer photometric cues from 2D images to the features. The consistency between rendered and real images is then measured to form a fitting loss, combined with the cross-entropy loss to self-train networks. Experiments on downstream applications, including 3D object detection, semantic segmentation, classification, part segmentation, and few-shot learning, demonstrate the effectiveness of our framework in outperforming state-of-the-art techniques.
引用
收藏
页码:3251 / 3269
页数:19
相关论文
共 50 条
  • [31] Neural scene representation and rendering
    Eslami, S. M. Ali
    Rezende, Danilo Jimenez
    Besse, Frederic
    Viola, Fabio
    Morcos, Ari S.
    Garnelo, Marta
    Ruderman, Avraham
    Rusu, Andrei A.
    Danihelka, Ivo
    Gregor, Karol
    Reichert, David P.
    Buesing, Lars
    Weber, Theophane
    Vinyals, Oriol
    Rosenbaum, Dan
    Rabinowitz, Neil
    King, Helen
    Hillier, Chloe
    Botvinick, Matt
    Wierstra, Daan
    Kavukcuoglu, Koray
    Hassabis, Demis
    SCIENCE, 2018, 360 (6394) : 1204 - +
  • [32] Fast and Unsupervised Neural Architecture Evolution for Visual Representation Learning
    Xue, Song
    Chen, Hanlin
    Xie, Chunyu
    Zhang, Baochang
    Gong, Xuan
    Doermann, David
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2021, 16 (03) : 22 - 32
  • [33] Neural Parametric Human Hand Modeling with Point Cloud Representation
    Yang, Jian
    Quan, Weize
    Shen, Zhen
    Yan, Dong-Ming
    Wu, Huai-Yu
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 804 - 813
  • [34] Unsupervised Point Cloud Registration by Learning Unified Gaussian Mixture Models
    Huang, Xiaoshui
    Li, Sheng
    Zuo, Yifan
    Fang, Yuming
    Zhang, Jian
    Zhao, Xiaowei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7028 - 7035
  • [35] Unsupervised Degradation Representation Learning for Unpaired Restoration of Images and Point Clouds
    Wang, Longguang
    Guo, Yulan
    Wang, Yingqian
    Dong, Xiaoyu
    Xu, Qingyu
    Yang, Jungang
    An, Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (01) : 1 - 18
  • [36] Unsupervised Feedforward Feature (UFF) Learning for Point Cloud Classification and Segmentation
    Zhang, Min
    Kadam, Pranav
    Liu, Shan
    Kuo, C-C Jay
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 144 - 147
  • [37] PointClustering: Unsupervised Point Cloud Pre-training using Transformation Invariance in Clustering
    Long, Fuchen
    Yao, Ting
    Qiu, Zhaofan
    Li, Lusong
    Mei, Tao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21824 - 21834
  • [38] Deep boundary-aware clustering by jointly optimizing unsupervised representation learning
    Ru Wang
    Lin Li
    Peipei Wang
    Xiaohui Tao
    Peiyu Liu
    Multimedia Tools and Applications, 2022, 81 : 34309 - 34324
  • [39] Deep boundary-aware clustering by jointly optimizing unsupervised representation learning
    Wang, Ru
    Li, Lin
    Wang, Peipei
    Tao, Xiaohui
    Liu, Peiyu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34309 - 34324
  • [40] Unsupervised Human Activity Representation Learning with Multi-task Deep Clustering
    Ma, Haojie
    Zhang, Zhijie
    Li, Wenzhong
    Lu, Sanglu
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2021, 5 (01):