Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering

被引:0
|
作者
Mei, Guofeng [1 ,3 ]
Saltori, Cristiano [2 ]
Ricci, Elisa [2 ,3 ]
Sebe, Nicu [2 ]
Wu, Qiang [1 ]
Zhang, Jian [1 ]
Poiesi, Fabio [3 ]
机构
[1] Univ Technol Sydney, Fac Engn & IT, Ultimo, NSW 2007, Australia
[2] Univ Trento, Dept Informat Engn & Comp Sci DISI, Via Sommar 9, I-38123 Trento, Italy
[3] Fdn Bruno Kessler, Via Sommar 18, I-38123 Trento, Italy
关键词
Unsupervised learning; Point cloud; Data-augmentation; Clustering; Neural rendering;
D O I
10.1007/s11263-024-02027-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data augmentation has contributed to the rapid advancement of unsupervised learning on 3D point clouds. However, we argue that data augmentation is not ideal, as it requires a careful application-dependent selection of the types of augmentations to be performed, thus potentially biasing the information learned by the network during self-training. Moreover, several unsupervised methods only focus on uni-modal information, thus potentially introducing challenges in the case of sparse and textureless point clouds. To address these issues, we propose an augmentation-free unsupervised approach for point clouds, named CluRender, to learn transferable point-level features by leveraging uni-modal information for soft clustering and cross-modal information for neural rendering. Soft clustering enables self-training through a pseudo-label prediction task, where the affiliation of points to their clusters is used as a proxy under the constraint that these pseudo-labels divide the point cloud into approximate equal partitions. This allows us to formulate a clustering loss to minimize the standard cross-entropy between pseudo and predicted labels. Neural rendering generates photorealistic renderings from various viewpoints to transfer photometric cues from 2D images to the features. The consistency between rendered and real images is then measured to form a fitting loss, combined with the cross-entropy loss to self-train networks. Experiments on downstream applications, including 3D object detection, semantic segmentation, classification, part segmentation, and few-shot learning, demonstrate the effectiveness of our framework in outperforming state-of-the-art techniques.
引用
收藏
页码:3251 / 3269
页数:19
相关论文
共 50 条
  • [41] Clustering-enhanced PointCNN for Point Cloud Classification Learning
    Yu, Yikuan
    Li, Fei
    Zheng, Yu
    Han, Min
    Le, Xinyi
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [42] Biological Data Mining for Genomic Clustering Using Unsupervised Neural Learning
    Sen, Shreyas
    Narasimhan, Seetharam
    Konar, Amit
    ENGINEERING LETTERS, 2007, 14 (02)
  • [43] Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning
    Yang, Siyuan
    Liu, Jun
    Lu, Shijian
    Er, Meng Hwa
    Kot, Alex C.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13403 - 13413
  • [44] Efficient point cloud representation learning with a recurrent hierarchical framework
    Wang, Ziming
    Zhang, Boxiang
    Ma, Ming
    Wang, Yue
    Du, Taoli
    Li, Wenhui
    APPLIED SOFT COMPUTING, 2025, 171
  • [45] Learning to Measure the Point Cloud Reconstruction Loss in a Representation Space
    Huang, Tianxin
    Ding, Zhonggan
    Zhang, Jiangning
    Tai, Ying
    Zhang, Zhenyu
    Chen, Mingang
    Wang, Chengjie
    Liu, Yong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12208 - 12217
  • [46] A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning
    Yang, Shijie
    Li, Liang
    Wang, Shuhui
    Zhang, Weigang
    Huang, Qingming
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7053 - 7061
  • [47] Road Surface Modeling and Representation from Point Cloud Based on Fuzzy Clustering
    Zhang Yi
    Yan Li
    GEO-SPATIAL INFORMATION SCIENCE, 2007, 10 (04) : 276 - 281
  • [48] Unsupervised Discrete Sentence Representation Learning for Interpretable Neural Dialog Generation
    Zhao, Tiancheng
    Lee, Kyusong
    Eskenazi, Maxine
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1098 - 1107
  • [49] Neural Complex Luminaires: Representation and Rendering
    Zhu, Junqiu
    Bai, Yaoyi
    Xu, Zilin
    Bako, Steve
    Velazquez-Armendariz, Edgar
    Wang, Lu
    Sen, Pradeep
    Hasan, Milos
    Yan, Ling-Qi
    ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (04):
  • [50] DCCN: A dual-cross contrastive neural network for 3D point cloud representation learning
    Wu, Xiaopeng
    Shi, Guangsi
    Zhao, Zexing
    Li, Mingjie
    Gao, Xiaojun
    Yan, Xiaoli
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249