Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering

被引：0

作者：

Mei, Guofeng ^{[1
,3
]}

Saltori, Cristiano ^{[2
]}

Ricci, Elisa ^{[2
,3
]}

Sebe, Nicu ^{[2
]}

Wu, Qiang ^{[1
]}

Zhang, Jian ^{[1
]}

Poiesi, Fabio ^{[3
]}

机构：

[1] Univ Technol Sydney, Fac Engn & IT, Ultimo, NSW 2007, Australia

[2] Univ Trento, Dept Informat Engn & Comp Sci DISI, Via Sommar 9, I-38123 Trento, Italy

[3] Fdn Bruno Kessler, Via Sommar 18, I-38123 Trento, Italy

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2024年 / 132卷 / 08期

关键词：

Unsupervised learning; Point cloud; Data-augmentation; Clustering; Neural rendering;

D O I：

10.1007/s11263-024-02027-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Data augmentation has contributed to the rapid advancement of unsupervised learning on 3D point clouds. However, we argue that data augmentation is not ideal, as it requires a careful application-dependent selection of the types of augmentations to be performed, thus potentially biasing the information learned by the network during self-training. Moreover, several unsupervised methods only focus on uni-modal information, thus potentially introducing challenges in the case of sparse and textureless point clouds. To address these issues, we propose an augmentation-free unsupervised approach for point clouds, named CluRender, to learn transferable point-level features by leveraging uni-modal information for soft clustering and cross-modal information for neural rendering. Soft clustering enables self-training through a pseudo-label prediction task, where the affiliation of points to their clusters is used as a proxy under the constraint that these pseudo-labels divide the point cloud into approximate equal partitions. This allows us to formulate a clustering loss to minimize the standard cross-entropy between pseudo and predicted labels. Neural rendering generates photorealistic renderings from various viewpoints to transfer photometric cues from 2D images to the features. The consistency between rendered and real images is then measured to form a fitting loss, combined with the cross-entropy loss to self-train networks. Experiments on downstream applications, including 3D object detection, semantic segmentation, classification, part segmentation, and few-shot learning, demonstrate the effectiveness of our framework in outperforming state-of-the-art techniques.

引用

页码：3251 / 3269

页数：19

共 50 条

[31] Neural scene representation and rendering
Eslami, S. M. Ali
Rezende, Danilo Jimenez
Besse, Frederic
Viola, Fabio
Morcos, Ari S.
Garnelo, Marta
Ruderman, Avraham
Rusu, Andrei A.
Danihelka, Ivo
Gregor, Karol
Reichert, David P.
Buesing, Lars
Weber, Theophane
Vinyals, Oriol
Rosenbaum, Dan
Rabinowitz, Neil
King, Helen
Hillier, Chloe
Botvinick, Matt
Wierstra, Daan
Kavukcuoglu, Koray
Hassabis, Demis
SCIENCE, 2018, 360 (6394) : 1204 - +
[32] Fast and Unsupervised Neural Architecture Evolution for Visual Representation Learning
Xue, Song
Chen, Hanlin
Xie, Chunyu
Zhang, Baochang
Gong, Xuan
Doermann, David
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2021, 16 (03) : 22 - 32
[33] Neural Parametric Human Hand Modeling with Point Cloud Representation
Yang, Jian
Quan, Weize
Shen, Zhen
Yan, Dong-Ming
Wu, Huai-Yu
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 804 - 813
[34] Unsupervised Point Cloud Registration by Learning Unified Gaussian Mixture Models
Huang, Xiaoshui
Li, Sheng
Zuo, Yifan
Fang, Yuming
Zhang, Jian
Zhao, Xiaowei
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7028 - 7035
[35] Unsupervised Degradation Representation Learning for Unpaired Restoration of Images and Point Clouds
Wang, Longguang
Guo, Yulan
Wang, Yingqian
Dong, Xiaoyu
Xu, Qingyu
Yang, Jungang
An, Wei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (01) : 1 - 18
[36] Unsupervised Feedforward Feature (UFF) Learning for Point Cloud Classification and Segmentation
Zhang, Min
Kadam, Pranav
Liu, Shan
Kuo, C-C Jay
2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 144 - 147
[37] PointClustering: Unsupervised Point Cloud Pre-training using Transformation Invariance in Clustering
Long, Fuchen
Yao, Ting
Qiu, Zhaofan
Li, Lusong
Mei, Tao
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21824 - 21834
[38] Deep boundary-aware clustering by jointly optimizing unsupervised representation learning
Ru Wang
Lin Li
Peipei Wang
Xiaohui Tao
Peiyu Liu
Multimedia Tools and Applications, 2022, 81 : 34309 - 34324
[39] Deep boundary-aware clustering by jointly optimizing unsupervised representation learning
Wang, Ru
Li, Lin
Wang, Peipei
Tao, Xiaohui
Liu, Peiyu
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34309 - 34324
[40] Unsupervised Human Activity Representation Learning with Multi-task Deep Clustering
Ma, Haojie
Zhang, Zhijie
Li, Wenzhong
Lu, Sanglu
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2021, 5 (01):

← 1 2 3 4 5 →