Point-MPP: Point Cloud Self-Supervised Learning From Masked Position Prediction

Cited by: 0

Authors:

Fan, Songlin [1,2]

Gao, Wei [1,2]

Li, Ge [1]

Affiliations:

[1] Peking Univ, Sch Elect & Comp Engn, Shenzhen 518055, Peoples R China

[2] Peng Cheng Lab, Sch Elect & Comp Engn, Shenzhen 518066, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024

Keywords:

Point cloud compression; Semantics; Transformers; Standards; Feature extraction; Training; Circuit faults; Predictive models; Encoding; Image reconstruction; Masked position prediction; point cloud; pretraining; self-supervised learning (SSL);

DOI:

10.1109/TNNLS.2024.3479309

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Masked autoencoding has gained momentum for improving fine-tuning performance in many downstream tasks. However, it tends to focus on low-level reconstruction details, lacking high-level semantics and resulting in weak transfer capability. This article presents a novel jigsaw puzzle solver inspired by the idea that predicting the positions of disordered point cloud patches provides more semantic information, similar to how children learn by solving jigsaw puzzles. Our method adopts the mask-then-predict paradigm, erasing the positions of selected point patches rather than their contents. We first partition input point clouds into irregular patches and randomly erase the positions of some patches. Then, a Transformer-based model is used to learn high-level semantic features and regress the positions of the masked patches. This approach forces the model to focus on learning transfer-robust semantics while paying less attention to low-level details. To tie the predictions within the encoding space, we further introduce a consistency constraint on their latent representations to encourage the encoded features to contain more semantic cues. We demonstrate that a standard Transformer backbone with our pretraining scheme can capture discriminative point cloud semantic information. Furthermore, extensive experiments indicate that our method outperforms the previous best competitor across six popular downstream vision tasks, achieving new state-of-the-art performance.
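The data-preparation step described in the abstract (partition into patches, then erase the positions of some patches) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the use of farthest point sampling plus k-nearest neighbors for patching, the zero placeholder for erased positions, and all function names and parameter values (`num_patches`, `patch_size`, `mask_ratio`) are assumptions made for illustration. The Transformer and the consistency constraint are omitted.

```python
import numpy as np

def farthest_point_sampling(points, num_centers, rng):
    """Pick num_centers well-spread point indices from points of shape (N, 3)."""
    n = points.shape[0]
    chosen = np.empty(num_centers, dtype=np.int64)
    chosen[0] = rng.integers(n)
    dist = np.full(n, np.inf)
    for i in range(1, num_centers):
        diff = points - points[chosen[i - 1]]
        # Squared distance of every point to its nearest already-chosen center.
        dist = np.minimum(dist, np.einsum("ij,ij->i", diff, diff))
        chosen[i] = int(np.argmax(dist))
    return chosen

def make_position_masking_sample(points, num_patches=8, patch_size=16,
                                 mask_ratio=0.5, seed=0):
    """Hypothetical helper: build patches and erase some patch positions."""
    rng = np.random.default_rng(seed)
    centers = points[farthest_point_sampling(points, num_patches, rng)]  # (P, 3)

    # Assumed patching scheme: each patch is the k nearest points to a center.
    d2 = ((centers[:, None, :] - points[None, :, :]) ** 2).sum(axis=-1)  # (P, N)
    knn_idx = np.argsort(d2, axis=1)[:, :patch_size]                     # (P, K)

    # Patch content is stored relative to its center, so it carries no position.
    patches = points[knn_idx] - centers[:, None, :]                      # (P, K, 3)

    # Randomly choose which patches lose their position (content is kept).
    num_masked = int(round(mask_ratio * num_patches))
    masked = np.zeros(num_patches, dtype=bool)
    masked[rng.choice(num_patches, size=num_masked, replace=False)] = True

    # Erased positions are replaced by a zero placeholder (an assumption).
    positions = np.where(masked[:, None], 0.0, centers)                  # (P, 3)
    targets = centers[masked]  # regression targets for a position predictor
    return patches, positions, masked, targets

cloud = np.random.default_rng(1).normal(size=(256, 3))
patches, positions, masked, targets = make_position_masking_sample(cloud)
```

A model trained on such samples would receive `patches` and `positions` and regress `targets` for the masked patches; the choice of loss is left open here because the abstract does not state it.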

Pages: 13

50 items in total

[1] Masked Autoencoders for Point Cloud Self-supervised Learning
Pang, Yatian
Wang, Wenxiao
Tay, Francis E. H.
Liu, Wei
Tian, Yonghong
Yuan, Li
COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 604 - 621
[2] Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
Shen, Zhiqiang
Sheng, Xiaoxiao
Fan, Hehe
Wang, Longguang
Guo, Yulan
Liu, Qiong
Wen, Hao
Zhou, Xi
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16534 - 16543
[3] Masked Discrimination for Self-supervised Learning on Point Clouds
Liu, Haotian
Cai, Mu
Lee, Yong Jae
COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 657 - 675
[4] Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos
Sheng, Xiaoxiao
Shen, Zhiqiang
Xiao, Gang
Wang, Longguang
Guo, Yulan
Fan, Hehe
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16469 - 16478
[5] PatchMixing Masked Autoencoders for 3D Point Cloud Self-Supervised Learning
Lin, Chengxing
Xu, Wenju
Zhu, Jian
Nie, Yongwei
Cai, Ruichu
Xu, Xuemiao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9882 - 9897
[6] Self-Supervised Point Cloud Prediction for Autonomous Driving
Du, Ronghua
Feng, Rongying
Gao, Kai
Zhang, Jinlai
Liu, Linhong
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17452 - 17467
[7] GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training
Tian, Xiaoyu
Ran, Haoxi
Wang, Yue
Zhao, Hang
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13570 - 13580
[8] Self-supervised learning for point cloud data: A survey
Zeng, Changyu
Wang, Wei
Nguyen, Anh
Xiao, Jimin
Yue, Yutao
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
[9] PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos
Shen, Zhiqiang
Sheng, Xiaoxiao
Wang, Longguang
Guo, Yulan
Liu, Qiong
Zhou, Xi
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1212 - 1222
[10] Self-Supervised Boundary Point Prediction Task for Point Cloud Domain Adaptation
Chen, Jintao
Zhang, Yan
Huang, Kun
Ma, Feifan
Tan, Zhuangbin
Xu, Zheyu
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (09) : 5878 - 5885