Point-MPP: Point Cloud Self-Supervised Learning From Masked Position Prediction

被引:0
|
作者
Fan, Songlin [1 ,2 ]
Gao, Wei [1 ,2 ]
Li, Ge [1 ]
机构
[1] Peking Univ, Sch Elect & Comp Engn, Shenzhen 518055, Peoples R China
[2] Peng Cheng Lab, Sch Elect & Comp Engn, Shenzhen 518066, Peoples R China
关键词
Point cloud compression; Semantics; Transformers; Standards; Feature extraction; Training; Circuit faults; Predictive models; Encoding; Image reconstruction; Masked position prediction; point cloud; pretraining; self-supervised learning (SSL);
D O I
10.1109/TNNLS.2024.3479309
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Masked autoencoding has gained momentum for improving fine-tuning performance in many downstream tasks. However, it tends to focus on low-level reconstruction details, lacking high-level semantics and resulting in weak transfer capability. This article presents a novel jigsaw puzzle solver inspired by the idea that predicting the positions of disordered point cloud patches provides more semantic information, similar to how children learn by solving jigsaw puzzles. Our method adopts the mask-then-predict paradigm, erasing the positions of selected point patches rather than their contents. We first partition input point clouds into irregular patches and randomly erase the positions of some patches. Then, a Transformer-based model is used to learn high-level semantic features and regress the positions of the masked patches. This approach forces the model to focus on learning transfer-robust semantics while paying less attention to low-level details. To tie the predictions within the encoding space, we further introduce a consistency constraint on their latent representations to encourage the encoded features to contain more semantic cues. We demonstrate that a standard Transformer backbone with our pretraining scheme can capture discriminative point cloud semantic information. Furthermore, extensive experiments indicate that our method outperforms the previous best competitor across six popular downstream vision tasks, achieving new state-of-the-art performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Generative Variational-Contrastive Learning for Self-Supervised Point Cloud Representation
    Wang, Bohua
    Tian, Zhiqiang
    Ye, Aixue
    Wen, Feng
    Du, Shaoyi
    Gao, Yue
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6154 - 6166
  • [22] Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning
    Liu, Yang
    Chen, Chen
    Wang, Can
    King, Xulin
    Liu, Mengyuan
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1738 - 1749
  • [23] Self-Supervised Point Cloud Representation Learning via Separating Mixed Shapes
    Sun, Chao
    Zheng, Zhedong
    Wang, Xiaohan
    Xu, Mingliang
    Yang, Yi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6207 - 6218
  • [24] Cross-Architecture Relational Consistency for Point Cloud Self-Supervised Learning
    Li, Hongyu
    Zhang, Yifei
    Yang, Dongbao
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 661 - 668
  • [25] Self-Supervised Point Cloud Understanding via Mask Transformer and Contrastive Learning
    Wang, Di
    Yang, Zhi-Xin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (01) : 184 - 191
  • [26] PointUR-RL: Unified Self-Supervised Learning Method Based on Variable Masked Autoencoder for Point Cloud Reconstruction and Representation Learning
    Li, Kang
    Zhu, Qiuquan
    Wang, Haoyu
    Wang, Shibo
    Tian, He
    Zhou, Ping
    Cao, Xin
    REMOTE SENSING, 2024, 16 (16)
  • [27] Point-AGM : Attention Guided Masked Auto-Encoder for Joint Self-supervised Learning on Point Clouds
    Liu, Jie
    Yang, Mengna
    Tian, Yu
    Li, Yancui
    Song, Da
    Li, Kang
    Cao, Xin
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)
  • [28] Self-Supervised Learning for 3-D Point Clouds Based on a Masked Linear Autoencoder
    Yang, Hongxin
    Wang, Ruisheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 11
  • [29] SPINet: self-supervised point cloud frame interpolation network
    Xu, Jiawen
    Le, Xinyi
    Chen, Cailian
    Guan, Xinping
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (14): : 9951 - 9960
  • [30] SPINet: self-supervised point cloud frame interpolation network
    Jiawen Xu
    Xinyi Le
    Cailian Chen
    Xinping Guan
    Neural Computing and Applications, 2023, 35 : 9951 - 9960