Point-MPP: Point Cloud Self-Supervised Learning From Masked Position Prediction

Authors
Fan, Songlin [1, 2]
Gao, Wei [1, 2]
Li, Ge [1]
Affiliations
[1] Peking Univ, Sch Elect & Comp Engn, Shenzhen 518055, Peoples R China
[2] Peng Cheng Lab, Sch Elect & Comp Engn, Shenzhen 518066, Peoples R China
Keywords
Point cloud compression; Semantics; Transformers; Standards; Feature extraction; Training; Circuit faults; Predictive models; Encoding; Image reconstruction; Masked position prediction; point cloud; pretraining; self-supervised learning (SSL)
DOI
10.1109/TNNLS.2024.3479309
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Masked autoencoding has gained momentum for improving fine-tuning performance in many downstream tasks. However, it tends to focus on low-level reconstruction details, lacking high-level semantics and resulting in weak transfer capability. This article presents a novel jigsaw puzzle solver inspired by the idea that predicting the positions of disordered point cloud patches provides more semantic information, similar to how children learn by solving jigsaw puzzles. Our method adopts the mask-then-predict paradigm, erasing the positions of selected point patches rather than their contents. We first partition input point clouds into irregular patches and randomly erase the positions of some patches. Then, a Transformer-based model is used to learn high-level semantic features and regress the positions of the masked patches. This approach forces the model to focus on learning transfer-robust semantics while paying less attention to low-level details. To tie the predictions within the encoding space, we further introduce a consistency constraint on their latent representations to encourage the encoded features to contain more semantic cues. We demonstrate that a standard Transformer backbone with our pretraining scheme can capture discriminative point cloud semantic information. Furthermore, extensive experiments indicate that our method outperforms the previous best competitor across six popular downstream vision tasks, achieving new state-of-the-art performance.
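The data side of the mask-then-predict step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the farthest-point-sampling and nearest-neighbour patching, the center-relative patch contents, the squared-error loss, and all function names (`farthest_point_sample`, `make_patches`, `mask_positions`, `position_loss`) are assumptions made for illustration, and the Transformer that would regress the erased positions is omitted.

```python
import math
import random


def farthest_point_sample(points, k, rng):
    """Pick k well-spread center indices (assumed patching scheme)."""
    chosen = [rng.randrange(len(points))]
    dist = [math.dist(p, points[chosen[0]]) for p in points]
    while len(chosen) < k:
        nxt = max(range(len(points)), key=dist.__getitem__)
        chosen.append(nxt)
        # Track each point's distance to its closest chosen center.
        dist = [min(d, math.dist(p, points[nxt])) for d, p in zip(dist, points)]
    return chosen


def make_patches(points, num_patches, patch_size, rng):
    """Group each center's nearest neighbours; store contents relative to the center."""
    patches = []
    for c in farthest_point_sample(points, num_patches, rng):
        center = points[c]
        nearest = sorted(points, key=lambda p: math.dist(p, center))[:patch_size]
        local = [tuple(a - b for a, b in zip(p, center)) for p in nearest]
        patches.append({"center": center, "local": local})
    return patches


def mask_positions(patches, mask_ratio, rng):
    """Erase the position of a random subset of patches, keeping their contents."""
    n_mask = int(len(patches) * mask_ratio)
    masked_ids = set(rng.sample(range(len(patches)), n_mask))
    inputs, targets = [], {}
    for i, patch in enumerate(patches):
        if i in masked_ids:
            targets[i] = patch["center"]  # regression target for the model
            inputs.append({"center": None, "local": patch["local"]})
        else:
            inputs.append(patch)
    return inputs, targets


def position_loss(predicted, targets):
    """Mean squared distance over masked patches (loss choice is an assumption)."""
    return sum(math.dist(predicted[i], t) ** 2 for i, t in targets.items()) / len(targets)


rng = random.Random(0)
cloud = [(rng.random(), rng.random(), rng.random()) for _ in range(256)]
patches = make_patches(cloud, num_patches=16, patch_size=8, rng=rng)
inputs, targets = mask_positions(patches, mask_ratio=0.5, rng=rng)
```

In this sketch a model would receive `inputs`, where masked patches keep their local geometry but have `center` set to `None`, and would be trained to output positions that minimise `position_loss` against `targets`. The consistency constraint on latent representations mentioned in the abstract is not modelled here.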
Pages: 13