PointUR-RL: Unified Self-Supervised Learning Method Based on Variable Masked Autoencoder for Point Cloud Reconstruction and Representation Learning

被引:0
|
作者
Li, Kang [1 ]
Zhu, Qiuquan [1 ]
Wang, Haoyu [1 ]
Wang, Shibo [1 ]
Tian, He [1 ]
Zhou, Ping [2 ]
Cao, Xin [1 ]
机构
[1] Northwest Univ, Sch Informat Sci & Technol, Xian 710127, Peoples R China
[2] Key Sci Res Base Ancient Polychrome Pottery Conser, Emperor Qin Shihuangs Mausoleum Site Museum, Xian 710600, Peoples R China
基金
中国国家自然科学基金;
关键词
self-supervised learning; point cloud reconstruction; representation learning; variable masked autoencoder; contrastive learning; NETWORKS;
D O I
10.3390/rs16163045
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Self-supervised learning has made significant progress in point cloud processing. Currently, the primary tasks of self-supervised learning, which include point cloud reconstruction and representation learning, are trained separately due to their structural differences. This separation inevitably leads to increased training costs and neglects the potential for mutual assistance between tasks. In this paper, a self-supervised method named PointUR-RL is introduced, which integrates point cloud reconstruction and representation learning. The method features two key components: a variable masked autoencoder (VMAE) and contrastive learning (CL). The VMAE is capable of processing input point cloud blocks with varying masking ratios, ensuring seamless adaptation to both tasks. Furthermore, CL is utilized to enhance the representation learning capabilities and improve the separability of the learned representations. Experimental results confirm the effectiveness of the method in training and its strong generalization ability for downstream tasks. Notably, high-accuracy classification and high-quality reconstruction have been achieved with the public datasets ModelNet and ShapeNet, with competitive results also obtained with the ScanObjectNN real-world dataset.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Point cloud self-supervised learning for machining feature recognition
    Zhang, Hang
    Wang, Wenhu
    Zhang, Shusheng
    Wang, Zhen
    Zhang, Yajun
    Zhou, Jingtao
    Huang, Bo
    JOURNAL OF MANUFACTURING SYSTEMS, 2024, 77 : 78 - 95
  • [32] Spatial and temporal features unified self-supervised representation learning networks
    Choudhary, Rahul
    Walambe, Rahee
    Kotecha, Ketan
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 157
  • [33] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
    Wang, Rui
    Chen, Dongdong
    Wu, Zuxuan
    Chen, Yinpeng
    Dai, Xiyang
    Liu, Mengchen
    Yuan, Lu
    Jiang, Yu-Gang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6312 - 6322
  • [34] DISENTANGLED SPEECH REPRESENTATION LEARNING BASED ON FACTORIZED HIERARCHICAL VARIATIONAL AUTOENCODER WITH SELF-SUPERVISED OBJECTIVE
    Xie, Yuying
    Arildsen, Thomas
    Tan, Zheng-Hua
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [35] Towards Latent Masked Image Modeling for Self-supervised Visual Representation Learning
    Wei, Yibing
    Gupta, Abhinav
    Morgado, Pedro
    COMPUTER VISION - ECCV 2024, PT XXXIX, 2025, 15097 : 1 - 17
  • [36] Cross-View Masked Model for Self-Supervised Graph Representation Learning
    Duan H.
    Yu B.
    Xie C.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (11): : 1 - 13
  • [37] Masked self-supervised ECG representation learning via multiview information bottleneck
    Yang, Shunxiang
    Lian, Cheng
    Zeng, Zhigang
    Xu, Bingrong
    Su, Yixin
    Xue, Chenyang
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (14): : 7625 - 7637
  • [38] HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
    Hsu, Wei-Ning
    Bolte, Benjamin
    Tsai, Yao-Hung Hubert
    Lakhotia, Kushal
    Salakhutdinov, Ruslan
    Mohamed, Abdelrahman
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3451 - 3460
  • [39] Masked self-supervised ECG representation learning via multiview information bottleneck
    Shunxiang Yang
    Cheng Lian
    Zhigang Zeng
    Bingrong Xu
    Yixin Su
    Chenyang Xue
    Neural Computing and Applications, 2024, 36 : 7625 - 7637
  • [40] Masked cosine similarity prediction for self-supervised skeleton-based action representation learning
    Ziliang Ren
    Ronggui Liu
    Yong Qin
    Xiangyang Gao
    Qieshi Zhang
    Pattern Analysis and Applications, 2025, 28 (2)