T-MAE : Temporal Masked Autoencoders for Point Cloud Representation Learning

被引:0
|
作者
Wei, Weijie [1 ]
Nejadasl, Fatemeh Karimi [1 ]
Gevers, Theo [1 ]
Oswald, Martin R. [1 ]
机构
[1] Univ Amsterdam, Amsterdam, Netherlands
来源
关键词
Self-supervised learning; LiDAR point cloud; 3D detection; NETWORKS;
D O I
10.1007/978-3-031-73247-8_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The scarcity of annotated data in LiDAR point cloud understanding hinders effective representation learning. Consequently, scholars have been actively investigating efficacious self-supervised pre-training paradigms. Nevertheless, temporal information, which is inherent in the LiDAR point cloud sequence, is consistently disregarded. To better utilize this property, we propose an effective pre-training strategy, namely Temporal Masked Auto-Encoders (T-MAE), which takes as input temporally adjacent frames and learns temporal dependency. A SiamWCA backbone, containing a Siamese encoder and a windowed cross-attention (WCA) module, is established for the two-frame input. Considering that the movement of an ego-vehicle alters the view of the same instance, temporal modeling also serves as a robust and natural data augmentation, enhancing the comprehension of target objects. SiamWCA is a powerful architecture but heavily relies on annotated data. Our T-MAE pre-training strategy alleviates its demand for annotated data. Comprehensive experiments demonstrate that T-MAE achieves the best performance on both Waymo and ONCE datasets among competitive self-supervised approaches.
引用
收藏
页码:178 / 195
页数:18
相关论文
共 50 条
  • [21] Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
    Zhang, Renrui
    Guo, Ziyu
    Fang, Rongyao
    Zhao, Bin
    Wang, Dong
    Qiao, Yu
    Li, Hongsheng
    Gao, Peng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [22] Contrastive Predictive Autoencoders for Dynamic Point Cloud Self-Supervised Learning
    Sheng, Xiaoxiao
    Shen, Zhiqiang
    Xiao, Gang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9802 - 9810
  • [23] Masked Motion Prediction with Semantic Contrast for Point Cloud Sequence Learning
    Han, Yuehui
    Xu, Can
    Xu, Rui
    Qian, Jianjun
    Xie, Jin
    COMPUTER VISION - ECCV 2024, PT LXXVI, 2025, 15134 : 414 - 431
  • [24] E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning
    Lin, Xiuhong
    Qiu, Changjie
    Cai, Zhipeng
    Shen, Siqi
    Zang, Yu
    Liu, Weiquan
    Bian, Xuesheng
    Mueller, Matthias
    Wang, Cheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [25] PointUR-RL: Unified Self-Supervised Learning Method Based on Variable Masked Autoencoder for Point Cloud Reconstruction and Representation Learning
    Li, Kang
    Zhu, Qiuquan
    Wang, Haoyu
    Wang, Shibo
    Tian, He
    Zhou, Ping
    Cao, Xin
    REMOTE SENSING, 2024, 16 (16)
  • [26] Efficient point cloud representation learning with a recurrent hierarchical framework
    Wang, Ziming
    Zhang, Boxiang
    Ma, Ming
    Wang, Yue
    Du, Taoli
    Li, Wenhui
    APPLIED SOFT COMPUTING, 2025, 171
  • [27] Learning to Measure the Point Cloud Reconstruction Loss in a Representation Space
    Huang, Tianxin
    Ding, Zhonggan
    Zhang, Jiangning
    Tai, Ying
    Zhang, Zhenyu
    Chen, Mingang
    Wang, Chengjie
    Liu, Yong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12208 - 12217
  • [28] Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering
    Mei, Guofeng
    Saltori, Cristiano
    Ricci, Elisa
    Sebe, Nicu
    Wu, Qiang
    Zhang, Jian
    Poiesi, Fabio
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (08) : 3251 - 3269
  • [29] Point-MPP: Point Cloud Self-Supervised Learning From Masked Position Prediction
    Fan, Songlin
    Gao, Wei
    Li, Ge
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [30] Scalability of Learning Tasks on 3D CAE Models Using Point Cloud Autoencoders
    Rios, Thiago
    Wollstadt, Patricia
    van Stein, Bas
    Baeck, Thomas
    Xu, Zhao
    Sendhoff, Bernhard
    Menzel, Stefan
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 1367 - 1374