Point Clouds are Specialized Images: A Knowledge Transfer Approach for 3D Understanding

被引:0
|
作者
Kang, Jiachen [1 ]
Jia, Wenjing [1 ]
He, Xiangjian [2 ]
Lam, Kin Man [3 ]
机构
[1] Univ Technol Sydney, Sch Elect & Data Engn, Sydney, NSW 2007, Australia
[2] Univ Nottingham Ningbo, Sch Comp Sci, Ningbo 315100, Peoples R China
[3] Hong Kong Polytech Univ, Dept Elect & Elect Engn, Kowloon, Hong Kong, Peoples R China
关键词
Point cloud compression; Three-dimensional displays; Transformers; Task analysis; Data models; Image coding; Knowledge transfer; Cross-modal learning; point cloud understanding; self-supervision; transfer learning;
D O I
10.1109/TMM.2024.3412330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised representation learning (SSRL) has gained increasing attention in point cloud understanding, in addressing the challenges posed by 3D data scarcity and high annotation costs. This paper presents PCExpert, a novel SSRL approach that reinterprets point clouds as "specialized images". This conceptual shift allows PCExpert to leverage knowledge derived from large-scale image modality in a more direct and deeper manner, via extensively sharing the parameters with a pre-trained image encoder in a multi-way Transformer architecture. The parameter sharing strategy, combined with an additional pretext task for pre-training, i.e., transformation estimation, empowers PCExpert to outperform the state of the arts in a variety of tasks, with a remarkable reduction in the number of trainable parameters. Notably, PCExpert's performance under LINEAR fine-tuning (e.g., yielding a 90.02% overall accuracy on ScanObjectNN) has already closely approximated the results obtained with FULL model fine-tuning (92.66%), demonstrating its effective representation capability.
引用
收藏
页码:10755 / 10765
页数:11
相关论文
共 50 条
  • [21] A 3D white referencing method for soybean leaves based on fusion of hyperspectral images and 3D point clouds
    Zhang, Libo
    Jin, Jian
    Wang, Liangju
    Huang, Peikui
    Ma, Dongdong
    PRECISION AGRICULTURE, 2020, 21 (06) : 1173 - 1186
  • [22] Fusion Method of Infrared Images and 3D Point Clouds Based on Cross Markers
    Yelong, Zheng
    Changyong, Li
    Ningning, Xia
    Lingyi, Li
    Guomin, Zhang
    Meirong, Zhao
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2024, 57 (10): : 1090 - 1099
  • [23] A Novel Interactive Fusion Method with Images and Point Clouds for 3D Object Detection
    Xu, Kai
    Yang, Zhile
    Xu, Yangjie
    Feng, Liangbing
    APPLIED SCIENCES-BASEL, 2019, 9 (06):
  • [24] Face Recognition on 3D Point Clouds
    Zhang, Ziyu
    Da, Feipeng
    Wang, Chenxing
    Yu, Jian
    Yu, Yi
    SEVENTH INTERNATIONAL CONFERENCE ON OPTICAL AND PHOTONIC ENGINEERING (ICOPEN 2019), 2019, 11205
  • [25] 3D Object Detection Using Frustums and Attention Modules for Images and Point Clouds
    Li, Yiran
    Xie, Han
    Shin, Hyunchul
    SIGNALS, 2021, 2 (01): : 98 - 107
  • [26] Cultural relic 3D reconstruction from digital images and laser point clouds
    Liu Jie
    Zhang Jianqing
    Xu Jia
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 2, PROCEEDINGS, 2008, : 349 - 353
  • [27] Combining spaceborne SAR images with 3D point clouds for infrastructure monitoring applications
    Anghel, Andrei
    Vasile, Gabriel
    Boudon, Remy
    d'Urso, Guy
    Girard, Alexandre
    Boldo, Didier
    Bost, Veronique
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 111 : 45 - 61
  • [28] On the Segmentation of 3D LIDAR Point Clouds
    Douillard, B.
    Underwood, J.
    Kuntz, N.
    Vlaskine, V.
    Quadros, A.
    Morton, P.
    Frenkel, A.
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011,
  • [29] Meshfree thinning of 3D point clouds
    Dyn, Nira
    Iske, Armin
    Wendland, Holger
    FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2008, 8 (04) : 409 - 425
  • [30] Generating 3D Adversarial Point Clouds
    Xiang, Chong
    Qi, Charles R.
    Li, Bo
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9128 - 9136