MULTISCALE REPRESENTATIONS LEARNING TRANSFORMER FRAMEWORK FOR POINT CLOUD CLASSIFICATION

被引:0
|
作者
Sun, Yajie [1 ]
Zia, Ali [2 ,3 ]
Zhou, Jun
机构
[1] Griffith Univ, Sch Informat & Commun Technol, Brisbane, Qld, Australia
[2] CSIRO Agr & Food, Northam, WA, Australia
[3] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia
关键词
Point cloud classification; multi-scale features; geometric features; multi-scale transformer; 3D computer vision;
D O I
10.1109/ICIP49359.2023.10223135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting and aggregating multiple feature representations from various scales have become the key to point cloud classification tasks. Vision Transformer (ViT) is a representative solution along this line, but it lacks the capability to model detailed multi-scale features and their interactions. In addition, learning efficient and effective representation from the point cloud is challenging due to its irregular, unordered, and sparse nature. To tackle these problems, we propose a novel multi-scale representation learning transformer framework, employing various geometric features beyond common Cartesian coordinates. Our approach enriches the description of point clouds by local geometric relationships and group them at multiple scales. This scale information is aggregated and then new patches can be extracted to minimize feature overlay. The bottleneck projection head is then adopted to enhance the information and feed all patches to the multi-head attention to capture the deep dependencies among representations across patches. Evaluation on public benchmark datasets shows the competitive performance of our framework on point cloud classification.
引用
下载
收藏
页码:3354 / 3358
页数:5
相关论文
共 50 条
  • [21] 6-D Object Pose Estimation Using Multiscale Point Cloud Transformer
    Zhou, Guangliang
    Wang, Deming
    Yan, Yi
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [22] Learning isometry-invariant representations for point cloud analysis
    Sun, Xiao
    Huang, Yang
    Lian, Zhouhui
    PATTERN RECOGNITION, 2023, 134
  • [23] Learning Continuous Object Representations from Point Cloud Data
    Nelson, Henry J.
    Papanikolopoulos, Nikolaos
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 2446 - 2451
  • [24] Learning point cloud context information based on 3D transformer for more accurate and efficient classification
    Chen, Yiping
    Zhang, Shuai
    Lin, Weisheng
    Zhang, Shuhang
    Zhang, Wuming
    PHOTOGRAMMETRIC RECORD, 2023, 38 (184): : 603 - 616
  • [25] PCT: Point cloud transformer
    Meng-Hao Guo
    Jun-Xiong Cai
    Zheng-Ning Liu
    Tai-Jiang Mu
    Ralph R.Martin
    Shi-Min Hu
    Computational Visual Media, 2021, 7 (02) : 187 - 199
  • [26] PCT: Point cloud transformer
    Guo, Meng-Hao
    Cai, Jun-Xiong
    Liu, Zheng-Ning
    Mu, Tai-Jiang
    Martin, Ralph R.
    Hu, Shi-Min
    COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) : 187 - 199
  • [27] PCT: Point cloud transformer
    Meng-Hao Guo
    Jun-Xiong Cai
    Zheng-Ning Liu
    Tai-Jiang Mu
    Ralph R. Martin
    Shi-Min Hu
    Computational Visual Media, 2021, 7 : 187 - 199
  • [28] Classification of Airborne LiDAR Point Cloud Data Based on Multiscale Adaptive Features
    Yang Shujuan
    Zhang Keshu
    Shao Yongshe
    ACTA OPTICA SINICA, 2019, 39 (02)
  • [29] PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
    Li, Ruihui
    Li, Xianzhi
    Heng, Pheng-Ann
    Fu, Chi-Wing
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6377 - 6386
  • [30] Transformer encoder with multiscale deep learning for pain classification using physiological signals
    Lu, Zhenyuan
    Ozek, Burcu
    Kamarthi, Sagar
    FRONTIERS IN PHYSIOLOGY, 2023, 14