MULTISCALE REPRESENTATIONS LEARNING TRANSFORMER FRAMEWORK FOR POINT CLOUD CLASSIFICATION

Citations: 0
Authors
Sun, Yajie [1]
Zia, Ali [2, 3]
Zhou, Jun
Affiliations
[1] Griffith Univ, Sch Informat & Commun Technol, Brisbane, Qld, Australia
[2] CSIRO Agr & Food, Northam, WA, Australia
[3] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia
Keywords
Point cloud classification; multi-scale features; geometric features; multi-scale transformer; 3D computer vision;
DOI
10.1109/ICIP49359.2023.10223135
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Extracting and aggregating multiple feature representations from various scales has become key to point cloud classification. The Vision Transformer (ViT) is a representative solution along this line, but it lacks the capability to model detailed multi-scale features and their interactions. In addition, learning efficient and effective representations from a point cloud is challenging due to its irregular, unordered, and sparse nature. To tackle these problems, we propose a novel multi-scale representation learning transformer framework that employs various geometric features beyond common Cartesian coordinates. Our approach enriches the description of point clouds with local geometric relationships and groups them at multiple scales. This scale information is aggregated, and new patches are then extracted to minimize feature overlay. A bottleneck projection head is then adopted to enhance the information, and all patches are fed to multi-head attention to capture the deep dependencies among representations across patches. Evaluation on public benchmark datasets shows the competitive performance of our framework on point cloud classification.
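The multi-scale grouping step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name a sampling or grouping method, so farthest point sampling, k-nearest-neighbor grouping, the choice of relative offset plus distance as the "geometric features", and all function names and scale values below are assumptions made for illustration only.

```python
import math
import random


def farthest_point_sample(points, m):
    """Greedily pick m well-spread centroid indices (assumed sampling method)."""
    chosen = [0]
    dist = [math.dist(p, points[0]) for p in points]
    while len(chosen) < m:
        nxt = max(range(len(points)), key=dist.__getitem__)
        chosen.append(nxt)
        # Keep, for every point, its distance to the closest chosen centroid.
        dist = [min(d, math.dist(p, points[nxt])) for d, p in zip(dist, points)]
    return chosen


def group_with_geometry(points, centroid_ids, k):
    """For each centroid, gather its k nearest points and describe each one by
    (relative offset, Euclidean distance) rather than raw Cartesian coordinates."""
    groups = []
    for c in centroid_ids:
        center = points[c]
        nearest = sorted(points, key=lambda p: math.dist(p, center))[:k]
        feats = [
            tuple(a - b for a, b in zip(p, center)) + (math.dist(p, center),)
            for p in nearest
        ]
        groups.append(feats)
    return groups


def multiscale_groups(points, num_centroids=8, scales=(4, 8, 16)):
    """Group the same centroids at several neighborhood sizes (hypothetical scales)."""
    centroid_ids = farthest_point_sample(points, num_centroids)
    return {k: group_with_geometry(points, centroid_ids, k) for k in scales}


random.seed(0)
cloud = [(random.random(), random.random(), random.random()) for _ in range(64)]
patches = multiscale_groups(cloud)
```

Each entry of `patches` holds, for one scale, a list of local groups whose rows are 4-dimensional geometric descriptors. In a framework like the one the abstract outlines, such groups would then be aggregated, projected, and passed to multi-head attention; those stages are not sketched here.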
Pages: 3354 - 3358
Page count: 5
Related papers
50 items in total
  • [31] DGC-TnT: Enhancing Point Cloud Object Classification by Dynamic Graph Convolutions With Transformer in Transformer
    Lin, Chien-Chou
    Chen, Po-Yu
    IEEE ACCESS, 2024, 12 : 111924 - 111931
  • [32] Full Transformer Framework for Robust Point Cloud Registration With Deep Information Interaction
    Chen, Guangyan
    Wang, Meiling
    Zhang, Qingxiang
    Yuan, Li
    Yue, Yufeng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 35 (10) : 1 - 15
  • [33] Learning Robust Graph-Convolutional Representations for Point Cloud Denoising
    Pistilli, Francesca
    Fracastoro, Giulia
    Valsesia, Diego
    Magli, Enrico
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 402 - 414
  • [34] PU-Transformer: Point Cloud Upsampling Transformer
    Qiu, Shi
    Anwar, Saeed
    Barnes, Nick
    COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 326 - 343
  • [35] Multiscale Point Cloud Geometry Compression
    Wang, Jianqiang
    Ding, Dandan
    Li, Zhu
    Ma, Zhan
    2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021 : 73 - 82
  • [36] Point-voxel dual stream transformer for 3d point cloud learning
    Zhao, Tianmeng
    Zeng, Hui
    Zhang, Baoqing
    Fan, Bin
    Li, Chen
    VISUAL COMPUTER, 2024, 40 (08) : 5323 - 5339
  • [37] A Multiscale and Hierarchical Feature Extraction Method for Terrestrial Laser Scanning Point Cloud Classification
    Wang, Zhen
    Zhang, Liqiang
    Fang, Tian
    Mathiopoulos, P. Takis
    Tong, Xiaohua
    Qu, Huamin
    Xiao, Zhiqiang
    Li, Fang
    Chen, Dong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (05) : 2409 - 2425
  • [38] An Efficient and General Framework for Aerial Point Cloud Classification in Urban Scenarios
    Ozdemir, Emre
    Remondino, Fabio
    Golkar, Alessandro
    REMOTE SENSING, 2021, 13 (10)
  • [39] Local region-learning modules for point cloud classification
    Turgut, Kaya
    Dutagaci, Helin
    MACHINE VISION AND APPLICATIONS, 2024, 35 (01)
  • [40] PointHop: An Explainable Machine Learning Method for Point Cloud Classification
    Zhang, Min
    You, Haoxuan
    Kadam, Pranav
    Liu, Shan
    Kuo, C-C Jay
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (07) : 1744 - 1755