MULTISCALE REPRESENTATIONS LEARNING TRANSFORMER FRAMEWORK FOR POINT CLOUD CLASSIFICATION

被引:0
|
作者
Sun, Yajie [1 ]
Zia, Ali [2 ,3 ]
Zhou, Jun
机构
[1] Griffith Univ, Sch Informat & Commun Technol, Brisbane, Qld, Australia
[2] CSIRO Agr & Food, Northam, WA, Australia
[3] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia
关键词
Point cloud classification; multi-scale features; geometric features; multi-scale transformer; 3D computer vision;
D O I
10.1109/ICIP49359.2023.10223135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting and aggregating multiple feature representations from various scales have become the key to point cloud classification tasks. Vision Transformer (ViT) is a representative solution along this line, but it lacks the capability to model detailed multi-scale features and their interactions. In addition, learning efficient and effective representation from the point cloud is challenging due to its irregular, unordered, and sparse nature. To tackle these problems, we propose a novel multi-scale representation learning transformer framework, employing various geometric features beyond common Cartesian coordinates. Our approach enriches the description of point clouds by local geometric relationships and group them at multiple scales. This scale information is aggregated and then new patches can be extracted to minimize feature overlay. The bottleneck projection head is then adopted to enhance the information and feed all patches to the multi-head attention to capture the deep dependencies among representations across patches. Evaluation on public benchmark datasets shows the competitive performance of our framework on point cloud classification.
引用
下载
收藏
页码:3354 / 3358
页数:5
相关论文
共 50 条
  • [1] Point cloud classification based on transformer
    Wu, Xianfeng
    Liu, Xinyi
    Wang, Junfei
    Lai, Zhongyuan
    Zhou, Jing
    Liu, Xia
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [2] MPCT: Multiscale Point Cloud Transformer With a Residual Network
    Wu, Yue
    Liu, Jiaming
    Gong, Maoguo
    Liu, Zhixiao
    Miao, Qiguang
    Ma, Wenping
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3505 - 3516
  • [3] BubblEX: An Explainable Deep Learning Framework for Point-Cloud Classification
    Matrone, Francesca
    Paolanti, Marina
    Felicetti, Andrea
    Martini, Massimo
    Pierdicca, Roberto
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 6571 - 6587
  • [4] SparseFormer: Sparse transformer network for point cloud classification
    Wang, Yong
    Liu, Yangyang
    Zhou, Pengbo
    Geng, Guohua
    Zhang, Qi
    COMPUTERS & GRAPHICS-UK, 2023, 116 : 24 - 32
  • [5] SRINet: Learning Strictly Rotation-Invariant Representations for Point Cloud Classification and Segmentation
    Sun, Xiao
    Lian, Zhouhui
    Xiao, Jianguo
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 980 - 988
  • [6] PVT: Point-voxel transformer for point cloud learning
    Zhang, Cheng
    Wan, Haocheng
    Shen, Xinyi
    Wu, Zizhao
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 11985 - 12008
  • [7] Multiscale geometric window transformer for orthodontic teeth point cloud registration
    Wang, Hao
    Tian, Yan
    Xu, Yongchuan
    Xu, Jiahui
    Yang, Tao
    Lu, Yan
    Chen, Hong
    MULTIMEDIA SYSTEMS, 2024, 30 (03)
  • [8] Deep Closest Point: Learning Representations for Point Cloud Registration
    Wang, Yue
    Solomon, Justin M.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3522 - 3531
  • [9] ChainFrame: A Chain Framework for Point Cloud Classification
    Wang, Tianlei
    Fu, Mingsheng
    Chen, Keyu
    Li, Fan
    Qu, Hong
    Luo, Ma
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (03) : 4451 - 4462
  • [10] RST: Rough Set Transformer for Point Cloud Learning
    Sun, Xinwei
    Zeng, Kai
    SENSORS, 2023, 23 (22)