MULTISCALE REPRESENTATIONS LEARNING TRANSFORMER FRAMEWORK FOR POINT CLOUD CLASSIFICATION

被引：0

作者：

Sun, Yajie ^{[1
]}

Zia, Ali ^{[2
,3
]}

Zhou, Jun

机构：

[1] Griffith Univ, Sch Informat & Commun Technol, Brisbane, Qld, Australia

[2] CSIRO Agr & Food, Northam, WA, Australia

[3] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年

关键词：

Point cloud classification; multi-scale features; geometric features; multi-scale transformer; 3D computer vision;

D O I：

10.1109/ICIP49359.2023.10223135

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Extracting and aggregating multiple feature representations from various scales have become the key to point cloud classification tasks. Vision Transformer (ViT) is a representative solution along this line, but it lacks the capability to model detailed multi-scale features and their interactions. In addition, learning efficient and effective representation from the point cloud is challenging due to its irregular, unordered, and sparse nature. To tackle these problems, we propose a novel multi-scale representation learning transformer framework, employing various geometric features beyond common Cartesian coordinates. Our approach enriches the description of point clouds by local geometric relationships and group them at multiple scales. This scale information is aggregated and then new patches can be extracted to minimize feature overlay. The bottleneck projection head is then adopted to enhance the information and feed all patches to the multi-head attention to capture the deep dependencies among representations across patches. Evaluation on public benchmark datasets shows the competitive performance of our framework on point cloud classification.

引用

下载

页码：3354 / 3358

页数：5

共 50 条

[31] DGC-TnT: Enhancing Point Cloud Object Classification by Dynamic Graph Convolutions With Transformer in Transformer
Lin, Chien-Chou
Chen, Po-Yu
IEEE ACCESS, 2024, 12 : 111924 - 111931
[32] Full Transformer Framework for Robust Point Cloud Registration With Deep Information Interaction
Chen, Guangyan
Wang, Meiling
Zhang, Qingxiang
Yuan, Li
Yue, Yufeng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 35 (10) : 1 - 15
[33] Learning Robust Graph-Convolutional Representations for Point Cloud Denoising
Pistilli, Francesca
Fracastoro, Giulia
Valsesia, Diego
Magli, Enrico
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 402 - 414
[34] PU-Transformer: Point Cloud Upsampling Transformer
Qiu, Shi
Anwar, Saeed
Barnes, Nick
COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 326 - 343
[35] Multiscale Point Cloud Geometry Compression
Wang, Jianqiang
Ding, Dandan
Li, Zhu
Ma, Zhan
2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 73 - 82
[36] Point-voxel dual stream transformer for 3d point cloud learning
Zhao, Tianmeng
Zeng, Hui
Zhang, Baoqing
Fan, Bin
Li, Chen
VISUAL COMPUTER, 2024, 40 (08): : 5323 - 5339
[37] A Multiscale and Hierarchical Feature Extraction Method for Terrestrial Laser Scanning Point Cloud Classification
Wang, Zhen
Zhang, Liqiang
Fang, Tian
Mathiopoulos, P. Takis
Tong, Xiaohua
Qu, Huamin
Xiao, Zhiqiang
Li, Fang
Chen, Dong
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (05): : 2409 - 2425
[38] An Efficient and General Framework for Aerial Point Cloud Classification in Urban Scenarios
Ozdemir, Emre
Remondino, Fabio
Golkar, Alessandro
REMOTE SENSING, 2021, 13 (10)
[39] Local region-learning modules for point cloud classification
Turgut, Kaya
Dutagaci, Helin
MACHINE VISION AND APPLICATIONS, 2024, 35 (01)
[40] PointHop: An Explainable Machine Learning Method for Point Cloud Classification
Zhang, Min
You, Haoxuan
Kadam, Pranav
Liu, Shan
Kuo, C-C Jay
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (07) : 1744 - 1755

← 1 2 3 4 5 →