DualMLP: a two-stream fusion model for 3D point cloud classification

Cited by: 3

Authors:

Paul, Sneha [1]

Patterson, Zachary [1]

Bouguila, Nizar [1]

Affiliations:

[1] Concordia Univ, Concordia Inst Informat Syst Engn CIISE, Montreal, PQ, Canada

Source:

VISUAL COMPUTER | 2024 / Vol. 40 / Issue 08

Funding:

UK Research and Innovation (UKRI);

Keywords:

Point cloud classification; 3D computer vision; Supervised learning; Neural networks

DOI:

10.1007/s00371-023-03114-3

Chinese Library Classification (CLC) number:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

In this paper, we present DualMLP, a novel 3D model that introduces the idea of a two-stream network for existing 3D models to handle the trade-off between the number of points and the computational overhead. Existing works on point clouds use a small subset of points sampled from the entire 3D object as input. Although increasing the number of input points can enhance scene understanding, it also incurs a higher computational cost for existing networks. To tackle this challenge, we propose a novel architecture called DualMLP, which effectively mitigates the linear increase in computational expense as the number of input points grows. While we evaluate this concept on PointMLP and demonstrate its effectiveness, the idea can be applied to other existing models with minimal adjustments. DualMLP consists of two branches: DenseNet and SparseNet. The SparseNet, a relatively larger network, takes a small number of points sampled from the complete point cloud as input, while the DenseNet, a lightweight network, takes in a larger number of points. Extensive experiments on the ScanObjectNN and ModelNet40 datasets demonstrate the effectiveness of the proposed model, which achieves improvements of 1.00% and 0.81% over PointMLP on ScanObjectNN and ModelNet40, respectively, while being more computationally efficient than the original PointMLP. To ensure the reproducibility of our experimental results, the code for this work is publicly available at https://github.com/snehaputul/DualMLP.
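As a rough illustration of the two-branch idea in the abstract, the sketch below runs a larger per-point network on a small sample and a lightweight one on many points, then fuses the two pooled feature vectors. It is not the authors' implementation (the linked repository is the authoritative source). Random sampling, fusion by concatenation, the layer sizes, the point counts, and every function name here are assumptions chosen for illustration.

```python
import random

def make_layer(n_in, n_out, rng):
    """Random weight matrix (n_out rows of n_in weights); stands in for trained parameters."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]

def linear_relu(vec, layer):
    """One fully connected layer followed by ReLU."""
    return [max(0.0, sum(w * x for w, x in zip(row, vec))) for row in layer]

def encode(points, layers):
    """Apply the same MLP to every point, then max-pool over points."""
    feats = []
    for p in points:
        h = list(p)
        for layer in layers:
            h = linear_relu(h, layer)
        feats.append(h)
    return [max(col) for col in zip(*feats)]

def dual_stream_features(cloud, n_sparse, n_dense, sparse_layers, dense_layers, rng):
    """Hypothetical fusion: concatenate the pooled features of both branches."""
    sparse_pts = rng.sample(cloud, n_sparse)  # few points -> larger network
    dense_pts = rng.sample(cloud, n_dense)    # many points -> lightweight network
    return encode(sparse_pts, sparse_layers) + encode(dense_pts, dense_layers)

def multiply_adds(n_points, layers):
    """Rough cost model: per-point multiply-adds times the number of points."""
    return n_points * sum(len(layer) * len(layer[0]) for layer in layers)

rng = random.Random(0)
cloud = [(rng.random(), rng.random(), rng.random()) for _ in range(2048)]
sparse_layers = [make_layer(3, 32, rng), make_layer(32, 64, rng)]  # "larger" branch
dense_layers = [make_layer(3, 8, rng)]                             # "lightweight" branch

fused = dual_stream_features(cloud, 256, 2048, sparse_layers, dense_layers, rng)
dual_cost = multiply_adds(256, sparse_layers) + multiply_adds(2048, dense_layers)
single_cost = multiply_adds(2048, sparse_layers)  # larger network on every point
print(len(fused), dual_cost, single_cost)
```

Under this toy cost model, the per-point cost of each branch is fixed, so feeding all 2048 points only to the lightweight branch keeps the total well below running the larger network on every point. This is the trade-off the abstract describes.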

Pages: 5435-5449

Number of pages: 15

