DualMLP: a two-stream fusion model for 3D point cloud classification

被引:3
|
作者
Paul, Sneha [1 ]
Patterson, Zachary [1 ]
Bouguila, Nizar [1 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn CIISE, Montreal, PQ, Canada
来源
VISUAL COMPUTER | 2024年 / 40卷 / 08期
基金
英国科研创新办公室;
关键词
Point cloud classification; 3D computer vision; Supervised learning; NEURAL-NETWORKS;
D O I
10.1007/s00371-023-03114-3
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we present DualMLP, a novel 3D model that introduces the idea of a two-stream network for existing 3D models to handle the trade-off between the number of points and the computational overhead. Existing works on point clouds use a small subset of points sampled from the entire 3D object as input. Although increasing the number of input points can enhance scene understanding, it also incurs a higher computational cost for existing networks. To tackle this challenge, we propose a novel architecture called DualMLP, which effectively mitigates the linear increase in computational expense as the number of input points grows. While we evaluate this concept on PointMLP and demonstrate its effectiveness, the idea can be applied to other existing models with minimal adjustments. DualMLP consists of two branches: DenseNet and SparseNet. The SparseNet, a relatively larger network, samples a small number of points from the complete point cloud, while the DenseNet, a lightweight network, takes in a larger number of points as input. Extensive experiments on the ScanObjectNN and ModelNet40 datasets demonstrate the effectiveness of the proposed model, achieving a 1.00% and 0.81% improvement over PointMLP for ScanObjectNN and ModelNet40 while being computationally efficient than the original PointMLP. To ensure the reproducibility of our experimental results, the code for this work is publicly available at https://github.com/snehaputul/DualMLP.
引用
收藏
页码:5435 / 5449
页数:15
相关论文
共 50 条
  • [31] Point-Sim: A Lightweight Network for 3D Point Cloud Classification
    Guo, Jiachen
    Luo, Wenjie
    ALGORITHMS, 2024, 17 (04)
  • [32] An Improved Two-stream 3D Convolutional Neural Network for Human Action Recognition
    Chen, Jun
    Xu, Yuanping
    Zhang, Chaolong
    Xu, Zhijie
    Meng, Xiangxiang
    Wang, Jie
    2019 25TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC), 2019, : 135 - 140
  • [33] Improving human action recognition with two-stream 3D convolutional neural network
    Van-Minh Khong
    Thanh-Hai Tran
    2018 1ST INTERNATIONAL CONFERENCE ON MULTIMEDIA ANALYSIS AND PATTERN RECOGNITION (MAPR), 2018,
  • [34] 3D Human Pose Estimation Using Two-Stream Architecture with Joint Training
    Kang, Jian
    Fan, Wanshu
    Li, Yijing
    Liu, Rui
    Zhou, Dongsheng
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 137 (01): : 607 - 629
  • [35] Dynamic Gesture Recognition Combining Two-stream 3D Convolution with Attention Mechanisms
    Wang Fenhua
    Zhang Qiang
    Huang Chao
    Zhang Ran
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (05) : 1389 - 1396
  • [36] VirtualActionNet: A strong two-stream point cloud sequence network for human action recognition
    Li, Xing
    Huang, Qian
    Wang, Zhijian
    Yang, Tianjin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89
  • [37] GeometryMotion-Net: A Strong Two-Stream Baseline for 3D Action Recognition
    Liu, Jiaheng
    Xu, Dong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4711 - 4721
  • [38] Fuzzy Fusion for Two-stream Action Recognition
    Sousa e Santos, Anderson Carlos
    Maia, Helena de Almeida
    Roberto e Souza, Marcos
    Vieira, Marcelo Bernardes
    Pedrini, Helio
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 117 - 123
  • [39] Probabilistic 3D Point Cloud Fusion on Graphics Processors for Automotive
    Behmann, Nicolai
    Cheng, Yihan
    Schleusner, Jens
    Blume, Holger
    2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,
  • [40] Evaluating Two-Stream CNN for Video Classification
    Ye, Hao
    Wu, Zuxuan
    Zhao, Rui-Wei
    Wang, Xi
    Jiang, Yu-Gang
    Xue, Xiangyang
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 435 - 442