FMTT: Fused Multi-head Transformer with Tensor-compression for 3D Point Clouds Detection on Edge Devices

Cited by: 0
Authors
Wei, Zikun [1]
Wang, Tingting [1]
Ding, Chenchen [1]
Wang, Bohan [1]
Guan, Ziyi [1]
Huang, Hantao [1]
Yu, Hao [1]
Affiliations
[1] Southern Univ Sci & Technol, Sch Microelect, Shenzhen, Peoples R China
Keywords
Deep Learning; 3D Object Detection; Tensor Compression
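DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract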
Real-time 3D object detection remains a major challenge on edge devices. Existing 3D point cloud models are over-parameterized and carry a heavy computational load. This paper proposes a highly compact model for 3D point cloud detection based on tensor compression. In contrast to conventional methods, the proposed fused multi-head transformer tensor-compression (FMTT) achieves both a compact model size and high accuracy. FMTT uses different ranks to extract both high-level and low-level features and then fuses them to improve accuracy. Experiments on the KITTI dataset show that FMTT shrinks the model by 6.04x relative to the uncompressed model, from 55.09 MB to 9.12 MB, so that the compressed model can be deployed on edge devices. It also improves accuracy by 2.62% in easy mode and by 0.28% in hard mode.
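To make the size arithmetic concrete, the following is a minimal sketch, not the authors' method. It assumes a plain rank-r matrix factorization as a stand-in for the paper's tensor compression, whose exact form the abstract does not specify. The layer dimensions, the ranks, and all function names are hypothetical. Only the 55.09 MB and 9.12 MB figures come from the abstract.

```python
from typing import Sequence

# Illustrative sketch only: a rank-r factorization W ~ A @ B stands in for
# the paper's tensor compression, which the abstract does not detail.


def dense_params(d_in: int, d_out: int) -> int:
    """Weights in an uncompressed d_in x d_out linear layer."""
    return d_in * d_out


def low_rank_params(d_in: int, d_out: int, rank: int) -> int:
    """Weights in a rank-`rank` factorization: (d_in x rank) plus (rank x d_out)."""
    return rank * (d_in + d_out)


def fused_params(d_in: int, d_out: int, ranks: Sequence[int]) -> int:
    """Total weights when one low-rank branch is kept per rank and the branch
    outputs are fused (for example, summed), which adds no extra weights."""
    return sum(low_rank_params(d_in, d_out, r) for r in ranks)


def compression_ratio(original: float, compressed: float) -> float:
    """How many times smaller the compressed size is than the original."""
    return original / compressed


# Hypothetical layer size and ranks, chosen only for illustration.
d_in, d_out, ranks = 512, 512, [4, 32]
ratio = compression_ratio(dense_params(d_in, d_out),
                          fused_params(d_in, d_out, ranks))
print(f"layer-level ratio: {ratio:.2f}x")    # prints "layer-level ratio: 7.11x"

# Model sizes in MB as reported in the abstract.
reported = compression_ratio(55.09, 9.12)
print(f"reported ratio: {reported:.2f}x")    # prints "reported ratio: 6.04x"
```

The second print reproduces the 6.04x figure from the reported model sizes. The first shows that, under these assumptions, keeping a low-rank branch and a higher-rank branch side by side still costs far fewer weights than the dense layer.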
Pages: 6