PointAcc: Efficient Point Cloud Accelerator

Cited by: 40
Authors
Lin, Yujun [1]
Zhang, Zhekai [1]
Tang, Haotian [1]
Wang, Hanrui [1]
Han, Song [1]
Affiliations
[1] MIT, Cambridge, MA 02139 USA
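Funding
U.S. National Science Foundation
Keywords
point cloud; neural network accelerator; sparse convolution
DOI
10.1145/3466752.3480084
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract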
Deep learning on point clouds plays a vital role in a wide range of applications such as autonomous driving and AR/VR. These applications interact with people in real time on edge devices and thus require low latency and low energy consumption. Compared to projecting the point cloud to 2D space, directly processing the 3D point cloud yields higher accuracy and requires fewer multiply-accumulate operations (MACs). However, the extreme sparsity of point clouds poses challenges to hardware acceleration. For example, the nonzero outputs must be determined explicitly and the nonzero neighbors of each output must be searched for (the mapping operation), which existing accelerators do not support. Furthermore, sparse features must be explicitly gathered and scattered, resulting in large data movement overhead. In this paper, we comprehensively analyze the performance bottlenecks of modern point cloud networks on CPU/GPU/TPU. To address these challenges, we then present PointAcc, a novel point cloud deep learning accelerator. PointAcc maps diverse mapping operations onto one versatile ranking-based kernel, streams the sparse computation with configurable caching, and temporally fuses consecutive dense layers to reduce the memory footprint. Evaluated on 8 point cloud models across 4 applications, PointAcc achieves a 3.7x speedup and 22x energy savings over an RTX 2080Ti GPU. Co-designed with lightweight neural networks, PointAcc outperforms the prior accelerator Mesorasi with a 100x speedup and 9.1% higher accuracy when running segmentation on the S3DIS dataset. PointAcc paves the way for efficient point cloud recognition.
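The mapping and gather/scatter steps named in the abstract can be illustrated with a small software sketch. This is not PointAcc's hardware design; it is a minimal Python illustration under stated assumptions: a stride-1 sparse convolution whose output coordinates equal the input coordinates, single-channel (scalar) features, and a sort-based neighbor search chosen here only as one plausible reading of "ranking-based". The names build_maps and sparse_conv are hypothetical.

```python
from itertools import product


def build_maps(coords, offsets):
    """Return (in_idx, out_idx, w_idx) triples for a stride-1 sparse convolution.

    Assumption of this sketch: outputs sit at the same coordinates as inputs,
    and all coordinates are unique. Neighbors are found by sorting shifted
    inputs together with outputs and comparing adjacent entries, which is a
    sort-based set intersection.
    """
    maps = []
    for w_idx, off in enumerate(offsets):
        # Tag 0 marks an output point q; tag 1 marks an input point p shifted
        # by -off, so a match means p == q + off.
        merged = [(c, 0, i) for i, c in enumerate(coords)]
        merged += [(tuple(a - b for a, b in zip(c, off)), 1, i)
                   for i, c in enumerate(coords)]
        merged.sort()
        # After sorting, equal coordinates are adjacent, output entry first.
        for (c0, t0, i0), (c1, t1, i1) in zip(merged, merged[1:]):
            if c0 == c1 and t0 == 0 and t1 == 1:
                maps.append((i1, i0, w_idx))
    return maps


def sparse_conv(coords, feats, weights, offsets):
    """Gather each mapped input feature, multiply by its weight, scatter-add."""
    out = [0.0] * len(coords)
    for in_idx, out_idx, w_idx in build_maps(coords, offsets):
        out[out_idx] += feats[in_idx] * weights[w_idx]
    return out


# Three nonzero points on a 2D grid, a 3x3 kernel with all-ones weights.
coords = [(0, 0), (0, 1), (2, 2)]
feats = [1.0, 2.0, 4.0]
offsets = list(product((-1, 0, 1), repeat=2))
weights = [1.0] * len(offsets)
print(sparse_conv(coords, feats, weights, offsets))  # [3.0, 3.0, 4.0]
```

Only 5 of the 27 candidate (output, offset) pairs produce a map in this example, which shows why the neighbor search and the irregular gather/scatter, rather than the arithmetic itself, tend to dominate the cost on very sparse inputs.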
Pages: 449-461
Page count: 13