Fully Sparse Fusion for 3D Object Detection

被引:1
|
作者
Li Y. [1 ]
Fan L. [1 ]
Liu Y. [1 ]
Huang Z. [2 ]
Chen Y. [3 ]
Wang N. [2 ]
Zhang Z. [1 ]
机构
[1] Center for Research on Intelligent Perception and Computing (CRIPAC), State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), Institute of Automation, Chinese Academy of Sciences (CASIA), Beijing
[2] TuSimple, Beijing
[3] Centre for Artificial Intelligence and Robotics, Hong Kong Institute of Science and Innovation, Chinese Academy of Sciences (HKISI CAS), Hong Kong
关键词
3D object detection; autonomous driving; Cameras; Detectors; Feature extraction; fully sparse architecture; Instance segmentation; Laser radar; long-range perception; multi-sensor fusion; Point cloud compression; Three-dimensional displays;
D O I
10.1109/TPAMI.2024.3392303
中图分类号
学科分类号
摘要
Currently prevalent multi-modal 3D detection methods rely on dense detectors that usually use dense Bird&#x0027;s-Eye-View (BEV) feature maps. However, the cost of such BEV feature maps is quadratic to the detection range, making it not scalable for long-range detection. Recently, LiDAR-only fully sparse architecture has been gaining attention for its high efficiency in long-range perception. In this paper, we study how to develop a multi-modal fully sparse detector. Specifically, our proposed detector integrates the well-studied 2D instance segmentation into the LiDAR side, which is parallel to the 3D instance segmentation part in the LiDAR-only baseline. The proposed instance-based fusion framework maintains full sparsity while overcoming the constraints associated with the LiDAR-only fully sparse detector. Our framework showcases state-of-the-art performance on the widely used nuScenes dataset, Waymo Open Dataset, and the long-range Argoverse 2 dataset. Notably, the inference speed of our proposed method under the long-range perception setting is 2.7&#x00D7; faster than that of other state-of-the-art multimodal 3D detection methods. Code is released at <uri>https://github.com/BraveGroup/FullySparseFusion</uri>. IEEE
引用
收藏
页码:1 / 15
页数:14
相关论文
共 50 条
  • [1] Fully Sparse 3D Object Detection
    Fan, Lue
    Wang, Feng
    Wang, Naiyan
    Zhang, Zhaoxiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Sparse Dense Fusion for 3D Object Detection
    Gao, Yulu
    Sima, Chonghao
    Shi, Shaoshuai
    Di, Shangzhe
    Liu, Si
    Li, Hongyang
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10939 - 10946
  • [3] VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking
    Chen, Yukang
    Liu, Jianhui
    Zhang, Xiangyu
    Qi, Xiaojuan
    Jia, Jiaya
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21674 - 21683
  • [4] SRFDet3D: Sparse Region Fusion based 3D Object Detection
    Erabati G.K.
    Araujo H.
    Neurocomputing, 2024, 593
  • [5] VPSNet: 3D object detection with voxel purification and fully sparse convolutional networksVPSNet: 3D object detection with voxel purification and fully sparse...J. Wen et al.
    Jia Wen
    Qi Zhang
    Guanghao Zhang
    The Journal of Supercomputing, 81 (3)
  • [6] Super Sparse 3D Object Detection
    Fan, Lue
    Yang, Yuxue
    Wang, Feng
    Wang, Naiyan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12490 - 12505
  • [7] SparseFusion3D: Sparse Sensor Fusion for 3D Object Detection by Radar and Camera in Environmental Perception
    Yu, Zedong
    Wan, Weibing
    Ren, Maiyu
    Zheng, Xiuyuan
    Fang, Zhijun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1524 - 1536
  • [8] A multilevel fusion network for 3D object detection
    Xia, Chunlong
    Wei, Ping
    Wei, Wenwen
    Zheng, Nanning
    NEUROCOMPUTING, 2021, 437 : 107 - 117
  • [9] PointPainting: Sequential Fusion for 3D Object Detection
    Vora, Sourabh
    Lang, Alex H.
    Helou, Bassam
    Beijbom, Oscar
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4603 - 4611
  • [10] Dense Voxel Fusion for 3D Object Detection
    Mahmoud, Anas
    Hu, Jordan S. K.
    Waslander, Steven L.
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 663 - 672