VFL3D: A Single-Stage Fine-Grained Lightweight Point Cloud 3D Object Detection Algorithm Based on Voxels

被引:3
|
作者
Li, Bing [1 ,2 ,3 ]
Chen, Jie [4 ,5 ]
Li, Xinde [3 ,6 ,7 ]
Xu, Rui [2 ]
Li, Qian [2 ]
Cao, Yice [2 ]
Wu, Jun [2 ]
Qu, Lei [2 ]
Li, Yingsong [2 ]
Diniz, Paulo S. R. [8 ,9 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Anhui Univ, Sch Elect & Informat Engn, Hefei 230601, Peoples R China
[3] Nanjing Ctr Appl Math, Nanjing 211135, Peoples R China
[4] Anhui Univ, Informat Mat & Intelligent Sensing Lab Anhui Prov, Hefei 230601, Peoples R China
[5] China Elect Technol Grp Corp, Res Inst 38, Hefei 230088, Peoples R China
[6] Southeast Univ, Sch Automat, Key Lab Measurement & Control CSE, Nanjing 210096, Peoples R China
[7] Southeast Univ, Shenzhen Res Inst, Shenzhen 518063, Peoples R China
[8] Univ Fed Rio de Janeiro, Program Elect Engn, COPPE Poli, BR-21941909 Rio De Janeiro, Brazil
[9] Univ Fed Rio de Janeiro, Dept Elect & Comp Engn, COPPE Poli, BR-21941909 Rio De Janeiro, Brazil
基金
中国国家自然科学基金;
关键词
Feature extraction; Point cloud compression; Three-dimensional displays; Object detection; Convolution; Data mining; Computational efficiency; Single-stage; fine-grained; lightweight; multibranch cross-sparse convolution network; compact fine-grained self-attention augmented module;
D O I
10.1109/TITS.2024.3373227
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In this work, we propose a voxel-based single-stage fine-grained and efficient point cloud 3D object detection algorithm to address the inadequate granularity in point cloud feature extraction tasks and the imbalance between efficiency and accuracy in single-stage point cloud 3D object detection scenarios. We develop a lightweight multibranch cross-sparse convolution network (LMCCN) that is designed to preserve the feature granularity of the original point cloud while achieving enhanced extraction efficiency. Additionally, we introduce a compact fine-grained self-attention augmented bird's eye view (BEV) feature extraction module (CFSAM). This module aims to further refine BEV features, enabling the acquisition of both locally and globally enhanced features and thereby augmentingthe perceptual capabilities of the constructed model. Without bells and whistles, the proposed method attains excellent performance on many autonomous driving benchmarks, with detection accuracies of up to 81.67% on KITTI, 72.74% on ONCE, and 84.00% on nuScenes. Moreover, it reaches a peak detection speed of 46.08 FPS, effectively balancing accuracy with speed.
引用
收藏
页码:12034 / 12048
页数:15
相关论文
共 50 条
  • [21] GridNet-3D: A Novel Real-Time 3D Object Detection Algorithm Based on Point Cloud
    YUE Yuanchen
    CAI Yunfei
    WANG Dongsheng
    ChineseJournalofElectronics, 2021, 30 (05) : 931 - 939
  • [22] GridNet-3D: A Novel Real-Time 3D Object Detection Algorithm Based on Point Cloud
    Yue Yuanchen
    Cai Yunfei
    Wang Dongsheng
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (05) : 931 - 939
  • [23] Fine-Grained Semantic Information Preservation and Misclassification-Aware Loss for 3D Point Cloud
    Zou, Yanmei
    Lin, Xuefei
    Yu, Hongshan
    Yang, Zhengeng
    Akhtar, Naveed
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7947 - 7954
  • [24] 3D point filtering algorithm for 3d object detection based on stereo image processing
    Kim J.-M.
    Park J.-M.
    Lee J.-W.
    Journal of Institute of Control, Robotics and Systems, 2021, 27 (09): : 676 - 684
  • [25] 3D Object Detection Algorithm Based on Raw Point Clouds
    Zhang, Dongdong
    Guo, Jie
    Chen, Yang
    Computer Engineering and Applications, 2024, 59 (03) : 209 - 217
  • [26] SMOKE: Single-Stage Monocular 3D Object Detection via Keypoint Estimation
    Liu, Zechen
    Wu, Zizhang
    Toth, Roland
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4289 - 4298
  • [27] Stereo Point Cloud Refinement for 3D Object Detection
    Liu, Wangchao
    Wang, Teng
    Wang, Yang
    Zhang, Xiangyu
    Lou, Xin
    2021 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2021) & 2021 IEEE CONFERENCE ON POSTGRADUATE RESEARCH IN MICROELECTRONICS AND ELECTRONICS (PRIMEASIA 2021), 2021, : 61 - 64
  • [28] DST3D: DLA-Swin Transformer for Single-Stage Monocular 3D Object Detection
    Wu, Zhihong
    Jiang, Xin
    Xu, Ruidong
    Lu, Ke
    Zhu, Yuan
    Wu, Mingzhi
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 411 - 418
  • [29] 3D object detection in voxelized point cloud scene
    Li Rui-long
    Wu Chuan
    Zhu Ming
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2022, 37 (10) : 1355 - 1363
  • [30] Multi-feature enhancement based on sparse networks for single-stage 3D object detection
    Ke, Zunwang
    Lin, Chenyu
    Zhang, Tao
    Jia, Tingting
    Du, Minghua
    Wang, Gang
    Zhang, Yugui
    ALEXANDRIA ENGINEERING JOURNAL, 2025, 111 : 123 - 135