VFL3D: A Single-Stage Fine-Grained Lightweight Point Cloud 3D Object Detection Algorithm Based on Voxels

被引:3
|
作者
Li, Bing [1 ,2 ,3 ]
Chen, Jie [4 ,5 ]
Li, Xinde [3 ,6 ,7 ]
Xu, Rui [2 ]
Li, Qian [2 ]
Cao, Yice [2 ]
Wu, Jun [2 ]
Qu, Lei [2 ]
Li, Yingsong [2 ]
Diniz, Paulo S. R. [8 ,9 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Anhui Univ, Sch Elect & Informat Engn, Hefei 230601, Peoples R China
[3] Nanjing Ctr Appl Math, Nanjing 211135, Peoples R China
[4] Anhui Univ, Informat Mat & Intelligent Sensing Lab Anhui Prov, Hefei 230601, Peoples R China
[5] China Elect Technol Grp Corp, Res Inst 38, Hefei 230088, Peoples R China
[6] Southeast Univ, Sch Automat, Key Lab Measurement & Control CSE, Nanjing 210096, Peoples R China
[7] Southeast Univ, Shenzhen Res Inst, Shenzhen 518063, Peoples R China
[8] Univ Fed Rio de Janeiro, Program Elect Engn, COPPE Poli, BR-21941909 Rio De Janeiro, Brazil
[9] Univ Fed Rio de Janeiro, Dept Elect & Comp Engn, COPPE Poli, BR-21941909 Rio De Janeiro, Brazil
基金
中国国家自然科学基金;
关键词
Feature extraction; Point cloud compression; Three-dimensional displays; Object detection; Convolution; Data mining; Computational efficiency; Single-stage; fine-grained; lightweight; multibranch cross-sparse convolution network; compact fine-grained self-attention augmented module;
D O I
10.1109/TITS.2024.3373227
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In this work, we propose a voxel-based single-stage fine-grained and efficient point cloud 3D object detection algorithm to address the inadequate granularity in point cloud feature extraction tasks and the imbalance between efficiency and accuracy in single-stage point cloud 3D object detection scenarios. We develop a lightweight multibranch cross-sparse convolution network (LMCCN) that is designed to preserve the feature granularity of the original point cloud while achieving enhanced extraction efficiency. Additionally, we introduce a compact fine-grained self-attention augmented bird's eye view (BEV) feature extraction module (CFSAM). This module aims to further refine BEV features, enabling the acquisition of both locally and globally enhanced features and thereby augmentingthe perceptual capabilities of the constructed model. Without bells and whistles, the proposed method attains excellent performance on many autonomous driving benchmarks, with detection accuracies of up to 81.67% on KITTI, 72.74% on ONCE, and 84.00% on nuScenes. Moreover, it reaches a peak detection speed of 46.08 FPS, effectively balancing accuracy with speed.
引用
收藏
页码:12034 / 12048
页数:15
相关论文
共 50 条
  • [31] 3D object detection based on point cloud in automatic driving scene
    Li, Hai-Sheng
    Lu, Yan-Ling
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 13029 - 13044
  • [32] Intracranial aneurysm detection based on 3D point cloud object detection method
    Li, Jun
    Liu, Juntong
    Wang, Jiaqi
    Wang, Peipei
    Ye, Mingquan
    COGENT ENGINEERING, 2024, 11 (01):
  • [33] 3D object detection based on point cloud in automatic driving scene
    Hai-Sheng Li
    Yan-Ling Lu
    Multimedia Tools and Applications, 2024, 83 : 13029 - 13044
  • [34] 3D Object Detection from Point Cloud Based on Deep Learning
    Hao, Ning
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [35] Detection based object labeling of 3D point cloud for indoor scenes
    Liu, Wei
    Li, Shaozi
    Cao, Donglin
    Su, Songzhi
    Ji, Rongrong
    NEUROCOMPUTING, 2016, 174 : 1101 - 1106
  • [36] 3D Object Detection Based on Feature Fusion of Point Cloud Sequences
    Zhai, Zhenyu
    Wang, Qiantong
    Pan, Zongxu
    Hu, Wenlong
    Hu, Yuxin
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1240 - 1245
  • [37] DCNet: exploring fine-grained vision classification for 3D point clouds
    Wu, Rusong
    Bai, Jing
    Li, Wenjing
    Jiang, Jinzhe
    VISUAL COMPUTER, 2024, 40 (02): : 781 - 797
  • [38] DCNet: exploring fine-grained vision classification for 3D point clouds
    Rusong Wu
    Jing Bai
    Wenjing Li
    Jinzhe Jiang
    The Visual Computer, 2024, 40 (2) : 781 - 797
  • [39] MSPV3D: Multi-Scale Point-Voxels 3D Object Detection Net
    Zhang, Zheng
    Bao, Zhiping
    Wei, Yun
    Zhou, Yongsheng
    Li, Ming
    Tian, Qing
    REMOTE SENSING, 2024, 16 (17)
  • [40] 3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation
    Xie, Guangda
    Li, Yang
    Wang, Yanping
    Li, Ziyi
    Qu, Hongquan
    REMOTE SENSING, 2023, 15 (12)