VFL3D: A Single-Stage Fine-Grained Lightweight Point Cloud 3D Object Detection Algorithm Based on Voxels

被引:3
|
作者
Li, Bing [1 ,2 ,3 ]
Chen, Jie [4 ,5 ]
Li, Xinde [3 ,6 ,7 ]
Xu, Rui [2 ]
Li, Qian [2 ]
Cao, Yice [2 ]
Wu, Jun [2 ]
Qu, Lei [2 ]
Li, Yingsong [2 ]
Diniz, Paulo S. R. [8 ,9 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Anhui Univ, Sch Elect & Informat Engn, Hefei 230601, Peoples R China
[3] Nanjing Ctr Appl Math, Nanjing 211135, Peoples R China
[4] Anhui Univ, Informat Mat & Intelligent Sensing Lab Anhui Prov, Hefei 230601, Peoples R China
[5] China Elect Technol Grp Corp, Res Inst 38, Hefei 230088, Peoples R China
[6] Southeast Univ, Sch Automat, Key Lab Measurement & Control CSE, Nanjing 210096, Peoples R China
[7] Southeast Univ, Shenzhen Res Inst, Shenzhen 518063, Peoples R China
[8] Univ Fed Rio de Janeiro, Program Elect Engn, COPPE Poli, BR-21941909 Rio De Janeiro, Brazil
[9] Univ Fed Rio de Janeiro, Dept Elect & Comp Engn, COPPE Poli, BR-21941909 Rio De Janeiro, Brazil
基金
中国国家自然科学基金;
关键词
Feature extraction; Point cloud compression; Three-dimensional displays; Object detection; Convolution; Data mining; Computational efficiency; Single-stage; fine-grained; lightweight; multibranch cross-sparse convolution network; compact fine-grained self-attention augmented module;
D O I
10.1109/TITS.2024.3373227
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In this work, we propose a voxel-based single-stage fine-grained and efficient point cloud 3D object detection algorithm to address the inadequate granularity in point cloud feature extraction tasks and the imbalance between efficiency and accuracy in single-stage point cloud 3D object detection scenarios. We develop a lightweight multibranch cross-sparse convolution network (LMCCN) that is designed to preserve the feature granularity of the original point cloud while achieving enhanced extraction efficiency. Additionally, we introduce a compact fine-grained self-attention augmented bird's eye view (BEV) feature extraction module (CFSAM). This module aims to further refine BEV features, enabling the acquisition of both locally and globally enhanced features and thereby augmentingthe perceptual capabilities of the constructed model. Without bells and whistles, the proposed method attains excellent performance on many autonomous driving benchmarks, with detection accuracies of up to 81.67% on KITTI, 72.74% on ONCE, and 84.00% on nuScenes. Moreover, it reaches a peak detection speed of 46.08 FPS, effectively balancing accuracy with speed.
引用
收藏
页码:12034 / 12048
页数:15
相关论文
共 50 条
  • [1] Semantic Guided Fine-grained Point Cloud Quantization Framework for 3D Object Detection
    Feng, Xiaoyu
    Tang, Chen
    Zhang, Zongkai
    Sun, Wenyu
    Liu, Yongpan
    2023 28TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC, 2023, : 390 - 395
  • [2] A Lightweight Model for 3D Point Cloud Object Detection
    Li, Ziyi
    Li, Yang
    Wang, Yanping
    Xie, Guangda
    Qu, Hongquan
    Lyu, Zhuoyang
    APPLIED SCIENCES-BASEL, 2023, 13 (11):
  • [3] 3D point cloud object detection algorithm based on Transformer
    Liu M.
    Yang Q.
    Hu G.
    Guo Y.
    Zhang J.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2023, 41 (06): : 1190 - 1197
  • [4] 3D Object Representations for Fine-Grained Categorization
    Krause, Jonathan
    Stark, Michael
    Deng, Jia
    Li Fei-Fei
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 554 - 561
  • [5] Point Cloud 3D Object Detection Based on Improved SECOND Algorithm
    Zhang Ying
    Jiang Liangliang
    Zhang Dongbo
    Duan Wanlin
    Sun Yue
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (08)
  • [6] 3D Pose Estimation for Fine-Grained Object Categories
    Wang, Yaming
    Tan, Xiao
    Yang, Yi
    Liu, Xiao
    Ding, Errui
    Zhou, Feng
    Davis, Larry S.
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 619 - 632
  • [7] HVPR: Hybrid Voxel-Point Representation for Single-stage 3D Object Detection
    Noh, Jongyoun
    Lee, Sanghoon
    Ham, Bumsub
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14600 - 14609
  • [8] A Single-Stage 3D Object Detection Method Based on Sparse Attention Mechanism
    Jia, Songche
    Zhang, Zhenyu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 414 - 425
  • [9] Rethinking IoU-based Optimization for Single-stage 3D Object Detection
    Sheng, Hualian
    Cai, Sijia
    Zhao, Na
    Deng, Bing
    Huang, Jianqiang
    Hua, Xian-Sheng
    Zhao, Min-Jian
    Lee, Gim Hee
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 544 - 561
  • [10] Point cloud 3D object detection algorithm based on local information fusion
    Zhang, Linjie
    Chai, Zhilei
    Wang, Ning
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (11): : 2219 - 2229