Adapting Depth Distribution for 3D Object Detection with a Two-Stage Training Paradigm

被引:0
|
作者
Luo, Yixin [1 ,2 ]
Huang, Zhangjin [1 ,2 ]
Bao, Zhongkui [3 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Deqing Alpha Innovat Inst, Huzhou 313299, Peoples R China
[3] Anhui Univ, Hefei 230601, Peoples R China
基金
国家重点研发计划;
关键词
3D Object Detection; Depth Estimation; Two-Stage Training;
D O I
10.1007/978-981-97-5612-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lift-Splat-Shoot based 3D object detection systems aim to predict the targets' bounding boxes from images, by leveraging an explicit depth distribution that facilitates coherence between the depth and detection modules. Contrary to conventional end-to-end models that prioritize minimizing the disparity between estimated and ground-truth depth maps, our study underscores the intrinsic value of the depth distribution itself. To exploit this perspective, we introduce a novel two-stage training paradigm designed to optimize the depth and detection module separately, adopting a targeted approach to refine the depth distribution for 3D object detection. Specifically, the first stage involves training the depth module for precise depth estimation, which is supplemented by an auxiliary detection module that provides additional supervisory feedback for detection accuracy. This auxiliary component is designed to be discarded once it has served its purpose in improving the depth distribution. For the second stage, with the depth module's parameters now fixed, we train a fresh detection module from scratch under direct detection supervision. Additionally, a trainable and lightweight depth adapter is incorporated post the depth module to further adapt and polish the depth distribution, aligning it more closely with the detection objectives. Our experiments on the nuScenes dataset reveal that our approach significantly surpasses baseline models, achieving a notable 1.13% improvement on the NDS metric.
引用
收藏
页码:62 / 73
页数:12
相关论文
共 50 条
  • [21] Object Detection and Depth Estimation for 3D Trajectory Extraction
    Boukhers, Zeyd
    Shirahama, Kimiaki
    Li, Frederic
    Grzegorzek, Marcin
    2015 13TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2015,
  • [22] Monocular 3D Object Detection with Depth from Motion
    Wang, Tai
    Pang, Jiangmiao
    Lin, Dahua
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 386 - 403
  • [23] Pulmonary Nodule Detection from 3D CT Image with a Two-Stage Network
    Liao, Miao
    Chi, Zhiwei
    Wu, Huizhu
    Di, Shuanhu
    Hu, Yonghua
    Li, Yunyi
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [24] Distribution Aware VoteNet for 3D Object Detection
    Liang, Junxiong
    An, Pei
    Ma, Jie
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1583 - 1591
  • [25] FrustumVoxNet for 3D object detection fromRGB-D or Depth images
    Shen, Xiaoke
    Stamos, Ioannis
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1687 - 1695
  • [26] BirdNet plus : Two-Stage 3D Object Detection in LiDAR Through a Sparsity-Invariant Bird's Eye View
    Barrera, Alejandro
    Beltran, Jorge
    Guindel, Carlos
    Iglesias, Jose Antonio
    Garcia, Fernando
    IEEE ACCESS, 2021, 9 : 160299 - 160316
  • [27] Two-stage Co-salient Object Detection
    Wang, Zuyi
    Zhang, Lihe
    2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 287 - 290
  • [28] Two-Stage Unattended Object Detection Method with Proposals
    Nam Trung Pham
    Leman, Karianto
    Zhang, Jie
    Pek, Isaac
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2017, : 1 - 4
  • [29] Two-Stage Approach to Small-Object Detection
    Yu, Mingrui
    Leung, Henry
    SYSTEMS ENGINEERING, 2025,
  • [30] Salient Object Detection via Two-Stage Graphs
    Liu, Yi
    Han, Jungong
    Zhang, Qiang
    Wang, Long
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (04) : 1023 - 1037