Adapting Depth Distribution for 3D Object Detection with a Two-Stage Training Paradigm

被引:0
|
作者
Luo, Yixin [1 ,2 ]
Huang, Zhangjin [1 ,2 ]
Bao, Zhongkui [3 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Deqing Alpha Innovat Inst, Huzhou 313299, Peoples R China
[3] Anhui Univ, Hefei 230601, Peoples R China
基金
国家重点研发计划;
关键词
3D Object Detection; Depth Estimation; Two-Stage Training;
D O I
10.1007/978-981-97-5612-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lift-Splat-Shoot based 3D object detection systems aim to predict the targets' bounding boxes from images, by leveraging an explicit depth distribution that facilitates coherence between the depth and detection modules. Contrary to conventional end-to-end models that prioritize minimizing the disparity between estimated and ground-truth depth maps, our study underscores the intrinsic value of the depth distribution itself. To exploit this perspective, we introduce a novel two-stage training paradigm designed to optimize the depth and detection module separately, adopting a targeted approach to refine the depth distribution for 3D object detection. Specifically, the first stage involves training the depth module for precise depth estimation, which is supplemented by an auxiliary detection module that provides additional supervisory feedback for detection accuracy. This auxiliary component is designed to be discarded once it has served its purpose in improving the depth distribution. For the second stage, with the depth module's parameters now fixed, we train a fresh detection module from scratch under direct detection supervision. Additionally, a trainable and lightweight depth adapter is incorporated post the depth module to further adapt and polish the depth distribution, aligning it more closely with the detection objectives. Our experiments on the nuScenes dataset reveal that our approach significantly surpasses baseline models, achieving a notable 1.13% improvement on the NDS metric.
引用
收藏
页码:62 / 73
页数:12
相关论文
共 50 条
  • [31] Two-stage dynamic deformation for construction of 3D models
    Chen, SW
    Stockman, G
    Dai, CY
    Chuang, CP
    GRAPHICAL MODELS AND IMAGE PROCESSING, 1996, 58 (05): : 484 - 493
  • [32] Efficient 3D Correspondence Grouping by Two-Stage Filtering
    Lu, Rongrong
    Zhu, Feng
    Wu, Qingxiao
    Kong, Yanzi
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [33] Camouflaged object segmentation with prior via two-stage training
    Wang, Rui
    Shi, Caijuan
    Duan, Changyu
    Gao, Weixiang
    Zhu, Hongli
    Wei, Yunchao
    Liu, Meiqin
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 246
  • [34] Depth-enhancement network for monocular 3D object detection
    Liu, Guohua
    Lian, Haiyang
    Guo, Changrui
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)
  • [35] Deep Optics for Monocular Depth Estimation and 3D Object Detection
    Chang, Julie
    Wetzstein, Gordon
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10192 - 10201
  • [36] PDR: Progressive Depth Regularization for Monocular 3D Object Detection
    Sheng, Hualian
    Cai, Sijia
    Zhao, Na
    Deng, Bing
    Zhao, Min-Jian
    Lee, Gim Hee
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7591 - 7603
  • [37] Densely Constrained Depth Estimator for Monocular 3D Object Detection
    Li, Yingyan
    Chen, Yuntao
    He, Jiawei
    Zhang, Zhaoxiang
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 718 - 734
  • [38] DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection
    Kumar, Abhinav
    Brazil, Garrick
    Corona, Enrique
    Parchami, Armin
    Liu, Xiaoming
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 664 - 683
  • [39] Two-Stage Lesion Detection Approach Based on Dimension-Decomposition and 3D Context
    Jiacheng Jiao
    Haiwei Pan
    Chunling Chen
    Tao Jin
    Yang Dong
    Jingyi Chen
    TsinghuaScienceandTechnology, 2022, 27 (01) : 103 - 113
  • [40] Two-Stage Lesion Detection Approach Based on Dimension-Decomposition and 3D Context
    Jiao, Jiacheng
    Pan, Haiwei
    Chen, Chunling
    Jin, Tao
    Dong, Yang
    Chen, Jingyi
    TSINGHUA SCIENCE AND TECHNOLOGY, 2022, 27 (01) : 103 - 113