Adapting Depth Distribution for 3D Object Detection with a Two-Stage Training Paradigm

Cited by: 0
Authors
Luo, Yixin [1, 2]
Huang, Zhangjin [1, 2]
Bao, Zhongkui [3 ]
Affiliations
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Deqing Alpha Innovat Inst, Huzhou 313299, Peoples R China
[3] Anhui Univ, Hefei 230601, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
3D Object Detection; Depth Estimation; Two-Stage Training;
DOI
10.1007/978-981-97-5612-4_6
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Lift-Splat-Shoot-based 3D object detection systems predict targets' bounding boxes from images by leveraging an explicit depth distribution that keeps the depth and detection modules coherent. Whereas conventional end-to-end models prioritize minimizing the disparity between estimated and ground-truth depth maps, our study underscores the intrinsic value of the depth distribution itself. To exploit this perspective, we introduce a novel two-stage training paradigm that optimizes the depth and detection modules separately, refining the depth distribution specifically for 3D object detection. In the first stage, the depth module is trained for precise depth estimation, supplemented by an auxiliary detection module that provides additional supervisory feedback for detection accuracy. This auxiliary component is discarded once it has served its purpose of improving the depth distribution. In the second stage, with the depth module's parameters fixed, we train a fresh detection module from scratch under direct detection supervision. Additionally, a trainable, lightweight depth adapter is placed after the depth module to further adapt and polish the depth distribution, aligning it more closely with the detection objectives. Our experiments on the nuScenes dataset show that our approach surpasses baseline models, improving the NDS metric by 1.13%.
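The two-stage schedule and the role of the depth distribution can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the abstract does not describe the adapter's architecture, so the per-bin residual correction in `adapt_depth_logits` is a hypothetical stand-in, and all names (`STAGES`, `lift`, the toy logits and features) are assumptions made for illustration. The `lift` step follows the general Lift-Splat-Shoot idea of weighting a pixel's context feature by its per-bin depth probability.

```python
import math

# Which components are trainable in each stage, as described in the abstract.
# The component names themselves are illustrative labels, not the paper's.
STAGES = [
    {"name": "stage 1",
     "trainable": {"depth_module", "aux_detection_module"},
     "frozen": set()},
    {"name": "stage 2",  # the auxiliary detector has been discarded by now
     "trainable": {"depth_adapter", "detection_module"},
     "frozen": {"depth_module"}},
]


def softmax(logits):
    """Turn per-bin depth logits into a categorical depth distribution."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def adapt_depth_logits(logits, scale, bias):
    """Hypothetical lightweight adapter: a residual per-bin affine correction.

    With scale and bias at zero the adapter is the identity, so stage 2
    starts from the frozen depth module's distribution.
    """
    return [x + s * x + b for x, s, b in zip(logits, scale, bias)]


def lift(depth_probs, context):
    """Outer product of a depth distribution and a context feature vector."""
    return [[p * c for c in context] for p in depth_probs]


# Toy example: one pixel, 4 depth bins, 3 context channels.
frozen_logits = [0.0, 2.0, 1.0, -1.0]   # output of the frozen depth module
context = [0.5, -1.0, 2.0]

zeros = [0.0] * 4
base_probs = softmax(frozen_logits)
identity_probs = softmax(adapt_depth_logits(frozen_logits, zeros, zeros))

# An adapter that has learned to favor depth bin 1.
adapted_probs = softmax(
    adapt_depth_logits(frozen_logits, zeros, [0.0, 1.0, 0.0, 0.0])
)
frustum = lift(adapted_probs, context)   # shape: 4 depth bins x 3 channels
```

Because the adapted probabilities still sum to one, summing `frustum` over the depth bins recovers the original context vector; the adapter only changes how that feature is spread along the depth axis.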
Pages: 62-73
Page count: 12