Efficient Active Learning Strategies for Monocular 3D Object Detection

Authors
Hekimoglu, Aral [1, 2]
Schmidt, Michael [2]
Marcos-Ramiro, Alvaro [2]
Rigoll, Gerhard [1]
Affiliations
[1] Tech Univ Munich, Chair Human Machine Commun, Munich, Germany
[2] BMW Grp, Munich, Germany
DOI
10.1109/IV51971.2022.9827454
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Processing camera images to perceive the 3D surroundings is essential for building scalable autonomous vehicles. For this task, deep learning networks provide effective real-time solutions. However, because cameras lack the depth information that LiDARs provide, a large amount of labeled data is required for training. Active learning is a training framework in which the network actively participates in the data selection process to improve data efficiency and performance. In this work, we propose an active learning pipeline for 3D object detection from monocular images. The main components of our approach are (1) two training-efficient uncertainty estimation strategies, (2) a diversity-based selection strategy that selects images containing the most diverse set of objects, and (3) a novel active learning strategy more suitable for training autonomous driving perception networks. Experiments show that combining our proposed uncertainty estimation methods yields a better data saving rate and a higher final performance than the baselines. Furthermore, we empirically show the performance gains of the presented diversity-based selection strategy and the efficiency of the proposed active learning strategy.
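To make the idea of combining uncertainty with diversity in a selection step more concrete, the following is a minimal generic sketch of one pool-based selection round. It is not the authors' method: the function name `select_batch`, the per-image `uncertainty` scores, the `features` vectors, and the scoring rule (uncertainty multiplied by the distance to the nearest already-selected image) are all assumptions made for illustration.

```python
import math
from typing import Dict, List, Sequence


def select_batch(
    uncertainty: Dict[str, float],
    features: Dict[str, Sequence[float]],
    budget: int,
) -> List[str]:
    """Greedily pick up to `budget` unlabeled images to send for labeling.

    Hypothetical scoring rule: the first pick is the most uncertain image;
    each later pick maximizes uncertainty times the Euclidean distance to
    the nearest image already selected, which favors diverse picks.
    """
    remaining = set(uncertainty)
    selected: List[str] = []

    while remaining and len(selected) < budget:

        def score(image_id: str) -> float:
            if not selected:
                return uncertainty[image_id]
            nearest = min(
                math.dist(features[image_id], features[s]) for s in selected
            )
            return uncertainty[image_id] * nearest

        # Sort the candidates so that ties are broken deterministically.
        best = max(sorted(remaining), key=score)
        selected.append(best)
        remaining.remove(best)

    return selected


# Toy pool: "a" and "b" are both uncertain but nearly identical,
# while "c" is less uncertain but far away in feature space.
pool_uncertainty = {"a": 0.9, "b": 0.85, "c": 0.5}
pool_features = {"a": [0.0, 0.0], "b": [0.1, 0.0], "c": [3.0, 4.0]}
picked = select_batch(pool_uncertainty, pool_features, budget=2)
```

In this toy pool the second pick is "c" rather than the near-duplicate "b", which is the behavior a diversity term is meant to encourage. In a real pipeline the selected images would be labeled, added to the training set, and the detector retrained before the next round.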
Pages: 295-302
Page count: 8