MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection

被引:0
|
作者
Qiao, Junchao [1 ]
Liu, Biao [1 ]
Yang, Jiaqi [1 ]
Wang, Baohua [1 ]
Xiu, Sanmu [1 ]
Du, Xin [1 ]
Nie, Xiaobo [1 ]
机构
[1] Beijing Jiaotong Univ, Dept Elect Engn & Automat, Beijing 100082, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 08期
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Training; Object detection; Data augmentation; Solid modeling; Uncertainty; Laser radar; Computer vision for transportation; deep learning for visual perception; object detection; VISION;
D O I
10.1109/LRA.2024.3414272
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the context of autonomous driving, it is both critical and challenging to locate 3D objects by using a calibrated RGB image. Current methods typically utilize heteroscedastic aleatoric uncertainty loss to regress the depth of objects, thereby reducing the impact of noisy input while also ensuring the reliability of depth predictions. However, experimentation reveals that uncertainty loss can also lead to serious overfitting issue and performance degradation. To address this issue, we propose MonoSample, an augmentation method that collects samples from the dataset and places them randomly during training. MonoSample takes into account the occlusion relationships and applies strict restrictions to ensure the verisimilitude of the enhanced scenes. Furthermore, MonoSample avoids the complex conversion process between 2D and 3D, thereby enabling the extraction of a large number of samples and efficient operation. Experiments on different models have verified its effectiveness. Leveraging MonoSample in DID-M3D, our model achieves state-of-the-art (SOTA) performance on the KITTI 3D object detection benchmark.
引用
收藏
页码:7326 / 7332
页数:7
相关论文
共 50 条
  • [41] Object-Aware Centroid Voting for Monocular 3D Object Detection
    Bao, Wentao
    Yu, Qi
    Kong, Yu
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 2197 - 2204
  • [42] YOLOv7-3D: A Monocular 3D Traffic Object Detection Method from a Roadside Perspective
    Ye, Zixun
    Zhang, Hongying
    Gu, Jingliang
    Li, Xue
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [43] Monocular 3D Object Detection: An Extrinsic Parameter Free Approach
    Zhou, Yunsong
    He, Yuan
    Zhu, Hongzi
    Wang, Cheng
    Li, Hongyang
    Jiang, Qinhong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7552 - 7562
  • [44] MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation
    Chen, Hansheng
    Huang, Yuyao
    Tian, Wei
    Gao, Zhong
    Xiong, Lu
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10374 - 10383
  • [45] Monocular 3D Object Detection Based on Uncertainty Prediction of Keypoints
    Chen, Mu
    Zhao, Huaici
    Liu, Pengfei
    MACHINES, 2022, 10 (01)
  • [46] Efficient Active Learning Strategies for Monocular 3D Object Detection
    Hekimoglu, Aral
    Schmidt, Michael
    Marcos-Ramiro, Alvaro
    Rigoll, Gerhard
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 295 - 302
  • [47] 3D Object Detection and Tracking Using Monocular Camera in CARLA
    Zhang, Yanyu
    Song, Jiahao
    Li, Shuwei
    2021 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY (EIT), 2021, : 67 - 72
  • [48] MonoDCN: Monocular 3D object detection based on dynamic convolution
    Qu, Shenming
    Yang, Xinyu
    Gao, Yiming
    Liang, Shengbin
    PLOS ONE, 2022, 17 (10):
  • [49] MonoPGC: Monocular 3D Object Detection with Pixel Geometry Contexts
    Wu, Zizhang
    Gan, Yuanzhu
    Wang, Lei
    Chen, Guilian
    Pu, Jian
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 4842 - 4849
  • [50] Stereoscopic Vision Recalling Memory for Monocular 3D Object Detection
    Kim, Jung Uk
    Kim, Hyung-Il
    Ro, Yong Man
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2749 - 2760