3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection

被引:55
|
作者
Wang, He [1 ]
Cong, Yezhen [2 ]
Litany, Or [3 ]
Gao, Yue [2 ]
Guibas, Leonidas J. [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Tsinghua Univ, Beijing, Peoples R China
[3] NVIDIA, Santa Clara, CA USA
关键词
D O I
10.1109/CVPR46437.2021.01438
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D object detection is an important yet demanding task that heavily relies on difficult to obtain 3D annotations. To reduce the required amount of supervision, we propose 3DIoUMatch, a novel semi-supervised method for 3D object detection applicable to both indoor and outdoor scenes. We leverage a teacher-student mutual learning framework to propagate information from the labeled to the unlabeled train set in the form of pseudo-labels. However, due to the high task complexity, we observe that the pseudo-labels suffer from significant noise and are thus not directly usable. To that end, we introduce a confidence-based filtering mechanism, inspired by FixMatch. We set confidence thresholds based upon the predicted objectness and class probability to filter low-quality pseudo-labels. While effective, we observe that these two measures do not sufficiently capture localization quality. We therefore propose to use the estimated 3D IoU as a localization metric and set category-aware self-adjusted thresholds to filter poorly localized proposals. We adopt VoteNet as our backbone detector on indoor datasets while we use PV-RCNN on the autonomous driving dataset, KITTI. Our method consistently improves state-of-the-art methods on both ScanNet and SUN-RGBD benchmarks by significant margins under all label ratios (including fully labeled setting). For example, when training using only 10% labeled data on ScanNet, 3DIoUMatch achieves 7.7 absolute improvement on mAP@0.25 and 8.5 absolute improvement on mAP@0.5 upon the prior art. On KITTI, we are the first to demonstrate semi-supervised 3D object detection and our method surpasses a fully supervised baseline from 1.8% to 7.6% under different label ratio and categories.
引用
收藏
页码:14610 / 14619
页数:10
相关论文
共 50 条
  • [1] Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
    Wu, Xiaopei
    Peng, Liang
    Xie, Liang
    Hou, Yuenan
    Lin, Binbin
    Huang, Xiaoshui
    Liu, Haifeng
    Cai, Deng
    Ouyang, Wanli
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 6153 - 6161
  • [2] Semi-supervised 3D Object Detection with Proficient Teachers
    Yin, Junbo
    Fang, Jin
    Zhou, Dingfu
    Zhang, Liangjun
    Xu, Cheng-Zhong
    Shen, Jianbing
    Wang, Wenguan
    [J]. COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 727 - 743
  • [3] A semi-supervised 3D object detection method for autonomous driving
    Zhang, Jiacheng
    Liu, Huafeng
    Lu, Jianfeng
    [J]. DISPLAYS, 2022, 71
  • [4] Learning with Noisy Data for Semi-Supervised 3D Object Detection
    Chen, Zehui
    Li, Zhenyu
    Wang, Shuo
    Fu, Dengpan
    Zhao, Feng
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6906 - 6916
  • [5] Joint Semi-Supervised and Active Learning via 3D Consistency for 3D Object Detection
    Hwang, Sihwan
    Kim, Sanmin
    Kim, Youngseok
    Kum, Dongsuk
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 4819 - 4825
  • [6] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection
    Zhang, Dingyuan
    Liang, Dingkang
    Zou, Zhikang
    Li, Jingyu
    Ye, Xiaoqing
    Liu, Zhe
    Tan, Xiao
    Bai, Xiang
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8339 - 8349
  • [7] Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection
    Ho, Cheng-Ju
    Tai, Chen-Hsuan
    Lin, Yen-Yu
    Yang, Ming-Hsuan
    Tsai, Yi-Hsuan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] PE-MCAT: Leveraging Image Sensor Fusion and Adaptive Thresholds for Semi-Supervised 3D Object Detection
    Li, Bohao
    Song, Shaojing
    Ai, Luxia
    [J]. Sensors, 2024, 24 (21)
  • [9] Transferable Semi-Supervised 3D Object Detection From RGB-D Data
    Tang, Yew Siang
    Lee, Gim Hee
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1931 - 1940
  • [10] Semi-Supervised Online Continual Learning for 3D Object Detection in Mobile Robotics
    Liu, Binhong
    Yao, Dexin
    Yang, Rui
    Yan, Zhi
    Yang, Tao
    [J]. Journal of Intelligent and Robotic Systems: Theory and Applications, 2024, 110 (04):