3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection

被引：55

作者：

Wang, He ^{[1
]}

Cong, Yezhen ^{[2
]}

Litany, Or ^{[3
]}

Gao, Yue ^{[2
]}

Guibas, Leonidas J. ^{[1
]}

机构：

[1] Stanford Univ, Stanford, CA 94305 USA

[2] Tsinghua Univ, Beijing, Peoples R China

[3] NVIDIA, Santa Clara, CA USA

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

D O I：

10.1109/CVPR46437.2021.01438

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D object detection is an important yet demanding task that heavily relies on difficult to obtain 3D annotations. To reduce the required amount of supervision, we propose 3DIoUMatch, a novel semi-supervised method for 3D object detection applicable to both indoor and outdoor scenes. We leverage a teacher-student mutual learning framework to propagate information from the labeled to the unlabeled train set in the form of pseudo-labels. However, due to the high task complexity, we observe that the pseudo-labels suffer from significant noise and are thus not directly usable. To that end, we introduce a confidence-based filtering mechanism, inspired by FixMatch. We set confidence thresholds based upon the predicted objectness and class probability to filter low-quality pseudo-labels. While effective, we observe that these two measures do not sufficiently capture localization quality. We therefore propose to use the estimated 3D IoU as a localization metric and set category-aware self-adjusted thresholds to filter poorly localized proposals. We adopt VoteNet as our backbone detector on indoor datasets while we use PV-RCNN on the autonomous driving dataset, KITTI. Our method consistently improves state-of-the-art methods on both ScanNet and SUN-RGBD benchmarks by significant margins under all label ratios (including fully labeled setting). For example, when training using only 10% labeled data on ScanNet, 3DIoUMatch achieves 7.7 absolute improvement on mAP@0.25 and 8.5 absolute improvement on mAP@0.5 upon the prior art. On KITTI, we are the first to demonstrate semi-supervised 3D object detection and our method surpasses a fully supervised baseline from 1.8% to 7.6% under different label ratio and categories.

引用

页码：14610 / 14619

页数：10

共 50 条

[1] Semi-supervised 3D Object Detection with PatchTeacher and PillarMix
Wu, Xiaopei
Peng, Liang
Xie, Liang
Hou, Yuenan
Lin, Binbin
Huang, Xiaoshui
Liu, Haifeng
Cai, Deng
Ouyang, Wanli
[J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 6153 - 6161
[2] Semi-supervised 3D Object Detection with Proficient Teachers
Yin, Junbo
Fang, Jin
Zhou, Dingfu
Zhang, Liangjun
Xu, Cheng-Zhong
Shen, Jianbing
Wang, Wenguan
[J]. COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 727 - 743
[3] A semi-supervised 3D object detection method for autonomous driving
Zhang, Jiacheng
Liu, Huafeng
Lu, Jianfeng
[J]. DISPLAYS, 2022, 71
[4] Learning with Noisy Data for Semi-Supervised 3D Object Detection
Chen, Zehui
Li, Zhenyu
Wang, Shuo
Fu, Dengpan
Zhao, Feng
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6906 - 6916
[5] Joint Semi-Supervised and Active Learning via 3D Consistency for 3D Object Detection
Hwang, Sihwan
Kim, Sanmin
Kim, Youngseok
Kum, Dongsuk
[J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 4819 - 4825
[6] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection
Zhang, Dingyuan
Liang, Dingkang
Zou, Zhikang
Li, Jingyu
Ye, Xiaoqing
Liu, Zhe
Tan, Xiao
Bai, Xiang
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8339 - 8349
[7] Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection
Ho, Cheng-Ju
Tai, Chen-Hsuan
Lin, Yen-Yu
Yang, Ming-Hsuan
Tsai, Yi-Hsuan
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[8] PE-MCAT: Leveraging Image Sensor Fusion and Adaptive Thresholds for Semi-Supervised 3D Object Detection
Li, Bohao
Song, Shaojing
Ai, Luxia
[J]. Sensors, 2024, 24 (21)
[9] Transferable Semi-Supervised 3D Object Detection From RGB-D Data
Tang, Yew Siang
Lee, Gim Hee
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1931 - 1940
[10] Semi-Supervised Online Continual Learning for 3D Object Detection in Mobile Robotics
Liu, Binhong
Yao, Dexin
Yang, Rui
Yan, Zhi
Yang, Tao
[J]. Journal of Intelligent and Robotic Systems: Theory and Applications, 2024, 110 (04):

← 1 2 3 4 5 →