Weakly Supervised 3D Object Detection from Point Clouds

被引:29
|
作者
Qin, Zengyi [1 ]
Wang, Jinglu [2 ]
Lu, Yan [2 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Microsoft Res Asia, Beijing, Peoples R China
关键词
3D object detection; point clouds; weakly supervised learning;
D O I
10.1145/3394171.3413805
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A crucial task in scene understanding is 3D object detection, which aims to detect and localize the 3D bounding boxes of objects belonging to specific classes. Existing 3D object detectors heavily rely on annotated 3D bounding boxes during training, while these annotations could be expensive to obtain and only accessible in limited scenarios. Weakly supervised learning is a promising approach to reducing the annotation requirement, but existing weakly supervised object detectors are mostly for 2D detection rather than 3D. In this work, we propose VS3D, a framework for weakly supervised 3D object detection from point clouds without using any ground truth 3D bounding box for training. First, we introduce an unsupervised 3D proposal module that generates object proposals by leveraging normalized point cloud densities. Second, we present a cross-modal knowledge distillation strategy, where a convolutional neural network learns to predict the final results from the 3D object proposals by querying a teacher network pretrained on image datasets. Comprehensive experiments on the challenging KITTI dataset demonstrate the superior performance of our VS3D in diverse evaluation settings.
引用
收藏
页码:4144 / 4152
页数:9
相关论文
共 50 条
  • [31] Weakly supervised object detection with 2D and 3D regression neural networks
    Dubost, Florian
    Adams, Hieab
    Yilmaz, Pinar
    Bortsova, Gerda
    van Tulder, Gijs
    Ikram, M. Arfan
    Niessen, Wiro
    Vernooij, Meike W.
    de Bruijne, Marleen
    [J]. MEDICAL IMAGE ANALYSIS, 2020, 65
  • [32] SS3D: Sparsely-Supervised 3D Object Detection from Point Cloud
    Liu, Chuandong
    Gao, Chenqiang
    Liu, Fangcen
    Liu, Jiang
    Meng, Deyu
    Gao, Xinbo
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8418 - 8427
  • [33] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection
    Zhang, Dingyuan
    Liang, Dingkang
    Zou, Zhikang
    Li, Jingyu
    Ye, Xiaoqing
    Liu, Zhe
    Tan, Xiao
    Bai, Xiang
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8339 - 8349
  • [34] Planar object detection from 3D point clouds based on pyramid voxel representation
    Hu, Zhaozheng
    Bai, Dongfang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (22) : 24343 - 24357
  • [35] DVST: Deformable Voxel Set Transformer for 3D Object Detection from Point Clouds
    Ning, Yaqian
    Cao, Jie
    Bao, Chun
    Hao, Qun
    [J]. REMOTE SENSING, 2023, 15 (23)
  • [36] Planar object detection from 3D point clouds based on pyramid voxel representation
    Zhaozheng Hu
    Dongfang Bai
    [J]. Multimedia Tools and Applications, 2017, 76 : 24343 - 24357
  • [37] 3D urban object change detection from aerial and terrestrial point clouds: A review
    Xiao, Wen
    Cao, Hui
    Tang, Miao
    Zhang, Zhenchao
    Chen, Nengcheng
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 118
  • [38] Leveraging Dynamic Occupancy Grids for 3D Object Detection in Point Clouds
    Sierra-Gonzalez, David
    Paigwar, Anshul
    Erkent, Ozgur
    Dibangoye, Jilles
    Laugier, Christian
    [J]. 16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 1188 - 1193
  • [39] Language guided 3D object detection in point clouds for MEP scenes
    Li, Junjie
    Du, Shengli
    Liu, Jianfeng
    Chen, Weibiao
    Tang, Manfu
    Zheng, Lei
    Wang, Lianfa
    Ji, Chunle
    Yu, Xiao
    Yu, Wanli
    [J]. IET COMPUTER VISION, 2024, 18 (04) : 526 - 539
  • [40] SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds
    Sun, Pei
    Tan, Mingxing
    Wang, Weiyue
    Liu, Chenxi
    Xia, Fei
    Leng, Zhaoqi
    Anguelov, Dragomir
    [J]. COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 426 - 442