SOGDet: Semantic-Occupancy Guided Multi-View 3D Object Detection

Authors
Zhou, Qiu
Cao, Jinming [1]
Leng, Hanchao [2]
Yin, Yifang [3]
Kun, Yu [2]
Zimmermann, Roger [1]
Affiliations
[1] Natl Univ Singapore, Singapore, Singapore
[2] Xiaomi Car, Singapore, Singapore
[3] ASTAR, I2R, Singapore, Singapore
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the field of autonomous driving, accurate and comprehensive perception of the 3D environment is crucial. Bird's Eye View (BEV) based methods have emerged as a promising solution for 3D object detection using multi-view images as input. However, existing 3D object detection methods often ignore the physical context in the environment, such as sidewalks and vegetation, resulting in sub-optimal performance. In this paper, we propose a novel approach called SOGDet (Semantic-Occupancy Guided Multi-view 3D Object Detection) that leverages a 3D semantic-occupancy branch to improve the accuracy of 3D object detection. In particular, the physical context modeled by semantic occupancy helps the detector perceive scenes more holistically. SOGDet is flexible to use and can be seamlessly integrated with most existing BEV-based methods. To evaluate its effectiveness, we apply this approach to several state-of-the-art baselines and conduct extensive experiments on the exclusive nuScenes dataset. Our results show that SOGDet consistently enhances the performance of three baseline methods in terms of nuScenes Detection Score (NDS) and mean Average Precision (mAP). This indicates that combining 3D object detection with 3D semantic occupancy leads to a more comprehensive perception of the 3D environment, thereby helping to build more robust autonomous driving systems. The code is available at: https://github.com/zhouqiu/SOGDet.
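For readers unfamiliar with the two metrics named in the abstract, the sketch below shows how NDS is derived from mAP and the five mean true-positive error metrics, following the public nuScenes detection benchmark definition. The function name `nuscenes_detection_score` and the input values are illustrative assumptions; they are not taken from the paper or from the SOGDet repository.

```python
# Illustrative sketch only: the NDS formula follows the public nuScenes
# benchmark definition; the function name and example inputs are hypothetical.

TP_METRICS = ("mATE", "mASE", "mAOE", "mAVE", "mAAE")


def nuscenes_detection_score(mean_ap, tp_errors):
    """Combine mAP with the five mean true-positive errors into NDS.

    NDS = (5 * mAP + sum(1 - min(1, err) for each TP error)) / 10
    """
    missing = [name for name in TP_METRICS if name not in tp_errors]
    if missing:
        raise ValueError(f"missing TP metrics: {missing}")
    # Each error is clipped to 1 so that its score stays in [0, 1].
    tp_scores = [1.0 - min(1.0, tp_errors[name]) for name in TP_METRICS]
    return (5.0 * mean_ap + sum(tp_scores)) / 10.0


# Hypothetical inputs, not results reported for SOGDet.
example_errors = {"mATE": 0.5, "mASE": 0.5, "mAOE": 0.5, "mAVE": 0.5, "mAAE": 0.5}
example_nds = nuscenes_detection_score(0.4, example_errors)
```

Because mAP carries half of the total weight, a change in mAP alone moves NDS by half as much, which is why the two metrics are usually reported together.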
Pages: 7668-7676
Number of pages: 9