SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras

被引:0
|
作者
Tang, Yingqi [1 ]
Meng, Zhaotie [1 ]
Chen, Guoliang [1 ]
Cheng, Erkang [1 ]
机构
[1] Nuilmax, Shanghai, Peoples R China
来源
关键词
Autonomous Driving; 3D Object Detection; Transformer;
D O I
10.1007/978-3-031-72627-9_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The field of autonomous driving has attracted considerable interest in approaches that directly infer 3D objects in the Bird's Eye View (BEV) from multiple cameras. Some attempts have also explored utilizing 2D detectors from single images to enhance the performance of 3D detection. However, these approaches rely on a two-stage process with separate detectors, where the 2D detection results are utilized only once for token selection or query initialization. In this paper, we present a single model termed SimPB, which Simultaneously detects 2D objects in the Perspective view and 3D objects in the BEV space from multiple cameras. To achieve this, we introduce a hybrid decoder consisting of several multi-view 2D decoder layers and several 3D decoder layers, specifically designed for their respective detection tasks. A Dynamic Query Allocation module and an Adaptive Query Aggregation module are proposed to continuously update and refine the interaction between 2D and 3D results, in a cyclic 3D-2D-3D manner. Additionally, Query-group Attention is utilized to strengthen the interaction among 2D queries within each camera group. In the experiments, we evaluate our method on the nuScenes dataset and demonstrate promising results for both 2D and 3D detection tasks. Our code is available at: https://github.com/nullmax-vision/SimPB.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [1] 2D and 3D object detection algorithms from images: A Survey
    Chen, Wei
    Li, Yan
    Tian, Zijian
    Zhang, Fan
    ARRAY, 2023, 19
  • [2] IoU Loss for 2D/3D Object Detection
    Zhou, Dingfu
    Fang, Jin
    Song, Xibin
    Guan, Chenye
    Yin, Junbo
    Dai, Yuchao
    Yang, Ruigang
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 85 - 94
  • [3] Enhance the 3D Object Detection With 2D Prior
    Liu, Cheng
    IEEE ACCESS, 2024, 12 : 67161 - 67169
  • [4] A computational model that recovers the 3D shape of an object from a single 2D retinal representation
    Li, Yunfeng
    Pizlo, Zygmunt
    Steinman, Robert M.
    VISION RESEARCH, 2009, 49 (09) : 979 - 991
  • [5] RELATIONAL MODEL CONSTRUCTION AND 3D OBJECT RECOGNITION FROM SINGLE 2D MONOCHROMATIC IMAGE
    ZHANG, S
    SULLIVAN, GD
    BAKER, KD
    IMAGE AND VISION COMPUTING, 1992, 10 (05) : 313 - 318
  • [6] Improving Object Detection in 2D Images Using a 3D World Model
    Viggh, Herbert E. M.
    Cho, Peter L.
    Armstrong-Crews, Nicholas L.
    Nam, Myra
    Shah, Danelle C.
    Brown, Geoffrey E.
    MULTISENSOR, MULTISOURCE INFORMATION FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS 2014, 2014, 9121
  • [7] 3D Object Detection and Instance Segmentation from 3D Range and 2D Color Images
    Shen, Xiaoke
    Stamos, Ioannis
    SENSORS, 2021, 21 (04) : 1 - 29
  • [8] MuTrans: Multiple Transformers for Fusing Feature Pyramid on 2D and 3D Object Detection
    Xie, Bangquan
    Yang, Liang
    Wei, Ailin
    Weng, Xiaoxiong
    Li, Bing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4407 - 4415
  • [9] 3D CURVED OBJECT RECOGNITION FROM MULTIPLE 2D CAMERA VIEWS
    LIU, CH
    TSAI, WH
    COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1990, 50 (02): : 177 - 187
  • [10] Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles
    Srivastava, Siddharth
    Jurie, Frederic
    Sharma, Gaurav
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4504 - 4511