Context-Aware 3D Object Detection From a Single Image in Autonomous Driving

被引:5
|
作者
Zhou, Dingfu [1 ,2 ]
Song, Xibin [1 ,2 ]
Fang, Jin [1 ,2 ]
Dai, Yuchao [3 ]
Li, Hongdong [4 ]
Zhang, Liangjun [1 ,2 ]
机构
[1] Baidu Res, Robot & Autonomous Driving Lab, Beijing 100085, Peoples R China
[2] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100193, Peoples R China
[3] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710060, Peoples R China
[4] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 0200, Australia
基金
中国国家自然科学基金; 澳大利亚研究理事会;
关键词
Three-dimensional displays; Object detection; Training; Feature extraction; Task analysis; Sensors; Detectors; Monocular 3D object detection; context-aware feature aggregation; self-attention; RECOGNITION; MODEL;
D O I
10.1109/TITS.2022.3154022
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Camera sensors have been widely used in Driver-Assistance and Autonomous Driving Systems due to their rich texture information. Recently, with the development of deep learning techniques, many approaches have been proposed to detect objects in 3D from a single frame, however, there is still much room for improvement. In this paper, we generally review the recently proposed state-of-the-art monocular-based 3D object detection approaches first. Based on the analysis of the disadvantage of previous center-based frameworks, a novel feature aggregation strategy has been proposed to boost the 3D object detection by exploring the context information. Specifically, an Instance-Guided Spatial Attention (IGSA) module is proposed to collect the local instance information and the Channel-Wise Feature Attention (CWFA) module is employed for aggregating the global context information. In addition, an instance-guided object regression strategy is also proposed to alleviate the influence of center location prediction uncertainty in the inference process. Finally, the proposed approach has been verified on the public 3D object detection benchmark. The experimental results show that the proposed approach can significantly boost the performance of the baseline method on both 3D detection and 2D Bird's-Eye View among all three categories. Furthermore, our method outperforms all the monocular-based methods (even these trained with depth as auxiliary inputs) and achieves state-of-the-art performance on the KITTI benchmark.
引用
收藏
页码:18568 / 18580
页数:13
相关论文
共 50 条
  • [1] CONTEXT-AWARE DATA AUGMENTATION FOR LIDAR 3D OBJECT DETECTION
    Hu, Xuzhong
    Duan, Zaipeng
    Huang, Xiao
    Xu, Ziwen
    Ming, Delie
    Ma, Jie
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 11 - 15
  • [2] Ground-Aware Monocular 3D Object Detection for Autonomous Driving
    Liu, Yuxuan
    Yixuan, Yuan
    Liu, Ming
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 919 - 926
  • [3] Improving 3D Object Detection with Context-Aware and Dimensional Interaction Attention
    Jing Zhou
    Zixin Gong
    Junchi Zhang
    Neural Processing Letters, 56
  • [4] Improving 3D Object Detection with Context-Aware and Dimensional Interaction Attention
    Zhou, Jing
    Gong, Zixin
    Zhang, Junchi
    NEURAL PROCESSING LETTERS, 2024, 56 (01)
  • [5] Context-Aware 3D Object Localization from Single Calibrated Images: A Study of Basketballs
    Caio, Marcello Davide
    Van Zandycke, Gabriel
    De Vleeschouwer, Christophe
    PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT ANALYSIS IN SPORTS, MMSPORTS 2023, 2023, : 49 - 54
  • [6] Coherent Object Detection with 3D Geometric Context from a Single Image
    Pan, Jiyan
    Kanade, Takeo
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2576 - 2583
  • [7] 3D object detection based on image and LIDAR fusion for autonomous driving
    Chen G.
    Yi H.
    Mao Z.
    International Journal of Vehicle Information and Communication Systems, 2023, 8 (03) : 237 - 251
  • [8] Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook
    Song, Ziying
    Liu, Lin
    Jia, Feiyang
    Luo, Yadan
    Jia, Caiyan
    Zhang, Guoxin
    Yang, Lei
    Wang, Li
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 15407 - 15436
  • [9] Context-Aware 3D Object Streaming for Mobile Games
    Rahimi, Hesam
    Shirehjini, Ali Asghar Nazari
    Shirmohammadi, Shervin
    2011 10TH ANNUAL WORKSHOP ON NETWORK AND SYSTEMS SUPPORT FOR GAMES (NETGAMES 2011), 2011,
  • [10] Context-aware 3D object anchoring for mobile robots
    Guenther, Martin
    Ruiz-Sarmiento, J. R.
    Galindo, Cipriano
    Gonzalez-Jimenez, Javier
    Hertzberg, Joachim
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 110 : 12 - 32