Context-Aware 3D Object Detection From a Single Image in Autonomous Driving

Cited by: 5
Authors
Zhou, Dingfu [1 ,2 ]
Song, Xibin [1 ,2 ]
Fang, Jin [1 ,2 ]
Dai, Yuchao [3 ]
Li, Hongdong [4 ]
Zhang, Liangjun [1 ,2 ]
Affiliations
[1] Baidu Res, Robot & Autonomous Driving Lab, Beijing 100085, Peoples R China
[2] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100193, Peoples R China
[3] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710060, Peoples R China
[4] Australian Natl Univ, Coll Engn & Comp Sci, Canberra, ACT 0200, Australia
Funding
National Natural Science Foundation of China; Australian Research Council
Keywords
Three-dimensional displays; Object detection; Training; Feature extraction; Task analysis; Sensors; Detectors; Monocular 3D object detection; context-aware feature aggregation; self-attention; RECOGNITION; MODEL;
DOI
10.1109/TITS.2022.3154022
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
Camera sensors are widely used in driver-assistance and autonomous driving systems because of the rich texture information they provide. With the development of deep learning techniques, many approaches have recently been proposed to detect objects in 3D from a single frame; however, there is still much room for improvement. In this paper, we first review recently proposed state-of-the-art monocular 3D object detection approaches. Based on an analysis of the disadvantages of previous center-based frameworks, we propose a novel feature aggregation strategy that boosts 3D object detection by exploiting context information. Specifically, an Instance-Guided Spatial Attention (IGSA) module collects local instance information, and a Channel-Wise Feature Attention (CWFA) module aggregates global context information. In addition, an instance-guided object regression strategy is proposed to alleviate the influence of center location prediction uncertainty during inference. Finally, the proposed approach is verified on a public 3D object detection benchmark. The experimental results show that it significantly improves the performance of the baseline method on both the 3D detection and 2D Bird's-Eye View tasks across all three categories. Furthermore, our method outperforms all monocular methods (even those trained with depth as an auxiliary input) and achieves state-of-the-art performance on the KITTI benchmark.
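The abstract does not specify how the CWFA module is built. As a rough illustration of what channel-wise attention over global context can look like, the sketch below assumes a squeeze-and-excitation style gate: each channel is summarized by its global average, a single linear layer plus sigmoid turns those summaries into per-channel gates, and every channel is rescaled by its gate. The function names, the single-layer gate, and the `weights` and `bias` parameters are all hypothetical and are not taken from the paper.

```python
import math


def global_average_pool(features):
    """Return the mean of every channel.

    `features` is a list of C channels, each an H x W nested list.
    """
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in features
    ]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(features, weights, bias):
    """Rescale channels with gates computed from global channel statistics.

    `weights` (C x C) and `bias` (length C) stand in for learned parameters;
    they are assumptions of this sketch, not values from the paper.
    """
    pooled = global_average_pool(features)
    # One linear layer followed by a sigmoid gives a gate in (0, 1) per channel.
    gates = [
        sigmoid(sum(w * p for w, p in zip(row, pooled)) + b)
        for row, b in zip(weights, bias)
    ]
    # Multiply every value of a channel by that channel's gate.
    scaled = [
        [[gate * value for value in row] for row in channel]
        for gate, channel in zip(gates, features)
    ]
    return scaled, gates


# Toy input: 2 channels of size 2 x 2, with untrained (all-zero) parameters.
features = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[0.0, 2.0], [2.0, 0.0]],
]
weights = [[0.0, 0.0], [0.0, 0.0]]
bias = [0.0, 0.0]
scaled, gates = channel_attention(features, weights, bias)
```

With all-zero parameters every gate equals sigmoid(0) = 0.5, so each channel is simply halved; trained parameters would instead emphasize the channels that matter most for the current global context.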
Pages: 18568-18580
Page count: 13