Multi-Camera 3D Object Detection for Autonomous Driving Using Deep Learning and Self-Attention Mechanism

被引:2
|
作者
Hazarika, Ananya [1 ]
Vyas, Amit [1 ]
Rahmati, Mehdi [1 ]
Wang, Yan [2 ]
机构
[1] Cleveland State Univ, Dept Elect Engn & Comp Sci, Cleveland, OH 44115 USA
[2] Ford Motor Co, Dearborn, MI 48126 USA
关键词
Autonomous vehicles; deep learning; object detection; vision transformer;
D O I
10.1109/ACCESS.2023.3288112
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the absence of depth-centric sensors, 3D object detection using only conventional cameras becomes ill-posed and inaccurate due to the lack of depth information in the RGB image. We propose a multi-camera perception solution to predict the 3D properties of the vehicle obtained from the aggregated information from multiple static infrastructure-installed cameras. While a multi-bin regression loss has been adopted to predict the orientation of a 3D bounding box using a convolutional neural network, combining it with the geometrical constraints of a 2D bounding box to form a 3D bounding box is not accurate enough for all the driving scenarios and orientations. This paper leverages a vision transformer that overcomes the drawbacks of convolutional neural networks when there are no external LiDAR or pseudo-LiDAR pre-trained datasets available for depth map estimation, particularly in occluded regions. By combining the predicted 3D boxes from various cameras using an average weighted score algorithm, we determine the best bounding box with the highest confidence score. Comprehensive simulations for performance analysis are shown from the results obtained by utilizing the KITTI standard data generated from the CARLA simulator.
引用
收藏
页码:64608 / 64620
页数:13
相关论文
共 50 条
  • [1] SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving
    Wei, Yi
    Zhao, Linqing
    Zheng, Wenzhao
    Zhu, Zheng
    Zhou, Jie
    Lu, Jiwen
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21672 - 21683
  • [2] Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles
    Pha Nguyen
    Kha Gia Quach
    Chi Nhan Duong
    Ngan Le
    Xuan-Bac Nguyen
    Khoa Luu
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2568 - 2577
  • [3] PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer
    Jiang, Yanqin
    Zhang, Li
    Miao, Zhenwei
    Zhu, Xiatian
    Gao, Jin
    Hu, Weimin
    Jiang, Yu-Gang
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 1042 - 1050
  • [4] SAT-GCN: Self-attention graph convolutional network-based 3D object detection for autonomous driving
    Wang, Li
    Song, Ziying
    Zhang, Xinyu
    Wang, Chenfei
    Zhang, Guoxin
    Zhu, Lei
    Li, Jun
    Liu, Huaping
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 259
  • [5] Enhancing Multi-modal Features Using Local Self-attention for 3D Object Detection
    Li, Hao
    Zhang, Zehan
    Zhao, Xian
    Wang, Yulong
    Shen, Yuxi
    Pu, Shiliang
    Mao, Hui
    [J]. COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 532 - 549
  • [6] Improving Deep Multi-modal 3D Object Detection for Autonomous Driving
    Khamsehashari, Razieh
    Schill, Kerstin
    [J]. 2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 263 - 267
  • [7] Survey on deep learning-based 3D object detection in autonomous driving
    Liang, Zhenming
    Huang, Yingping
    [J]. TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2023, 45 (04) : 761 - 776
  • [8] SA-YOLOv3: An Efficient and Accurate Object Detector Using Self-Attention Mechanism for Autonomous Driving
    Tian, Daxin
    Lin, Chunmian
    Zhou, Jianshan
    Duan, Xuting
    Cao, Yue
    Zhao, Dezong
    Cao, Dongpu
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) : 4099 - 4110
  • [9] Deep learning-based fusion networks with high-order attention mechanism for 3D object detection in autonomous driving scenarios
    Jiang, Haiyang
    Lu, Yuanyao
    Zhang, Duona
    Shi, Yuntao
    Wang, Jingxuan
    [J]. APPLIED SOFT COMPUTING, 2024, 152
  • [10] Deep learning-based fusion networks with high-order attention mechanism for 3D object detection in autonomous driving scenarios
    Jiang, Haiyang
    Lu, Yuanyao
    Zhang, Duona
    Shi, Yuntao
    Wang, Jingxuan
    [J]. Applied Soft Computing, 2024, 152