Monocular 3D Object Detection for Autonomous Driving Based on Contextual Transformer

Authors
She, Xiangyang [1]
Yan, Weijia [1]
Dong, Lihong [1]
Affiliation
[1] College of Computer Science and Technology, Xi'an University of Science and Technology, Xi'an, 710054, China
DOI
10.3778/j.issn.1002-8331.2307-0084
Abstract
To address missed detections and poor multi-scale object detection in monocular 3D object detection, a monocular 3D object detection algorithm for autonomous driving based on the Contextual Transformer (CM-RTM3D) is proposed. First, the Contextual Transformer (CoT) is introduced into the ResNet-50 network to construct a ResNet-Transformer architecture for feature extraction. Second, a multi-scale spatial perception (MSP) module is designed to mitigate the loss of shallow features: it applies scale-space response operations, embeds the coordinate attention (CA) mechanism along the horizontal and vertical spatial directions, and uses the softmax function to generate soft importance weights for each scale. Finally, the Huber loss function replaces the L1 loss function in the offset loss. Experimental results on the KITTI autonomous driving dataset show that, compared with the RTM3D algorithm, the proposed algorithm improves AP3D by 4.84, 3.82, and 5.36 percentage points and APBEV by 4.75, 6.26, and 3.56 percentage points at the easy, moderate, and hard difficulty levels, respectively. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
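The replacement of the L1 loss by the Huber loss in the offset loss can be illustrated with a minimal sketch. The abstract does not state the threshold used, so `delta=1.0` below is an assumption, and the function names `l1_loss` and `huber_loss` are hypothetical helpers for illustration rather than the authors' code.

```python
def l1_loss(pred, target):
    """Mean absolute error between predicted and target offsets."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def huber_loss(pred, target, delta=1.0):
    """Mean Huber loss: quadratic for small residuals, linear for large ones.

    delta is the switch point between the two regimes. Its value here is an
    assumption; the abstract does not report the one used in the paper.
    """
    total = 0.0
    for p, t in zip(pred, target):
        r = abs(p - t)
        if r <= delta:
            total += 0.5 * r * r                # smooth near zero
        else:
            total += delta * (r - 0.5 * delta)  # linear growth for outliers
    return total / len(pred)


if __name__ == "__main__":
    pred = [0.5, 3.0]    # illustrative offset predictions, not real data
    target = [0.0, 0.0]
    print("L1:", l1_loss(pred, target))
    print("Huber:", huber_loss(pred, target))
```

Unlike L1, the Huber loss has a gradient that shrinks toward zero for small residuals, while it still grows only linearly for large ones, which is a common reason for preferring it in regression of small offsets.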
Pages: 178-189