Monocular 3D Object Detection for Autonomous Driving Based on Contextual Transformer

被引：0

作者：

She, Xiangyang ^{[1
]}

Yan, Weijia ^{[1
]}

Dong, Lihong ^{[1
]}

机构：

[1] College of Computer Science and Technology, Xi'an University of Science and Technology, Xi'an,710054, China

来源：

Computer Engineering and Applications | 2024年 / 60卷 / 19期

关键词：

D O I：

10.3778/j.issn.1002-8331.2307-0084

中图分类号：

学科分类号：

摘要：

Aiming at the current problems of leakage and poor multi-scale target detection in monocular 3D object detection, a monocular 3D object detection algorithm for autonomous driving based on Contextual Transformer (CM-RTM3D) is proposed. Firstly, Contextual Transformer (CoT) is introduced into the ResNet-50 network to construct the ResNet-Transformer architecture for feature extraction. Secondly, the multi-scale spatial perception (MSP) module is designed to improve the loss of shallow features through scale-space response operations, embedding the coordinate attention mechanism (CA) along both horizontal and vertical spatial directions, and generating soft weights of importance at each scale using the softmax function. Finally, the Huber loss function is used instead of the L1 loss function in the offset loss. The experimental results show that, compared with the RTM3D algorithm on the KITTI autopilot dataset, the algorithm in this paper improves AP3D by 4.84, 3.82, and 5.36 percentage points, and APBEV by 4.75, 6.26, and 3.56 percentage points, respectively, at the three difficulty levels of easy, medium, and difficult. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.

引用

页码：178 / 189

共 50 条

[1] Monocular 3D Object Detection for Autonomous Driving
Chen, Xiaozhi
Kundu, Kaustav
Zhang, Ziyu
Ma, Huimin
Fidler, Sanja
Urtasun, Raquel
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2147 - 2156
[2] Efficient Uncertainty Estimation for Monocular 3D Object Detection in Autonomous Driving
Liu, Zechen
Han, Zhihua
2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2711 - 2718
[3] Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
Chen, Yi-Nan
Dai, Hang
Ding, Yong
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 877 - 887
[4] Monocular 3D object detection using dual quadric for autonomous driving
Li, Peixuan
Zhao, Huaici
NEUROCOMPUTING, 2021, 441 : 151 - 160
[5] Pseudo-Mono for Monocular 3D Object Detection in Autonomous Driving
Tao, Chongben
Cao, Jiecheng
Wang, Chen
Zhang, Zufeng
Gao, Zhen
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3962 - 3975
[6] Ground-Aware Monocular 3D Object Detection for Autonomous Driving
Liu, Yuxuan
Yixuan, Yuan
Liu, Ming
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 919 - 926
[7] Transformer-Based Optimized Multimodal Fusion for 3D Object Detection in Autonomous Driving
Alaba, Simegnew Yihunie
Ball, John E.
IEEE ACCESS, 2024, 12 : 50165 - 50176
[8] Monocular 3D object detection via estimation of paired keypoints for autonomous driving
Ji, Chaofeng
Liu, Guizhong
Zhao, Dan
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (04) : 5973 - 5988
[9] Monocular 3D object detection via estimation of paired keypoints for autonomous driving
Chaofeng Ji
Guizhong Liu
Dan Zhao
Multimedia Tools and Applications, 2022, 81 : 5973 - 5988
[10] A review of 3D object detection based on autonomous driving
Wang, Huijuan
Chen, Xinyue
Yuan, Quanbo
Liu, Peng
VISUAL COMPUTER, 2025, 41 (03): : 1757 - 1775

← 1 2 3 4 5 →