Monocular 3D Object Detection Utilizing Auxiliary Learning With Deformable Convolution

被引：0

作者：

Chen, Jiun-Han ^{[1
]}

Shieh, Jeng-Lun ^{[1
]}

Haq, Muhamad Amirul ^{[1
]}

Ruan, Shanq-Jang ^{[1
]}

机构：

[1] Natl Taiwan Univ Sci & Technol, Dept Elect & Comp Engn, Taipei 10607, Taiwan

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024年 / 25卷 / 03期

关键词：

Three-dimensional displays; Object detection; Solid modeling; Feature extraction; Training; Computational modeling; Task analysis; 3D object detection; monocular camera; driving scene understanding; auxiliary learning; deep learning;

D O I：

10.1109/TITS.2023.3319556

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

In autonomous driving systems, the monocular 3D object detection algorithm is a crucial component. The safety of autonomous vehicles heavily depends on a well-designed detection system. Therefore, developing a robust and efficient 3D object detection algorithm is a major goal for institutes and researchers. Having a 3D sense is essential in autonomous vehicles and robotics, as it allows the system to understand its surroundings and react accordingly. Compared with stereo-based and Lidar-based methods, monocular 3D Object detection is a challenging task as it only utilizes 2D information to generate complex 3D features, making it low-cost, less computationally intensive, and with great potential. However, the performance of monocular methods is impaired due to the lack of depth information. In this paper, we propose a simple, end-to-end, and effective network for monocular 3D object detection without the use of external training data. Our work is inspired by auxiliary learning, in which we use a robust feature extractor as our backbone and multiple regression heads to learn auxiliary knowledge. These auxiliary regression heads will be discarded after training for improved inference efficiency, allowing us to take advantage of auxiliary learning and enabling the model to learn critical information more conceptually. The proposed method achieves 17.28% and 20.10% for the moderate level of the Car category on the KITTI benchmark test set and validation set, respectively, which outperforms the previous monocular 3D object detection approaches.

引用

页码：2424 / 2436

页数：13

共 50 条

[1] Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection
Liu, Xianpeng
Xue, Nan
Wu, Tianfu
[J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1810 - 1818
[2] MonoDCN: Monocular 3D object detection based on dynamic convolution
Qu, Shenming
Yang, Xinyu
Gao, Yiming
Liang, Shengbin
[J]. PLOS ONE, 2022, 17 (10):
[3] Boosting Monocular 3D Object Detection With Object-Centric Auxiliary Depth Supervision
Kim, Youngseok
Kim, Sanmin
Sim, Sangmin
Choi, Jun Won
Kum, Dongsuk
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (02) : 1801 - 1813
[4] MonoAux: Fully Exploiting Auxiliary Information and Uncertainty for Monocular 3D Object Detection
Li, Zhenglin
Zheng, Wenbo
Yang, Le
Ma, Liyan
Zhou, Yang
Peng, Yan
[J]. CYBORG AND BIONIC SYSTEMS, 2024, 5
[5] Efficient Active Learning Strategies for Monocular 3D Object Detection
Hekimoglu, Aral
Schmidt, Michael
Marcos-Ramiro, Alvaro
Rigoll, Gerhard
[J]. 2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 295 - 302
[6] Aerial Monocular 3D Object Detection
Hu, Yue
Fang, Shaoheng
Xie, Weidi
Chen, Siheng
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04): : 1959 - 1966
[7] Disentangling Monocular 3D Object Detection
Simonelli, Andrea
Bulo, Samuel Rota
Porzi, Lorenzo
Lopez-Antequera, Manuel
Kontschieder, Peter
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999
[8] Learning Deformable Network for 3D Object Detection on Point Clouds
Zhang, Wanyi
Fu, Xiuhua
Li, Wei
[J]. MOBILE INFORMATION SYSTEMS, 2021, 2021
[9] 3D Object Detection Based on Proposal Generation Network Utilizing Monocular Images
ul Haq, Qazi Mazhar
Haq, Muhamad Amirul
Ruan, Shanq-Jang
Liang, Pei-Jung
Gao, De-Qin
[J]. IEEE CONSUMER ELECTRONICS MAGAZINE, 2022, 11 (05) : 47 - 53
[10] Depth-discriminative Metric Learning for Monocular 3D Object Detection
Choi, Wonhyeok
Shin, Mingyu
Im, Sunghoon
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

← 1 2 3 4 5 →