DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection

Cited by: 17

Authors:

Kumar, Abhinav [1]

Brazil, Garrick [2]

Corona, Enrique [3]

Parchami, Armin [3]

Liu, Xiaoming [1]

Affiliations:

[1] Michigan State Univ, E Lansing, MI 48824 USA

[2] Meta AI, Menlo Pk, CA USA

[3] Ford Motor Co, Detroit, MI USA

Source:

COMPUTER VISION, ECCV 2022, PT IX | 2022 / vol. 13669

Keywords:

Equivariance; Projective manifold; Monocular 3D detection

DOI:

10.1007/978-3-031-20077-9_39

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Modern neural networks use building blocks such as convolutions that are equivariant to arbitrary 2D translations. However, these vanilla blocks are not equivariant to arbitrary 3D translations in the projective manifold. Even so, all monocular 3D detectors use vanilla blocks to obtain the 3D coordinates, a task for which these blocks are not designed. This paper takes the first step towards convolutions equivariant to arbitrary 3D translations in the projective manifold. Since depth is the hardest quantity to estimate in monocular detection, this paper proposes the Depth EquiVarIAnt NeTwork (DEVIANT), built with existing scale equivariant steerable blocks. As a result, DEVIANT is equivariant to depth translations in the projective manifold, whereas vanilla networks are not. The additional depth equivariance forces DEVIANT to learn consistent depth estimates; DEVIANT therefore achieves state-of-the-art monocular 3D detection results on the KITTI and Waymo datasets in the image-only category and performs competitively with methods that use extra information. Moreover, DEVIANT works better than vanilla networks in cross-dataset evaluation.
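The equivariance property named in the abstract can be illustrated with a small numerical check. The sketch below is not the DEVIANT implementation and does not use steerable blocks; it only shows, under stated assumptions, that a plain convolution commutes with 2D translation but not with a change of scale. The assumptions are: circular (wrap-around) boundaries so that translation is exact, a scale change modeled crudely as 2x subsampling, and hypothetical helper names (`circular_conv2d`, `translate`, `downscale`) chosen for illustration.

```python
import numpy as np

def circular_conv2d(image, kernel):
    """Circular 2D cross-correlation: out[y, x] = sum_ij kernel[i, j] * image[y+i, x+j]."""
    out = np.zeros_like(image, dtype=float)
    kh, kw = kernel.shape
    for i in range(kh):
        for j in range(kw):
            # np.roll with a negative shift brings image[y+i, x+j] to position [y, x].
            out += kernel[i, j] * np.roll(image, shift=(-i, -j), axis=(0, 1))
    return out

def translate(image, dy, dx):
    """Circular 2D translation (assumption: wrap-around boundaries)."""
    return np.roll(image, shift=(dy, dx), axis=(0, 1))

def downscale(image, factor=2):
    """Crude stand-in for a scale change: keep every `factor`-th pixel."""
    return image[::factor, ::factor]

rng = np.random.default_rng(0)
image = rng.standard_normal((16, 16))
kernel = rng.standard_normal((3, 3))

# Equivariance test: transform-then-convolve vs. convolve-then-transform.
translation_equivariant = np.allclose(
    circular_conv2d(translate(image, 3, 5), kernel),
    translate(circular_conv2d(image, kernel), 3, 5),
)
scale_equivariant = np.allclose(
    circular_conv2d(downscale(image), kernel),
    downscale(circular_conv2d(image, kernel)),
)

print("translation equivariant:", translation_equivariant)
print("scale equivariant:", scale_equivariant)
```

With random data the first check holds and the second fails, which is the gap the abstract says scale equivariant steerable blocks are meant to close for depth.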

Citation

Pages: 664-683

Page count: 20


