DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection

被引:17
|
作者
Kumar, Abhinav [1 ]
Brazil, Garrick [2 ]
Corona, Enrique [3 ]
Parchami, Armin [3 ]
Liu, Xiaoming [1 ]
机构
[1] Michigan State Univ, E Lansing, MI 48824 USA
[2] Meta AI, Menlo Pk, CA USA
[3] Ford Motor Co, Detroit, MI USA
来源
关键词
Equivariance; Projective manifold; Monocular 3D detection;
D O I
10.1007/978-3-031-20077-9_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern neural networks use building blocks such as convolutions that are equivariant to arbitrary 2D translations. However, these vanilla blocks are not equivariant to arbitrary 3D translations in the projective manifold. Even then, all monocular 3D detectors use vanilla blocks to obtain the 3D coordinates, a task for which the vanilla blocks are not designed for. This paper takes the first step towards convolutions equivariant to arbitrary 3D translations in the projective manifold. Since the depth is the hardest to estimate for monocular detection, this paper proposes Depth EquiVarIAnt NeTwork (DEVIANT) built with existing scale equivariant steerable blocks. As a result, DEVIANT is equivariant to the depth translations in the projective manifold whereas vanilla networks are not. The additional depth equivariance forces the DEVIANT to learn consistent depth estimates, and therefore, DEVIANT achieves state-of-the-art monocular 3D detection results on KITTI and Waymo datasets in the image-only category and performs competitively to methods using extra information. Moreover, DEVIANT works better than vanilla networks in cross-dataset evaluation.
引用
收藏
页码:664 / 683
页数:20
相关论文
共 50 条
  • [1] Depth-enhancement network for monocular 3D object detection
    Liu, Guohua
    Lian, Haiyang
    Guo, Changrui
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)
  • [2] Categorical Depth Distribution Network for Monocular 3D Object Detection
    Reading, Cody
    Harakeh, Ali
    Chae, Julia
    Waslander, Steven L.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8551 - 8560
  • [3] DEPTH-ASSISTED JOINT DETECTION NETWORK FOR MONOCULAR 3D OBJECT DETECTION
    Lei, Jianjun
    Guo, Tingyi
    Peng, Bo
    Yu, Chuanbo
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2204 - 2208
  • [4] Monocular 3D Object Detection with Depth from Motion
    Wang, Tai
    Pang, Jiangmiao
    Lin, Dahua
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 386 - 403
  • [5] Deep Optics for Monocular Depth Estimation and 3D Object Detection
    Chang, Julie
    Wetzstein, Gordon
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10192 - 10201
  • [6] PDR: Progressive Depth Regularization for Monocular 3D Object Detection
    Sheng, Hualian
    Cai, Sijia
    Zhao, Na
    Deng, Bing
    Zhao, Min-Jian
    Lee, Gim Hee
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7591 - 7603
  • [7] Densely Constrained Depth Estimator for Monocular 3D Object Detection
    Li, Yingyan
    Chen, Yuntao
    He, Jiawei
    Zhang, Zhaoxiang
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 718 - 734
  • [8] A New Monocular 3D Object Detection with Neural Network
    Hong, Weijie
    Liu, Yiguang
    Zheng, Yunan
    Wang, Ying
    Shi, Xuelei
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 174 - 185
  • [9] Monocular 3D object detection with thermodynamic loss and decoupled instance depth
    Liu, Gang
    Xie, Xiaoxiao
    Yu, Qingchen
    CONNECTION SCIENCE, 2024, 36 (01)
  • [10] MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer
    Huang, Kuan-Chih
    Wu, Tsung-Han
    Su, Hung-Ting
    Hsu, Winston H.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4002 - 4011