AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

被引:47
|
作者
Liu, Zongdai [1 ]
Zhou, Dingfu [1 ]
Lu, Feixiang [1 ]
Fang, Jin [1 ]
Zhang, Liangjun [1 ]
机构
[1] Baidu Res, Natl Engn Lab Deep Learning Technol & Applicat, Robot & Autonomous Driving Lab, Beijing, Peoples R China
关键词
ACCURATE;
D O I
10.1109/ICCV48922.2021.01535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing deep learning-based approaches for monocular 3D object detection in autonomous driving often model the object as a rotated 3D cuboid while the object's geometric shape has been ignored. In this work, we propose an approach for incorporating the shape-aware 2D/3D constraints into the 3D detection framework. Specifically, we employ the deep neural network to learn distinguished 2D keypoints in the 2D image domain and regress their corresponding 3D coordinates in the local 3D object coordinate first. Then the 2D/3D geometric constraints are built by these correspondences for each object to boost the detection performance. For generating the ground truth of 2D/3D keypoints, an automatic model-fitting approach has been proposed by fitting the deformed 3D object model and the object mask in the 2D image. The proposed framework has been verified on the public KITTI dataset and the experimental results demonstrate that by using additional geometrical constraints the detection performance has been significantly improved as compared to the baseline method. More importantly, the proposed framework achieves state-of-the-art performance with real time. Data and code will be available at https://github.com/ zongdai/AutoShape
引用
收藏
页码:15621 / 15630
页数:10
相关论文
共 50 条
  • [21] Real-Time Monocular Object-Model Aware Sparse SLAM
    Hosseinzadeh, Mehdi
    Li, Kejie
    Latif, Yasir
    Reid, Ian
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 7123 - 7129
  • [22] Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction
    Ku, Jason
    Pon, Alex D.
    Waslander, Steven L.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11859 - 11868
  • [23] Semantic Shape and Trajectory Reconstruction for Monocular Cooperative 3D Object Detection
    Cserni, Márton
    Rövid, András
    [J]. IEEE Access, 2024, 12 : 167153 - 167167
  • [24] SASD: A Shape-Aware Saliency Object Detection Approach for RGB-D Images
    Zi, Lingling
    Cong, Xin
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 179 - 190
  • [25] TR3D: TOWARDS REAL-TIME INDOOR 3D OBJECT DETECTION
    Rukhovich, Danila
    Vorontsova, Anna
    Konushin, Anton
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 281 - 285
  • [26] A 3D Convolutional Neural Network Towards Real-time Amodal 3D Object Detection
    Sun, Hao
    Meng, Zehui
    Du, Xinxin
    Ang, Marcelo H., Jr.
    [J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 8331 - 8338
  • [27] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [28] Disentangling Monocular 3D Object Detection
    Simonelli, Andrea
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Lopez-Antequera, Manuel
    Kontschieder, Peter
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999
  • [29] An Approach to 3D Object Detection in Real-Time for Cognitive Robotics Experiments
    Vidal-Soroa, Daniel
    Furelos, Pedro
    Bellas, Francisco
    Antonio Becerra, Jose
    [J]. ROBOT2022: FIFTH IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, VOL 1, 2023, 589 : 283 - 294
  • [30] PIXOR: Real-time 3D Object Detection from Point Clouds
    Yang, Bin
    Luo, Wenjie
    Urtasun, Raquel
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7652 - 7660