AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

被引:47
|
作者
Liu, Zongdai [1 ]
Zhou, Dingfu [1 ]
Lu, Feixiang [1 ]
Fang, Jin [1 ]
Zhang, Liangjun [1 ]
机构
[1] Baidu Res, Natl Engn Lab Deep Learning Technol & Applicat, Robot & Autonomous Driving Lab, Beijing, Peoples R China
关键词
ACCURATE;
D O I
10.1109/ICCV48922.2021.01535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing deep learning-based approaches for monocular 3D object detection in autonomous driving often model the object as a rotated 3D cuboid while the object's geometric shape has been ignored. In this work, we propose an approach for incorporating the shape-aware 2D/3D constraints into the 3D detection framework. Specifically, we employ the deep neural network to learn distinguished 2D keypoints in the 2D image domain and regress their corresponding 3D coordinates in the local 3D object coordinate first. Then the 2D/3D geometric constraints are built by these correspondences for each object to boost the detection performance. For generating the ground truth of 2D/3D keypoints, an automatic model-fitting approach has been proposed by fitting the deformed 3D object model and the object mask in the 2D image. The proposed framework has been verified on the public KITTI dataset and the experimental results demonstrate that by using additional geometrical constraints the detection performance has been significantly improved as compared to the baseline method. More importantly, the proposed framework achieves state-of-the-art performance with real time. Data and code will be available at https://github.com/ zongdai/AutoShape
引用
收藏
页码:15621 / 15630
页数:10
相关论文
共 50 条
  • [1] Shape-Aware Monocular 3D Object Detection
    Chen, Wei
    Zhao, Jie
    Zhao, Wan-Lei
    Wu, Song-Yuan
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
  • [2] Real-Time Stereo 3D Car Detection With Shape-Aware Non-Uniform Sampling
    Gao, Aqi
    Cao, Jiale
    Pang, Yanwei
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) : 4027 - 4037
  • [3] Convolutional Shape-Aware Representation for 3D Object Classification
    Ghodrati, Hamed
    Luciano, Lorenzo
    Ben Hamza, A.
    [J]. NEURAL PROCESSING LETTERS, 2019, 49 (02) : 797 - 817
  • [4] Convolutional Shape-Aware Representation for 3D Object Classification
    Hamed Ghodrati
    Lorenzo Luciano
    A. Ben Hamza
    [J]. Neural Processing Letters, 2019, 49 : 797 - 817
  • [5] Real-Time 3D Object Detection and Tracking in Monocular Images of Cluttered Environment
    Du, Guoguang
    Wang, Kai
    Nan, Yibing
    Lian, Shiguo
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 119 - 130
  • [6] Deep shape-aware descriptor for nonrigid 3D object retrieval
    Ghodrati, Hamed
    Ben Hamza, A.
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2016, 5 (03) : 151 - 164
  • [7] Hardware-Aware Latency Pruning for Real-Time 3D Object Detection
    Shen, Maying
    Mao, Lei
    Chen, Joshua
    Hsu, Justin
    Sun, Xinglong
    Knieps, Oliver
    Maxim, Carmen
    Alvarez, Jose M.
    [J]. 2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [8] Object-Aware Centroid Voting for Monocular 3D Object Detection
    Bao, Wentao
    Yu, Qi
    Kong, Yu
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 2197 - 2204
  • [9] Real-Time 3D Object Detection on Crowded Pedestrians
    Lu, Bin
    Li, Qing
    Liang, Yanju
    [J]. SENSORS, 2023, 23 (21)
  • [10] Real-time 3D Object Detection in Unstructured Environments
    Rui, Wang
    Ying, Liang
    [J]. PROCEEDINGS FIRST INTERNATIONAL CONFERENCE ON ELECTRONICS INSTRUMENTATION & INFORMATION SYSTEMS (EIIS 2017), 2017, : 183 - 188