Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image

Citations: 0
Authors
Liu, Feng [1 ]
Liu, Xiaoming [1 ]
Affiliations
[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
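Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract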
Inferring 3D locations and shapes of multiple objects from a single 2D image is a long-standing objective of computer vision. Most existing works either predict only one of these 3D properties or solve both for a single object. One fundamental challenge lies in learning an image representation that is well suited to both 3D detection and reconstruction. In this work, we propose to learn a regular grid of 3D voxel features from the input image, aligned with the 3D scene space via a 3D feature lifting operator. Based on the 3D voxel features, our novel CenterNet-3D detection head formulates 3D detection as keypoint detection in 3D space. Moreover, we devise an efficient coarse-to-fine reconstruction module, including coarse-level voxelization and a novel local PCA-SDF shape representation, which enables fine-detail reconstruction and inference one order of magnitude faster than prior methods. With complementary supervision from both 3D detection and reconstruction, the 3D voxel features become geometry- and context-preserving, benefiting both tasks. The effectiveness of our approach is demonstrated through 3D detection and reconstruction in single-object and multiple-object scenarios. Code is available at http://cvlab.cse.msu.edu/project-mdr.html.
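The abstract does not spell out how the 3D feature lifting operator works. As a rough illustration of the general idea only (not the authors' operator), the sketch below projects each voxel center into the image with a pinhole camera and copies the feature at the nearest pixel. The function names, the nearest-pixel sampling, and the intrinsics `fx`, `fy`, `cx`, `cy` are all assumptions made for this example.

```python
import math
from typing import List, Optional, Sequence, Tuple

# feature_map[v][u] is the feature vector at pixel row v, column u.
FeatureMap = List[List[List[float]]]


def project_point(
    point: Sequence[float], fx: float, fy: float, cx: float, cy: float
) -> Optional[Tuple[float, float]]:
    """Project a camera-space point (x, y, z) to pixel coordinates (u, v).

    Uses a plain pinhole model (an assumption for this sketch).
    Returns None for points at or behind the camera plane.
    """
    x, y, z = point
    if z <= 0.0:
        return None
    return (fx * x / z + cx, fy * y / z + cy)


def lift_features(
    feature_map: FeatureMap,
    voxel_centers: Sequence[Sequence[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[List[float]]:
    """Assign each voxel center the 2D feature at its projected pixel.

    Voxels that project outside the image, or lie behind the camera,
    receive an all-zero feature vector.
    """
    height = len(feature_map)
    width = len(feature_map[0])
    channels = len(feature_map[0][0])
    lifted = []
    for center in voxel_centers:
        feature = [0.0] * channels
        uv = project_point(center, fx, fy, cx, cy)
        if uv is not None:
            # Round half up to the nearest pixel index.
            u = math.floor(uv[0] + 0.5)
            v = math.floor(uv[1] + 0.5)
            if 0 <= u < width and 0 <= v < height:
                feature = list(feature_map[v][u])
        lifted.append(feature)
    return lifted


if __name__ == "__main__":
    # Hypothetical 2x2 feature map with one channel per pixel.
    demo_map = [[[1.0], [2.0]], [[3.0], [4.0]]]
    demo_centers = [(0.0, 0.0, 1.0), (2.0, 2.0, 2.0), (5.0, 0.0, 1.0)]
    print(lift_features(demo_map, demo_centers, 1.0, 1.0, 0.0, 0.0))
```

Because every voxel along a camera ray projects to the same pixel, all of them receive the same 2D feature in this sketch. A learned network operating on the voxel grid would then have to resolve depth, which is one plausible reason to supervise the grid with both detection and reconstruction.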
Pages: 14
Related papers
50 in total
  • [31] Research on 3D Reconstruction Based on a Single Image
    Yu Yong-yan
    Wang Zhi-jian
    PROGRESS IN MEASUREMENT AND TESTING, PTS 1 AND 2, 2010, 108-111 : 3 - 10
  • [32] The Research of 3D Reconstruction Based on Single Image
    Zhang, Yong
    Zhang, Li
    PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON COMPUTER, COMMUNICATION, CONTROL AND AUTOMATION, 2013, 68 : 59 - 62
  • [33] A 3D Reconstruction Algorithm of Multiple Objects
    Li, Heyi
    Zhang, Tao
    Li, Xiaohan
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2013, : 2673 - 2678
  • [34] Nonrigid 3D Reconstruction from a Single Image
    Ma, Wen-juan
    2016 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI 2016), 2016, : 138 - 142
  • [35] MEMORY AND PROCESSING ARCHITECTURE FOR 3D VOXEL-BASED IMAGERY
    KAUFMAN, A
    BAKALASH, R
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 1988, 8 (06) : 10 - 23
  • [36] Voxel-based, image source-independent 3D asymmetry quantification in the maxillofacial region
    Lin, Yu-Cheng
    Fang, Jing-Jing
    MANAGEMENT, MANUFACTURING AND MATERIALS ENGINEERING, PTS 1 AND 2, 2012, 452-453 : 165 - 169
  • [37] A voxel-based 3D indoor model to support 3D pedestrian evacuation simulations
    Xie, Ruihang
    Zlatanova, Sisi
    Aleksandrov, Mitko
    Lee, Jinwoo
    Journal of Building Engineering, 2024, 98
  • [38] Self-learning Voxel-based Multi-camera Occlusion Maps for 3D Reconstruction
    Slembrouck, Maarten
    Van Cauwelaert, Dimitri
    Van Hamme, David
    Van Haerenborgh, Dirk
    Van Hese, Peter
    Veelaert, Peter
    Philips, Wilfried
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 502 - 509
  • [39] 3D hand reconstruction from a single image based on biomechanical constraints
    Li, Guiqing
    Wu, Zihui
    Liu, Yuxin
    Zhang, Huiqian
    Nie, Yongwei
    Mao, Aihua
    VISUAL COMPUTER, 2021, 37 (9-11) : 2699 - 2711
  • [40] A Geometry-Based 3D Reconstruction from a Single Omnidirectional Image
    Wahyono
    Joko, Hariyono
    Vavilin, Andrey
    Jo, Kang-Hyun
    PROCEEDINGS OF THE 19TH KOREA-JAPAN JOINT WORKSHOP ON FRONTIERS OF COMPUTER VISION (FCV 2013), 2013, : 295 - 299