Holistic 3D Scene Understanding from a Single Image with Implicit Representation

被引:35
|
作者
Zhang, Cheng [2 ]
Cui, Zhaopeng [1 ]
Zhang, Yinda [3 ]
Zeng, Bing [2 ]
Pollefeys, Marc [4 ]
Liu, Shuaicheng [2 ]
机构
[1] Zhejiang Univ, State Key Lab Cad & CG, Hangzhou, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[3] Google, Mountain View, CA 94043 USA
[4] Swiss Fed Inst Technol, Zurich, Switzerland
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR46437.2021.00872
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate estimation of both shapes and layout especially for the cluttered scene due to the heavy occlusion between objects. We propose to utilize the latest deep implicit representation to solve this challenge. We not only propose an image-based local structured implicit network to improve the object shape estimation, but also refine the 3D object pose and scene layout via a novel implicit scene graph neural network that exploits the implicit local object features. A novel physical violation loss is also proposed to avoid incorrect context between objects. Extensive experiments demonstrate that our method outperforms the state-of-the-art methods in terms of object shape, scene layout estimation, and 3D object detection.
引用
收藏
页码:8829 / 8838
页数:10
相关论文
共 50 条
  • [31] Color Constancy Using 3D Scene Geometry Derived From a Single Image
    Elfiky, Noha
    Gevers, Theo
    Gijsenij, Arjan
    Gonzalez, Jordi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) : 3855 - 3868
  • [32] Towards Accurate Reconstruction of 3D Scene Shape From A Single Monocular Image
    Yin, Wei
    Zhang, Jianming
    Wang, Oliver
    Niklaus, Simon
    Chen, Simon
    Liu, Yifan
    Shen, Chunhua
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6480 - 6494
  • [33] Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation
    Huang, Siyuan
    Qi, Siyuan
    Xiao, Yinxue
    Zhu, Yixin
    Wu, Ying Nian
    Zhu, Song-Chun
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [34] 3D Traffic Scene Understanding from Movable Platforms
    Geiger, Andreas
    Lauer, Martin
    Wojek, Christian
    Stiller, Christoph
    Urtasun, Raquel
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (05) : 1012 - 1025
  • [35] Semi-Supervised 3D Holistic Human Mesh Reconstruction from A Single Image
    Santoso, Joshua
    Williem
    [J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [36] Machine learning for scene 3D reconstruction using a single image
    Knyaz, Vladimir
    [J]. OPTICS, PHOTONICS AND DIGITAL TECHNOLOGIES FOR IMAGING APPLICATIONS VI, 2021, 11353
  • [37] Single Image 3D Without a Single 3D Image
    Fouhey, David F.
    Hussain, Wajahat
    Gupta, Abhinav
    Hebert, Martial
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1053 - 1061
  • [38] Reconstruction and Representation for 3D Implicit Surfaces
    Wang, Chung-Shing
    Chang, Teng-Rucy
    Lin, Man-Ching
    [J]. ADVANCES IN COMPUTER SCIENCE AND EDUCATION APPLICATIONS, PT II, 2011, 202 : 364 - +
  • [39] Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding
    Wang, Yunsong
    Zhao, Na
    Lee, Gim Hee
    [J]. 2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 158 - 168
  • [40] Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-Training
    Gao, Yipeng
    Wang, Zeyu
    Zheng, Wei-Shi
    Xie, Cihang
    Zhou, Yuyin
    [J]. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2024, : 22998 - 23008