Holistic 3D Scene Understanding from a Single Image with Implicit Representation

被引:35
|
作者
Zhang, Cheng [2 ]
Cui, Zhaopeng [1 ]
Zhang, Yinda [3 ]
Zeng, Bing [2 ]
Pollefeys, Marc [4 ]
Liu, Shuaicheng [2 ]
机构
[1] Zhejiang Univ, State Key Lab Cad & CG, Hangzhou, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[3] Google, Mountain View, CA 94043 USA
[4] Swiss Fed Inst Technol, Zurich, Switzerland
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR46437.2021.00872
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate estimation of both shapes and layout especially for the cluttered scene due to the heavy occlusion between objects. We propose to utilize the latest deep implicit representation to solve this challenge. We not only propose an image-based local structured implicit network to improve the object shape estimation, but also refine the 3D object pose and scene layout via a novel implicit scene graph neural network that exploits the implicit local object features. A novel physical violation loss is also proposed to avoid incorrect context between objects. Extensive experiments demonstrate that our method outperforms the state-of-the-art methods in terms of object shape, scene layout estimation, and 3D object detection.
引用
收藏
页码:8829 / 8838
页数:10
相关论文
共 50 条
  • [1] Holistic 3D Scene Understanding from a Single Geo-tagged Image
    Wang, Shenlong
    Fidler, Sanja
    Urtasun, Raquel
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3964 - 3972
  • [2] Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image
    Huang, Siyuan
    Qi, Siyuan
    Zhu, Yixin
    Xiao, Yinxue
    Xu, Yuanlu
    Zhu, Song-Chun
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 194 - 211
  • [3] Stylizing 3D Scene via Implicit Representation and HyperNetwork
    Chiang, Pei-Ze
    Tsai, Meng-Shiun
    Tseng, Hung-Yu
    Lai, Wei-Sheng
    Chiu, Wei-Chen
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 215 - 224
  • [4] Holistic Scene Understanding for 3D Object Detection with RGBD cameras
    Lin, Dahua
    Fidler, Sanja
    Urtasun, Raquel
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1417 - 1424
  • [5] Holistic 3D Body Reconstruction From a Blurred Single Image
    Santoso, Joshua
    Williem
    Park, In Kyu
    [J]. IEEE ACCESS, 2022, 10 : 115399 - 115410
  • [6] Holistic 3D Human and Scene Mesh Estimation from Single View Images
    Weng, Zhenzhen
    Yeung, Serena
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 334 - 343
  • [7] FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding
    Zuo, Xingxing
    Samangouei, Pouya
    Zhou, Yunwen
    Di, Yan
    Li, Mingyang
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024,
  • [8] Inferring 3D scene structure from a single polarization image
    Rahmann, S
    [J]. POLARIZATION AND COLOR TECHNIQUES IN INDUSTRIAL INSPECTION, 1999, 3826 : 22 - 33
  • [9] Learning to Recover 3D Scene Shape from a Single Image
    Yin, Wei
    Zhang, Jianming
    Wang, Oliver
    Niklaus, Simon
    Mai, Long
    Chen, Simon
    Shen, Chunhua
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 204 - 213
  • [10] Panoptic 3D Scene Reconstruction From a Single RGB Image
    Dahnert, Manuel
    Hou, Ji
    Niessner, Matthias
    Dai, Angela
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34