Holistic 3D Scene Understanding from a Single Image with Implicit Representation

Cited by: 35

Authors:

Zhang, Cheng [2]

Cui, Zhaopeng [1]

Zhang, Yinda [3]

Zeng, Bing [2]

Pollefeys, Marc [4]

Liu, Shuaicheng [2]

Affiliations:

[1] Zhejiang Univ, State Key Lab CAD & CG, Hangzhou, Peoples R China

[2] Univ Elect Sci & Technol China, Chengdu, Peoples R China

[3] Google, Mountain View, CA 94043 USA

[4] Swiss Fed Inst Technol, Zurich, Switzerland

Source:

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021

Funding:

National Natural Science Foundation of China

DOI:

10.1109/CVPR46437.2021.00872

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate estimation of both shapes and layout especially for the cluttered scene due to the heavy occlusion between objects. We propose to utilize the latest deep implicit representation to solve this challenge. We not only propose an image-based local structured implicit network to improve the object shape estimation, but also refine the 3D object pose and scene layout via a novel implicit scene graph neural network that exploits the implicit local object features. A novel physical violation loss is also proposed to avoid incorrect context between objects. Extensive experiments demonstrate that our method outperforms the state-of-the-art methods in terms of object shape, scene layout estimation, and 3D object detection.

Pages: 8829-8838

Page count: 10
