Holistic 3D Scene Understanding from a Single Image with Implicit Representation

被引：35

作者：

Zhang, Cheng ^{[2
]}

Cui, Zhaopeng ^{[1
]}

Zhang, Yinda ^{[3
]}

Zeng, Bing ^{[2
]}

Pollefeys, Marc ^{[4
]}

Liu, Shuaicheng ^{[2
]}

机构：

[1] Zhejiang Univ, State Key Lab Cad & CG, Hangzhou, Peoples R China

[2] Univ Elect Sci & Technol China, Chengdu, Peoples R China

[3] Google, Mountain View, CA 94043 USA

[4] Swiss Fed Inst Technol, Zurich, Switzerland

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/CVPR46437.2021.00872

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate estimation of both shapes and layout especially for the cluttered scene due to the heavy occlusion between objects. We propose to utilize the latest deep implicit representation to solve this challenge. We not only propose an image-based local structured implicit network to improve the object shape estimation, but also refine the 3D object pose and scene layout via a novel implicit scene graph neural network that exploits the implicit local object features. A novel physical violation loss is also proposed to avoid incorrect context between objects. Extensive experiments demonstrate that our method outperforms the state-of-the-art methods in terms of object shape, scene layout estimation, and 3D object detection.

引用

页码：8829 / 8838

页数：10

共 50 条

[31] Color Constancy Using 3D Scene Geometry Derived From a Single Image
Elfiky, Noha
Gevers, Theo
Gijsenij, Arjan
Gonzalez, Jordi
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) : 3855 - 3868
[32] Towards Accurate Reconstruction of 3D Scene Shape From A Single Monocular Image
Yin, Wei
Zhang, Jianming
Wang, Oliver
Niklaus, Simon
Chen, Simon
Liu, Yifan
Shen, Chunhua
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6480 - 6494
[33] Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation
Huang, Siyuan
Qi, Siyuan
Xiao, Yinxue
Zhu, Yixin
Wu, Ying Nian
Zhu, Song-Chun
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[34] 3D Traffic Scene Understanding from Movable Platforms
Geiger, Andreas
Lauer, Martin
Wojek, Christian
Stiller, Christoph
Urtasun, Raquel
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (05) : 1012 - 1025
[35] Semi-Supervised 3D Holistic Human Mesh Reconstruction from A Single Image
Santoso, Joshua
Williem
[J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
[36] Machine learning for scene 3D reconstruction using a single image
Knyaz, Vladimir
[J]. OPTICS, PHOTONICS AND DIGITAL TECHNOLOGIES FOR IMAGING APPLICATIONS VI, 2021, 11353
[37] Single Image 3D Without a Single 3D Image
Fouhey, David F.
Hussain, Wajahat
Gupta, Abhinav
Hebert, Martial
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1053 - 1061
[38] Reconstruction and Representation for 3D Implicit Surfaces
Wang, Chung-Shing
Chang, Teng-Rucy
Lin, Man-Ching
[J]. ADVANCES IN COMPUTER SCIENCE AND EDUCATION APPLICATIONS, PT II, 2011, 202 : 364 - +
[39] Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding
Wang, Yunsong
Zhao, Na
Lee, Gim Hee
[J]. 2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 158 - 168
[40] Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-Training
Gao, Yipeng
Wang, Zeyu
Zheng, Wei-Shi
Xie, Cihang
Zhou, Yuyin
[J]. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2024, : 22998 - 23008

← 1 2 3 4 5 →