3D-Aware Scene Manipulation via Inverse Graphics

被引:0
|
作者
Yao, Shunyu [1 ,2 ]
Hsu, Tzu-Ming Harry [2 ]
Zhu, Jun-Yan [2 ]
Wu, Jiajun [2 ]
Torralba, Antonio [2 ]
Freeman, William T. [3 ]
Tenenbaum, Joshua B. [2 ,3 ]
机构
[1] Tsinghua Univ, IIIS, Beijing, Peoples R China
[2] MIT, CSAIL, Cambridge, MA 02139 USA
[3] MIT, CSAIL, Google Res, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We aim to obtain an interpretable, expressive, and disentangled scene representation that contains comprehensive structural and textural information for each object. Previous scene representations learned by neural networks are often uninterpretable, limited to a single object, or lacking 3D knowledge. In this work, we propose 3D scene de-rendering networks (3D-SDN) to address the above issues by integrating disentangled representations for semantics, geometry, and appearance into a deep generative model. Our scene encoder performs inverse graphics, translating a scene into a structured object-wise representation. Our decoder has two components: a differentiable shape renderer and a neural texture generator. The disentanglement of semantics, geometry, and appearance supports 3D-aware scene manipulation, e.g., rotating and moving objects freely while keeping the consistent shape and texture, and changing the object appearance without affecting its shape. Experiments demonstrate that our editing scheme based on 3D-SDN is superior to its 2D counterpart.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] 3D-Aware Indoor Scene Synthesis with Depth Priors
    Shi, Zifan
    Shen, Yujun
    Zhu, Jiapeng
    Yeung, Dit-Yan
    Chen, Qifeng
    [J]. COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 406 - 422
  • [2] 3D-Aware Scene Change Captioning From Multiview Images
    Qiu, Yue
    Satoh, Yutaka
    Suzuki, Ryota
    Iwata, Kenji
    Kataoka, Hirokatsu
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03): : 4743 - 4750
  • [3] Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis
    Do, Hoseok
    Yoo, EunKyung
    Kim, Taehyeong
    Lee, Chul
    Choi, Tin Young
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8529 - 8538
  • [4] 3D-Aware Face Swapping
    Li, Yixuan
    Ma, Chao
    Yan, Yichao
    Zhu, Wenhan
    Yang, Xiaokang
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12705 - 12714
  • [5] 3D-aware Conditional Image Synthesis
    Deng, Kangle
    Yang, Gengshan
    Ramanan, Deva
    Zhu, Jun-Yan
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4434 - 4445
  • [6] DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis
    Xu, Yinghao
    Chai, Menglei
    Shi, Zifan
    Peng, Sida
    Skorokhodov, Ivan
    Siarohin, Aliaksandr
    Yang, Ceyuan
    Shen, Yujun
    Lee, Hsin-Ying
    Zhou, Bolei
    Tulyakov, Sergey
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4402 - 4412
  • [7] 3D-aware Blending with Generative NeRFs
    Kim, Hyunsu
    Lee, Gayoung
    Choi, Yunjey
    Kim, Jin-Hwa
    Zhu, Jun-Yan
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22849 - 22861
  • [8] 3D-Aware Object Goal Navigation via Simultaneous Exploration and Identification
    Zhang, Jiazhao
    Dai, Liu
    Meng, Fanpeng
    Fan, Qingnan
    Chen, Xuelin
    Xu, Kai
    Wang, He
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6672 - 6682
  • [9] Semantic 3D-Aware Portrait Synthesis and Manipulation Based on Compositional Neural Radiance Field
    Ma, Tianxiang
    Li, Bingchuan
    He, Qian
    Dong, Jing
    Tan, Tieniu
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1878 - 1886
  • [10] 3D-aware Image Synthesis via Learning Structural and Textural Representations
    Xu, Yinghao
    Peng, Sida
    Yang, Ceyuan
    Shen, Yujun
    Zhou, Bolei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18409 - 18418