Intrinsic Image Diffusion for Indoor Single-view Material Estimation

被引:1
|
作者
Kocsis, Peter [1 ]
Sitzmann, Vincent [2 ]
Niessner, Matthias [1 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] MIT, EECS, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR52733.2024.00497
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Intrinsic Image Diffusion, a generative model for appearance decomposition of indoor scenes. Given a single input view, we sample multiple possible material explanations represented as albedo, roughness, and metallic maps. Appearance decomposition poses a considerable challenge in computer vision due to the inherent ambiguity between lighting and material properties and the lack of real datasets. To address this issue, we advocate for a probabilistic formulation, where instead of attempting to directly predict the true material properties, we employ a conditional generative model to sample from the solution space. Furthermore, we show that utilizing the strong learned prior of recent diffusion models trained on large-scale real-world images can be adapted to material estimation and highly improves the generalization to real images. Our method produces significantly sharper, more consistent, and more detailed materials, outperforming state-of-the-art methods by 1.5dB on PSNR and by 45% better FID score on albedo prediction. We demonstrate the effectiveness of our approach through experiments on both synthetic and real-world datasets.
引用
收藏
页码:5198 / 5208
页数:11
相关论文
共 50 条
  • [1] SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion
    Ho, Hsuan-I
    Song, Jie
    Hilliges, Otmar
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 538 - 549
  • [2] Aerial Single-View Depth Completion With Image-Guided Uncertainty Estimation
    Teixeira, Lucas
    Oswald, Martin R.
    Pollefeys, Marc
    Chli, Margarita
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) : 1055 - 1062
  • [3] Multi-View Image Generation from a Single-View
    Zhao, Bo
    Wu, Xiao
    Cheng, Zhi-Qi
    Liu, Hao
    Jie, Zequn
    Feng, Jiashi
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 383 - 391
  • [4] Weakly-supervised Single-view Image Relighting
    Yi, Renjiao
    Zhu, Chenyang
    Xu, Kai
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8402 - 8411
  • [5] Translational photometric alignment of single-view image sequences
    Harrison, Adam P.
    Joseph, Dileepan
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2012, 116 (06) : 765 - 776
  • [6] NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
    Deng, Congyue
    Jiang, Chiyu Max
    Qi, Charles R.
    Yan, Xinchen
    Zhou, Yin
    Guibas, Leonidas
    Anguelov, Dragomir
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 20637 - 20647
  • [7] SINGLE-VIEW MAMMOGRAPHY
    WEISHAAR, J
    PATEROK, EM
    MULLER, A
    WILLGEROTH, F
    DEUTSCHE MEDIZINISCHE WOCHENSCHRIFT, 1976, 101 (51) : 1865 - 1866
  • [8] Generating full-view face images from a single-view image
    Zhong, Lei
    Bai, ChangMin
    Li, Jianfeng
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Generating full-view face images from a single-view image
    Zhong, Lei
    Bai, ChangMin
    Li, Jianfeng
    Proceedings of the International Joint Conference on Neural Networks, 2021, 2021-July
  • [10] Single-View Food Portion Estimation Based on Geometric Models
    Fang, Shaobo
    Liu, Chang
    Zhu, Fengqing
    Delp, Edward J.
    Boushey, Carol J.
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 385 - 390