Intrinsic Image Diffusion for Indoor Single-view Material Estimation

被引:1
|
作者
Kocsis, Peter [1 ]
Sitzmann, Vincent [2 ]
Niessner, Matthias [1 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] MIT, EECS, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR52733.2024.00497
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Intrinsic Image Diffusion, a generative model for appearance decomposition of indoor scenes. Given a single input view, we sample multiple possible material explanations represented as albedo, roughness, and metallic maps. Appearance decomposition poses a considerable challenge in computer vision due to the inherent ambiguity between lighting and material properties and the lack of real datasets. To address this issue, we advocate for a probabilistic formulation, where instead of attempting to directly predict the true material properties, we employ a conditional generative model to sample from the solution space. Furthermore, we show that utilizing the strong learned prior of recent diffusion models trained on large-scale real-world images can be adapted to material estimation and highly improves the generalization to real images. Our method produces significantly sharper, more consistent, and more detailed materials, outperforming state-of-the-art methods by 1.5dB on PSNR and by 45% better FID score on albedo prediction. We demonstrate the effectiveness of our approach through experiments on both synthetic and real-world datasets.
引用
收藏
页码:5198 / 5208
页数:11
相关论文
共 50 条
  • [21] Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry
    Bae, Gwangbin
    Budvytis, Ignas
    Cipolla, Roberto
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2832 - 2841
  • [22] A single-view based framework for robust estimation of height and position of moving people
    Lee, Seok-Han
    Choi, Jong-Soo
    ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PROCEEDINGS, 2007, 4872 : 562 - 574
  • [23] Weakly Supervised Monocular 3D Detection with a Single-View Image
    Jiang, Xueying
    Jin, Sheng
    Lu, Lewei
    Zhang, Xiaoqin
    Lu, Shijian
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10508 - 10518
  • [24] A Single-View Based Framework for Robust Estimation of Heights and Positions of Moving People
    Lee, Seok-Han
    Kim, Tae-Eun
    Choi, Jong-Soo
    2010 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS ICCE, 2010,
  • [25] Mass detection in single-view mammograms
    Abdel-Mottaleb, Mohamed
    Carman, Charles S.
    Hill, Charles R.
    Eliot, Gail
    Mankovich, Nicholas J.
    Journal of Digital Imaging, 1997, 10 (3 Suppl 1): : 222 - 223
  • [26] Mass detection in single-view mammograms
    AbdelMottaleb, M
    Carman, CS
    Hill, CR
    Eliot, G
    Mankovich, NJ
    JOURNAL OF DIGITAL IMAGING, 1997, 10 (03) : 222 - 223
  • [27] On the Uncertain Single-View Depths in Colonoscopies
    Rodriguez-Puigvert, Javier
    Recasens, David
    Civera, Javier
    Martinez-Cantin, Ruben
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III, 2022, 13433 : 130 - 140
  • [28] Single-View Distance-Estimation-Based Formation Control of Robotic Swarms
    Fidan, Baris
    Gazi, Veysel
    Zhai, Shaohao
    Cen, Na
    Karatas, Engin
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2013, 60 (12) : 5781 - 5791
  • [29] 3D Reconstruction and Estimation from Single-view 2D Image by Deep Learning A Survey
    Shan, Yongfeng
    Liang, Christy Jie
    Xu, Min
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1 - 7
  • [30] SINGLE-VIEW RECAPTURED IMAGE DETECTION BASED ON PHYSICS-BASED FEATURES
    Gao, Xinting
    Ng, Tian-Tsong
    Qiu, Bo
    Chang, Shih-Fu
    2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1469 - 1474