High-resolution image reconstruction with latent diffusion models from human brain activity

被引:37
|
作者
Takagi, Yu [1 ,2 ]
Nishimoto, Shinji [1 ,2 ]
机构
[1] Osaka Univ, Grad Sch Frontier Biosci, Suita, Osaka, Japan
[2] NICT, CiNet, Osaka, Japan
关键词
NATURAL IMAGES;
D O I
10.1109/CVPR52729.2023.01389
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reconstructing visual experiences from human brain activity offers a unique way to understand how the brain represents the world, and to interpret the connection between computer vision models and our visual system. While deep generative models have recently been employed for this task, reconstructing realistic images with high semantic fidelity is still a challenging problem. Here, we propose a new method based on a diffusion model (DM) to reconstruct images from human brain activity obtained via functional magnetic resonance imaging (fMRI). More specifically, we rely on a latent diffusion model (LDM) termed Stable Diffusion. This model reduces the computational cost of DMs, while preserving their high generative performance. We also characterize the inner mechanisms of the LDM by studying how its different components (such as the latent vector of image Z, conditioning inputs C, and different elements of the denoising U-Net) relate to distinct brain functions. We show that our proposed method can reconstruct high-resolution images with high fidelity in straight-forward fashion, without the need for any additional training and fine-tuning of complex deep-learning models. We also provide a quantitative interpretation of different LDM components from a neuroscientific perspective. Overall, our study proposes a promising method for reconstructing images from human brain activity, and provides a new framework for understanding DMs. Please check out our webpage at https://sites.google.com/view/stablediffusion-withbrain/.
引用
收藏
页码:14453 / 14463
页数:11
相关论文
共 50 条
  • [21] Biorthogonal wavelet system for high-resolution image reconstruction
    Shen, LX
    Sun, QY
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) : 1997 - 2011
  • [22] High-resolution image reconstruction and its fast algorithm
    Pei, Shengwei
    Du, Minghui
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 1, PROCEEDINGS, 2006, : 8 - +
  • [23] HIGH-RESOLUTION IMAGE-RECONSTRUCTION BY SIMULATED ANNEALING
    NUMNONDA, T
    ANDREWS, M
    KAKARALA, R
    [J]. OPTICS COMMUNICATIONS, 1994, 108 (1-3) : 24 - 30
  • [24] An edge-preserving high-resolution image reconstruction
    Discepoli, M
    Gerace, I
    Pandolfi, R
    [J]. PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 77 - 82
  • [25] Phase-constrained reconstruction of high-resolution multi-shot diffusion weighted image
    Huang, Yiman
    Zhang, Xinlin
    Guo, Hua
    Chen, Huijun
    Guo, Di
    Huang, Feng
    Xu, Qin
    Qu, Xiaobo
    [J]. JOURNAL OF MAGNETIC RESONANCE, 2020, 312
  • [26] Generation of Orthoimage from High-Resolution DEM and High-Resolution Image
    Saati, M.
    Amini, J.
    Sadeghian, S.
    Hosseini, S. A.
    [J]. SCIENTIA IRANICA, 2008, 15 (05) : 568 - 574
  • [27] HIGH-RESOLUTION ANATOMY FROM IN-SITU HUMAN BRAIN
    TOGA, AW
    AMBACH, KL
    SCHLUENDER, S
    [J]. NEUROIMAGE, 1994, 1 (04) : 334 - 344
  • [28] Reconstruction of high-resolution image frames from a sequence of low-resolution and compressed observations
    Segall, CA
    Molina, R
    Katsaggelos, AK
    Mateos, J
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1701 - 1704
  • [29] High-resolution image reconstruction from rotated and translated low-resolution images with multisensors
    Wen, YW
    Ng, MK
    Ching, WK
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2004, 14 (02) : 75 - 83
  • [30] High-resolution image reconstruction from digital video by exploitation of nonglobal motion
    Tuinstra, TR
    Hardie, RC
    [J]. OPTICAL ENGINEERING, 1999, 38 (05) : 806 - 814