Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models

被引:16
|
作者
Chung, Hyungjin [1 ,2 ]
Ryu, Dohoon [1 ]
Mccann, Michael T. [2 ]
Klasky, Marc L. [2 ]
Ye, Jong Chul [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] Los Alamos Natl Lab, Los Alamos, NM 87545 USA
基金
新加坡国家研究基金会;
关键词
CONVOLUTIONAL NEURAL-NETWORK; COMPUTED-TOMOGRAPHY; RECONSTRUCTION; ALGORITHM;
D O I
10.1109/CVPR52729.2023.02159
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diffusion models have emerged as the new state-of-the-art generative model with high quality samples, with intriguing properties such as mode coverage and high flexibility. They have also been shown to be effective inverse problem solvers, acting as the prior of the distribution, while the information of the forward model can be granted at the sampling stage. Nonetheless, as the generative process remains in the same high dimensional (i.e. identical to data dimension) space, the models have not been extended to 3D inverse problems due to the extremely high memory and computational cost. In this paper, we combine the ideas from the conventional model-based iterative reconstruction with the modern diffusion models, which leads to a highly effective method for solving 3D medical image reconstruction tasks such as sparse-view tomography, limited angle tomography, compressed sensing MRI from pre-trained 2D diffusion models. In essence, we propose to augment the 2D diffusion prior with a model-based prior in the remaining direction at test time, such that one can achieve coherent reconstructions across all dimensions. Our method can be run in a single commodity GPU, and establishes the new state-of-the-art, showing that the proposed method can perform reconstructions of high fidelity and accuracy even in the most extreme cases (e.g. 2-view 3D tomography). We further reveal that the generalization capacity of the proposed method is surprisingly high, and can be used to reconstruct volumes that are entirely different from the training dataset. Code available: https://github.com/HJ-harry/DiffusionMBIR
引用
下载
收藏
页码:22542 / 22551
页数:10
相关论文
共 50 条
  • [1] Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models
    Lee, Suhyeon
    Chung, Hyungjin
    Park, Minyoung
    Park, Jonghyuk
    Ryu, Wi-Sun
    Ye, Jong Chul
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10676 - 10686
  • [2] Guiding 3D Digital Content Generation with Pre-Trained Diffusion Models
    Li, Jing
    Li, Zhengping
    Jiang, Peizhe
    Wang, Lijun
    Li, Xiaoxue
    Hao, Yuwen
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 1220 - 1230
  • [3] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders
    Zhang, Renrui
    Wang, Liuhui
    Qiao, Yu
    Gao, Peng
    Li, Hongsheng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21769 - 21780
  • [4] Action Recognition in Videos Using Pre-Trained 2D Convolutional Neural Networks
    Kim, Jun-Hwa
    Won, Chee Sun
    IEEE ACCESS, 2020, 8 : 60179 - 60188
  • [5] 3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment
    Zhu, Ziyu
    Ma, Xiaojian
    Chen, Yixin
    Deng, Zhidong
    Huang, Siyuan
    Li, Qing
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2899 - 2909
  • [6] MHAGuideNet: a 3D pre-trained guidance model for Alzheimer’s Disease diagnosis using 2D multi-planar sMRI images
    Yuanbi Nie
    Qiushi Cui
    Wenyuan Li
    Yang Lü
    Tianqing Deng
    BMC Medical Imaging, 24 (1)
  • [7] 3D Semantic Novelty Detection via Large-Scale Pre-Trained Models
    Rabino, Paolo
    Alliegro, Antonio
    Tommasi, Tatiana
    IEEE Access, 2024, 12 : 135352 - 135361
  • [8] MLPG refinement techniques for 2D and 3D diffusion problems
    Mazzia, Annamaria
    Pini, Giorgio
    Sartoretto, Flavio
    CMES - Computer Modeling in Engineering and Sciences, 2014, 102 (06): : 475 - 497
  • [9] MLPG Refinement Techniques for 2D and 3D Diffusion Problems
    Mazzia, Annamaria
    Pini, Giorgio
    Sartoretto, Flavio
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2014, 102 (06): : 475 - 497
  • [10] Leveraging Pre-Trained 3D Object Detection Models For Fast Ground Truth Generation
    Lee, Jungwook
    Walsh, Sean
    Harakeh, Ali
    Waslander, Steven L.
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 2504 - 2510