Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models

Cited by: 16

Authors:

Chung, Hyungjin [1,2]

Ryu, Dohoon [1]

McCann, Michael T. [2]

Klasky, Marc L. [2]

Ye, Jong Chul [1]

Affiliations:

[1] Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea

[2] Los Alamos National Laboratory, Los Alamos, NM 87545, USA

Source:

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) | 2023

Funding:

National Research Foundation of Singapore;

Keywords:

CONVOLUTIONAL NEURAL-NETWORK; COMPUTED-TOMOGRAPHY; RECONSTRUCTION; ALGORITHM;

DOI:

10.1109/CVPR52729.2023.02159

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Diffusion models have emerged as the new state-of-the-art generative models, producing high-quality samples and offering intriguing properties such as mode coverage and high flexibility. They have also been shown to be effective inverse problem solvers, acting as the prior of the distribution, while the information of the forward model can be incorporated at the sampling stage. Nonetheless, because the generative process remains in the same high-dimensional space (i.e., identical to the data dimension), the models have not been extended to 3D inverse problems due to the extremely high memory and computational cost. In this paper, we combine ideas from conventional model-based iterative reconstruction with modern diffusion models, which leads to a highly effective method for solving 3D medical image reconstruction tasks such as sparse-view tomography, limited-angle tomography, and compressed sensing MRI using pre-trained 2D diffusion models. In essence, we propose to augment the 2D diffusion prior with a model-based prior in the remaining direction at test time, such that one can achieve coherent reconstructions across all dimensions. Our method can be run on a single commodity GPU and establishes the new state of the art, showing that it can perform reconstructions of high fidelity and accuracy even in the most extreme cases (e.g., 2-view 3D tomography). We further reveal that the generalization capacity of the proposed method is surprisingly high, and that it can be used to reconstruct volumes that are entirely different from the training dataset. Code is available at https://github.com/HJ-harry/DiffusionMBIR
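
The central idea in the abstract, a 2D prior applied slice by slice plus a model-based prior along the remaining axis, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes, purely for illustration, that the model-based prior is a total-variation penalty along the slice (z) axis, which the abstract does not specify. The names `z_total_variation`, `z_tv_subgradient`, `couple_slices`, and the `denoise_slice` callback (a stand-in for one step of a pre-trained 2D diffusion model) are hypothetical, and the data-consistency step with the forward model is omitted.

```python
import numpy as np


def z_total_variation(volume):
    """Sum of absolute differences between adjacent slices (axis 0 is z)."""
    return float(np.abs(np.diff(volume, axis=0)).sum())


def z_tv_subgradient(volume):
    """One subgradient of z_total_variation with respect to the volume."""
    sign = np.sign(np.diff(volume, axis=0))  # shape (Z-1, H, W)
    grad = np.zeros_like(volume)
    grad[:-1] -= sign  # contribution of |v[k+1] - v[k]| to slice k
    grad[1:] += sign   # contribution of |v[k+1] - v[k]| to slice k+1
    return grad


def couple_slices(volume, denoise_slice, step=0.05, n_iters=10):
    """Alternate a slice-wise 2D prior step with a z-direction TV step.

    denoise_slice is a hypothetical placeholder for the 2D prior; it maps
    one (H, W) slice to one (H, W) slice.
    """
    x = np.asarray(volume, dtype=float).copy()
    for _ in range(n_iters):
        # 2D prior: each slice is processed independently.
        x = np.stack([denoise_slice(s) for s in x])
        # Assumed model-based prior: pull neighbouring slices together.
        x = x - step * z_tv_subgradient(x)
    return x
```

With an identity `denoise_slice`, the loop reduces to plain subgradient descent on the z-direction penalty, which makes the coupling between slices easy to inspect in isolation before a real 2D model is plugged in.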

Pages: 22542-22551

Number of pages: 10
