Fast Inference in Denoising Diffusion Models via MMD Finetuning

被引：0

作者：

Aiello, Emanuele ^{[1
]}

Valsesia, Diego ^{[1
]}

Magli, Enrico ^{[1
]}

机构：

[1] Politecn Torino, Dept Elect & Telecommun, I-10129 Turin, Italy

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Training; Diffusion models; Noise reduction; Diffusion processes; Computational modeling; Extraterrestrial measurements; Image synthesis; Denoising diffusion models; fast inference; image generation; MMD;

D O I：

10.1109/ACCESS.2024.3436698

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Denoising Diffusion Models (DDMs) have become a popular tool for generating high-quality samples from complex data distributions. These models are able to capture sophisticated patterns and structures in the data, and can generate samples that are highly diverse and representative of the underlying distribution. However, one of the main limitations of diffusion models is the complexity of sample generation, since a large number of inference timesteps is required to faithfully capture the data distribution. In this paper, we present MMD-DDM, a novel method for fast sampling of diffusion models. Our approach is based on the idea of using the Maximum Mean Discrepancy (MMD) to finetune the learned distribution with a given budget of timesteps. This allows the finetuned model to significantly improve the speed-quality trade-off, by substantially increasing fidelity in inference regimes with few steps or, equivalently, by reducing the required number of steps to reach a target fidelity, thus paving the way for a more practical adoption of diffusion models in a wide range of applications. We evaluate our approach on unconditional image generation with extensive experiments across the CIFAR-10, CelebA, ImageNet and LSUN-Church datasets. Our findings show that the proposed method is able to produce high-quality samples in a fraction of the time required by widely-used diffusion models, and outperforms state-of-the-art techniques for accelerated sampling. Code will be available at: https://github.com/diegovalsesia/MMD-DDM.

引用

页码：106912 / 106923

页数：12

共 50 条

[31] Accelerating SPECT Imaging for Dosimetry via Projection Interpolation using Denoising Diffusion Probabilistic Models
Toosi, Amirhosein
Kurkowska, Sara
Polson, Luke
Colpo, Nadine
Dellar, Conor
Parulekar, Wendy
Saad, Fred
Chi, Kim
Benard, Francois
Fernandez, Pedro Esquinas
Rahmim, Arman
Uribe, Carlos
JOURNAL OF NUCLEAR MEDICINE, 2024, 65
[32] Seismic Data Interpolation via Denoising Diffusion Implicit Models With Coherence-Corrected Resampling
Wei, Xiaoli
Zhang, Chunxia
Wang, Hongtao
Tan, Chengli
Xiong, Deng
Jiang, Baisong
Zhang, Jiangshe
Kim, Sang-Woon
IEEE Transactions on Geoscience and Remote Sensing, 2024, 62
[33] Semi-Supervised CT Denoising via Text-Guided Mamba Diffusion Models
Su, Bo
Hu, Xiangyun
Xu, Jiabo
Deng, Kai
Zha, Yunfei
Wan, Jun
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
[34] Seismic Data Interpolation via Denoising Diffusion Implicit Models With Coherence-Corrected Resampling
Wei, Xiaoli
Zhang, Chunxia
Wang, Hongtao
Tan, Chengli
Xiong, Deng
Jiang, Baisong
Zhang, Jiangshe
Kim, Sang-Woon
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[35] SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC
Luo, Jinglong
Zhang, Yehong
Zhang, Zhuo
Zhang, Jiaqi
Mu, Xin
Wang, Hui
Yu, Yue
Xu, Zenglin
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13333 - 13348
[36] Fast Approximate Inference for Arbitrarily Large Semiparametric Regression Models via Message Passing
Wand, M. P.
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) : 137 - 156
[37] A FAST EXPLICIT DIFFUSION ALGORITHM OF FRACTIONAL ORDER ANISOTROPIC DIFFUSION FOR IMAGE DENOISING
Zhang, Zhiguang
Liu, Qiang
Gao, Tianling
INVERSE PROBLEMS AND IMAGING, 2021, 15 (06) : 1451 - 1469
[38] A denoising approach via wavelet domain diffusion and image domain diffusion
Xiaobo Zhang
Multimedia Tools and Applications, 2017, 76 : 13545 - 13561
[39] A denoising approach via wavelet domain diffusion and image domain diffusion
Zhang, Xiaobo
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (11) : 13545 - 13561
[40] Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions
Li, Jiahuan
Zhou, Hao
Huang, Shujian
Cheng, Shanbo
Chen, Jiajun
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 576 - 592

← 1 2 3 4 5 →