Fast Inference in Denoising Diffusion Models via MMD Finetuning

被引:0
|
作者
Aiello, Emanuele [1 ]
Valsesia, Diego [1 ]
Magli, Enrico [1 ]
机构
[1] Politecn Torino, Dept Elect & Telecommun, I-10129 Turin, Italy
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Training; Diffusion models; Noise reduction; Diffusion processes; Computational modeling; Extraterrestrial measurements; Image synthesis; Denoising diffusion models; fast inference; image generation; MMD;
D O I
10.1109/ACCESS.2024.3436698
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Denoising Diffusion Models (DDMs) have become a popular tool for generating high-quality samples from complex data distributions. These models are able to capture sophisticated patterns and structures in the data, and can generate samples that are highly diverse and representative of the underlying distribution. However, one of the main limitations of diffusion models is the complexity of sample generation, since a large number of inference timesteps is required to faithfully capture the data distribution. In this paper, we present MMD-DDM, a novel method for fast sampling of diffusion models. Our approach is based on the idea of using the Maximum Mean Discrepancy (MMD) to finetune the learned distribution with a given budget of timesteps. This allows the finetuned model to significantly improve the speed-quality trade-off, by substantially increasing fidelity in inference regimes with few steps or, equivalently, by reducing the required number of steps to reach a target fidelity, thus paving the way for a more practical adoption of diffusion models in a wide range of applications. We evaluate our approach on unconditional image generation with extensive experiments across the CIFAR-10, CelebA, ImageNet and LSUN-Church datasets. Our findings show that the proposed method is able to produce high-quality samples in a fraction of the time required by widely-used diffusion models, and outperforms state-of-the-art techniques for accelerated sampling. Code will be available at: https://github.com/diegovalsesia/MMD-DDM.
引用
收藏
页码:106912 / 106923
页数:12
相关论文
共 50 条
  • [31] Accelerating SPECT Imaging for Dosimetry via Projection Interpolation using Denoising Diffusion Probabilistic Models
    Toosi, Amirhosein
    Kurkowska, Sara
    Polson, Luke
    Colpo, Nadine
    Dellar, Conor
    Parulekar, Wendy
    Saad, Fred
    Chi, Kim
    Benard, Francois
    Fernandez, Pedro Esquinas
    Rahmim, Arman
    Uribe, Carlos
    JOURNAL OF NUCLEAR MEDICINE, 2024, 65
  • [32] Seismic Data Interpolation via Denoising Diffusion Implicit Models With Coherence-Corrected Resampling
    Wei, Xiaoli
    Zhang, Chunxia
    Wang, Hongtao
    Tan, Chengli
    Xiong, Deng
    Jiang, Baisong
    Zhang, Jiangshe
    Kim, Sang-Woon
    IEEE Transactions on Geoscience and Remote Sensing, 2024, 62
  • [33] Semi-Supervised CT Denoising via Text-Guided Mamba Diffusion Models
    Su, Bo
    Hu, Xiangyun
    Xu, Jiabo
    Deng, Kai
    Zha, Yunfei
    Wan, Jun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [34] Seismic Data Interpolation via Denoising Diffusion Implicit Models With Coherence-Corrected Resampling
    Wei, Xiaoli
    Zhang, Chunxia
    Wang, Hongtao
    Tan, Chengli
    Xiong, Deng
    Jiang, Baisong
    Zhang, Jiangshe
    Kim, Sang-Woon
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [35] SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC
    Luo, Jinglong
    Zhang, Yehong
    Zhang, Zhuo
    Zhang, Jiaqi
    Mu, Xin
    Wang, Hui
    Yu, Yue
    Xu, Zenglin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13333 - 13348
  • [36] Fast Approximate Inference for Arbitrarily Large Semiparametric Regression Models via Message Passing
    Wand, M. P.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) : 137 - 156
  • [37] A FAST EXPLICIT DIFFUSION ALGORITHM OF FRACTIONAL ORDER ANISOTROPIC DIFFUSION FOR IMAGE DENOISING
    Zhang, Zhiguang
    Liu, Qiang
    Gao, Tianling
    INVERSE PROBLEMS AND IMAGING, 2021, 15 (06) : 1451 - 1469
  • [38] A denoising approach via wavelet domain diffusion and image domain diffusion
    Xiaobo Zhang
    Multimedia Tools and Applications, 2017, 76 : 13545 - 13561
  • [39] A denoising approach via wavelet domain diffusion and image domain diffusion
    Zhang, Xiaobo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (11) : 13545 - 13561
  • [40] Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions
    Li, Jiahuan
    Zhou, Hao
    Huang, Shujian
    Cheng, Shanbo
    Chen, Jiajun
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 576 - 592