Wavelet Diffusion Models are fast and scalable Image Generators

Cited by: 15
Authors
Phung, Hao [1]
Dao, Quan [1]
Tran, Anh [1]
Affiliations
[1] VinAI Res, Hanoi, Vietnam
DOI
10.1109/CVPR52729.2023.00983
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Diffusion models are emerging as a powerful solution for high-fidelity image generation, exceeding GANs in quality in many circumstances. However, their slow training and inference speeds are a major bottleneck that prevents their use in real-time applications. A recent DiffusionGAN method significantly decreases the models' running time by reducing the number of sampling steps from thousands to several, but its speed still lags largely behind that of its GAN counterparts. This paper aims to reduce the speed gap by proposing a novel wavelet-based diffusion scheme. We extract low- and high-frequency components at both the image and feature levels via wavelet decomposition and adaptively handle these components for faster processing while maintaining good generation quality. Furthermore, we propose to use a reconstruction term, which effectively boosts the model training convergence. Experimental results on the CelebA-HQ, CIFAR-10, LSUN-Church, and STL-10 datasets show that our solution is a stepping-stone toward real-time and high-fidelity diffusion models. Our code and pre-trained checkpoints are available at https://github.com/VinAIResearch/WaveDiff.git.
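As a rough illustration of the wavelet decomposition mentioned in the abstract, the sketch below applies a single-level 2D Haar transform to an image with NumPy. It is not taken from the authors' code: the choice of the Haar wavelet, the function names `haar_dwt2` and `haar_idwt2`, and the subband labels are assumptions made for this example. It only shows how one image splits into one low-frequency and three high-frequency subbands at half the spatial resolution, and that the split is invertible.

```python
import numpy as np


def haar_dwt2(x):
    """Single-level 2D Haar transform of an (H, W) array with even H and W.

    Returns four (H/2, W/2) subbands. Naming conventions for the two mixed
    subbands differ between libraries; the labels here are illustrative.
    """
    a = x[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # low-frequency approximation
    lh = (a - b + c - d) / 2.0  # differences across columns
    hl = (a + b - c - d) / 2.0  # differences across rows
    hh = (a - b - c + d) / 2.0  # diagonal differences
    return ll, lh, hl, hh


def haar_idwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2: rebuild the (2h, 2w) array from its subbands."""
    h, w = ll.shape
    x = np.empty((2 * h, 2 * w), dtype=ll.dtype)
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    x[0::2, 1::2] = (ll - lh + hl - hh) / 2.0
    x[1::2, 0::2] = (ll + lh - hl - hh) / 2.0
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return x


# Example: a random 64x64 "image" becomes four 32x32 subbands.
rng = np.random.default_rng(0)
image = rng.random((64, 64))
subbands = np.stack(haar_dwt2(image))  # shape (4, 32, 32)
restored = haar_idwt2(*subbands)       # matches `image` up to rounding
```

Stacking the four subbands as channels gives a tensor with a quarter of the spatial positions of the input, which is one plausible way a model could process less spatial data per step; how the paper's model actually consumes the subbands is not specified in this record.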
Pages: 10199-10208
Page count: 10
Related papers
50 in total
  • [21] Fast multiresolution image operations in the wavelet domain
    Drori, I
    Lischinski, D
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2003, 9 (03) : 395 - 411
  • [22] Fast wavelet histogram techniques for image indexing
    Mandal, MK
    Aboulnasr, T
    Panchanathan, S
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 1999, 75 (1-2) : 99 - 110
  • [23] Fast wavelet histogram techniques for image indexing
    Mandal, MK
    Aboulnasr, T
    Panchanathan, S
    [J]. IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES - PROCEEDINGS, 1998, : 68 - 72
  • [24] Fast wavelet transform for color image compression
    Sun, YL
    Bow, ST
    [J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL II, 1996, : 541 - 544
  • [25] Region-based and scalable image compression by wavelet localization
    Lee, MS
    [J]. SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 2, PROCEEDINGS, 2003, : 451 - 454
  • [26] REGULARITY SCALABLE IMAGE CODING BASED ON WAVELET SINGULARITY DETECTION
    Ho, Charlotte Yuk-Fan
    Hsung, Tai-Chiu
    Lun, Daniel Pak-Kong
    Ling, Bingo Wing-Kuen
    Tam, Peter Kwong-Shun
    Siu, Wan-Chi
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2008, 8 (01) : 109 - 134
  • [27] A Multiresolution Robust Watermarking Approach for Scalable Wavelet Image Compression
    Danyali, Habibollah
    Amiri, Mehran Deljavan
    [J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2008, 5259 : 57 - 66
  • [28] Efficient, highly scalable wavelet image coding for network applications
    Gan, Tao
    Zhou, Nan
    Zhu, Weile
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 493 - +
  • [29] Efficient image generation with Contour Wavelet Diffusion
    Zhang, Dimeng
    Li, JiaYao
    Chen, Zilong
    Zou, Yuntao
    [J]. Computers and Graphics (Pergamon), 2024, 124
  • [30] Scalable image coding using reversible integer wavelet transforms
    Bilgin, A
    Sementilli, PJ
    Sheng, F
    Marcellin, MW
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (11) : 1972 - 1977