ToDo: Token Downsampling for Efficient Generation of High-Resolution Images

被引:0
|
作者
Smith, Ethan [1 ]
Saxena, Nayan [1 ]
Saha, Aninda [1 ]
机构
[1] Leonardo AI Res Lab, North Sydney, NSW, Australia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attention has been a crucial component in the success of image diffusion models, however, their quadratic computational complexity limits the sizes of images we can process within reasonable time and memory constraints. This paper investigates the importance of dense attention in generative image models, which often contain redundant features, making them suitable for sparser attention mechanisms. We propose a novel training-free method ToDo that relies on token downsampling of key and value tokens to accelerate Stable Diffusion inference by up to 2x for common sizes and up to 4.5x or more for high resolutions like 2048x2048. We demonstrate that our approach outperforms previous methods in balancing efficient throughput and fidelity.
引用
收藏
页码:8801 / 8804
页数:4
相关论文
共 50 条
  • [1] Quantum adversarial generation of high-resolution images
    Ma, Quangong
    Hao, Chaolong
    Si, Nianwen
    Chen, Geng
    Zhang, Jiale
    Qu, Dan
    EPJ QUANTUM TECHNOLOGY, 2025, 12 (01)
  • [2] Efficient Localization of Multitype Barcodes in High-Resolution Images
    Yi, Jinwang
    Xiao, Yuanbiao
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [3] ESOD: Efficient Small Object Detection on High-Resolution Images
    Liu, Kai
    Fu, Zhihang
    Jin, Sheng
    Chen, Ze
    Zhou, Fan
    Jiang, Rongxin
    Chen, Yaowu
    Ye, Jieping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 183 - 195
  • [4] An efficient photogrammetric stereo matching method for high-resolution images
    Li, Yingsong
    Zheng, Shunyi
    Wang, Xiaonan
    Ma, Hao
    COMPUTERS & GEOSCIENCES, 2016, 97 : 58 - 66
  • [5] Efficient and Stable Generation of High-Resolution Hair and Fur With ConvNet Using Adaptive Strand Geometry Images
    Kim, Jong-Hyun
    Lee, Jung
    IEEE ACCESS, 2023, 11 : 81101 - 81112
  • [6] Efficient Contour Generation on GPU for Multivalued High Resolution Images
    Butt, Muhammad Usman
    Morris, John
    Patel, Nitish
    Tsoi, Joseph Kit Pui
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 1375 - 1380
  • [7] High-resolution images in seconds
    不详
    BRITISH DENTAL JOURNAL, 2024, 236 (09) : 718 - 718
  • [8] METEOSAT HIGH-RESOLUTION IMAGES
    CHRISTIESON, ML
    WIRELESS WORLD, 1982, 88 (1559): : 61 - 64
  • [9] HIGH-RESOLUTION PLATED IMAGES
    ROLKER, JH
    CARSON, B
    GARLAND, T
    JOURNAL OF THE ELECTROCHEMICAL SOCIETY, 1973, 120 (08) : C237 - C237
  • [10] METEOSAT HIGH-RESOLUTION IMAGES
    CHRISTIESON, ML
    WIRELESS WORLD, 1982, 88 (1561): : 83 - 84