DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout Transformer

被引:1
|
作者
Levi, Elad [1 ]
Brosh, Eli [1 ]
Mykhailych, Mykola [1 ]
Perez, Meir [1 ]
机构
[1] Wix Com, Tel Aviv, Israel
关键词
D O I
10.1109/ICCV51070.2023.00201
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating visual layouts is an essential ingredient of graphic design. The ability to condition layout generation on a partial subset of component attributes is critical to real-world applications that involve user interaction. Recently, diffusion models have demonstrated high-quality generative performances in various domains. However, it is unclear how to apply diffusion models to the natural representation of layouts which consists of a mix of discrete ( class) and continuous (location, size) attributes. To address the conditioning layout generation problem, we introduce DLT, a joint discrete-continuous diffusion model. DLT is a transformer-based model which has a flexible conditioning mechanism that allows for conditioning on any given subset of all the layout component classes, locations, and sizes. Our method outperforms state-of-the-art generative models on various layout generation datasets with respect to different metrics and conditioning settings. Additionally, we validate the effectiveness of our proposed conditioning mechanism and the joint continuous-diffusion process. This joint process can be incorporated into a wide range of mixed discrete-continuous generative tasks. More information can be found on our project webpage: https://wix-incubator.github.io/DLT
引用
收藏
页码:2106 / 2115
页数:10
相关论文
共 40 条
  • [1] BLT: Bidirectional Layout Transformer for Controllable Layout Generation
    Kong, Xiang
    Jiang, Lu
    Chang, Huiwen
    Zhang, Han
    Hao, Yuan
    Gong, Haifeng
    Essa, Irfan
    [J]. COMPUTER VISION - ECCV 2022, PT XVII, 2022, 13677 : 474 - 490
  • [2] Geometry Aligned Variational Transformer for Image-conditioned Layout Generation
    Cao, Yunning
    Ma, Ye
    Zhou, Min
    Liu, Chuanbin
    Xie, Hongtao
    Ge, Tiezheng
    Jiang, Yuning
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1561 - 1571
  • [3] DiffPattern: Layout Pattern Generation via Discrete Diffusion
    Wang, Zixiao
    Shen, Yunheng
    Zhao, Wenqian
    Bai, Yang
    Chen, Guojin
    Farnia, Farzan
    Yu, Bei
    [J]. 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [4] LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
    Inoue, Naoto
    Kikuchi, Kotaro
    Simo-Serra, Edgar
    Otani, Mayu
    Yamaguchi, Kota
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10167 - 10176
  • [5] PLay: Parametrically Conditioned Layout Generation using Latent Diffusion
    Cheng, Chin-Yi
    Huang, Forrest
    Li, Gang
    Li, Yang
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [6] LayoutDM: Transformer-based Diffusion Model for Layout Generation
    Chai, Shang
    Zhuang, Liansheng
    Yan, Fengying
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18349 - 18358
  • [7] Self-refined variational transformer for image-conditioned layout generation
    Cao, Yunning
    Liu, Chuanbin
    Ma, Ye
    Zhou, Min
    Ge, Tiezheng
    Jiang, Yuning
    Xie, Hongtao
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [8] Variational Transformer Networks for Layout Generation
    Arroyo, Diego Martin
    Postels, Janis
    Tombari, Federico
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13637 - 13647
  • [9] LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models
    Zhang, Junyi
    Guo, Jiaqi
    Sun, Shizhao
    Lou, Jian-Guang
    Zhang, Dongmei
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7192 - 7202
  • [10] Element-conditioned GAN for graphic layout generation
    Chen, Liuqing
    Jing, Qianzhi
    Zhou, Yunzhan
    Li, Zhaoxing
    Shi, Lei
    Sun, Lingyun
    [J]. NEUROCOMPUTING, 2024, 591