Using text-to-image generation for architectural design ideation

被引:21
|
作者
Paananen, Ville [1 ]
Oppenlaender, Jonas [2 ]
Visuri, Aku [1 ]
机构
[1] Univ Oulu, Ctr Ubiquitous Comp, Pentti Kaiteran Katu 1, Oulu 90014, Finland
[2] Elisa Corp, Helsinki, Finland
基金
芬兰科学院;
关键词
Architecture; design creativity; generative artificial intelligence; text-to-image generation; CREATIVITY;
D O I
10.1177/14780771231222783
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Text-to-image generation has become very popular in various domains requiring creativity. This article investigates the potential of text-to-image generators in supporting creativity during the early stages of the architectural design process. We conducted a laboratory study with 17 architecture students, who developed a concept for a culture center using three popular text-to-image generators: Midjourney, Stable Diffusion, and DALL-E. Through standardized questionnaires and group interviews, we found that image generation could be a meaningful part of the design process when design constraints are carefully considered. Generative tools support serendipitous discovery of ideas and an imaginative mindset, enriching the design process. We identified several challenges of image generators and provided considerations for software development and educators to support creativity and emphasize designers' imaginative mindset. By understanding the limitations and potential of text-to-image generators, architects and designers can leverage this technology in their design process and education, facilitating innovation and effective communication of concepts.
引用
收藏
页码:458 / 474
页数:17
相关论文
共 50 条
  • [41] Text-to-image generation combined with mutual information maximization
    Mo J.
    Xu K.
    Lin L.
    Ouyang N.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (05): : 180 - 188
  • [42] Emergent Text-to-Image Generation Using Short Neologism Prompts and Negative Prompts
    Kanada, Yasusi
    2024 NICOGRAPH INTERNATIONAL, NICOINT 2024, 2024, : 86 - 86
  • [43] EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models
    Yang, Jingyuan
    Feng, Jiawei
    Huang, Hui
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 6358 - 6368
  • [44] Automated Generation of Lung Cytological Images from Image Findings Using Text-to-Image Technology
    Teramoto, Atsushi
    Kiriyama, Yuka
    Michiba, Ayano
    Yazawa, Natsuki
    Tsukamoto, Tetsuya
    Imaizumi, Kazuyoshi
    Fujita, Hiroshi
    COMPUTERS, 2024, 13 (11)
  • [45] Background Layout Generation and Object Knowledge Transfer for Text-to-Image Generation
    Chen, Zhuowei
    Mao, Zhendong
    Fang, Shancheng
    Hu, Bo
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4327 - 4335
  • [46] GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation
    Gong, Jingzhi
    Li, Sisi
    D'Aloisio, Giordano
    Ding, Zishuo
    Ye, Yulong
    Langdon, William B.
    Sarro, Federica
    SEARCH-BASED SOFTWARE ENGINEERING, SSBSE 2024, 2024, 14767 : 70 - 76
  • [47] Attention-Bridged Modal Interaction for Text-to-Image Generation
    Tan, Hongchen
    Yin, Baocai
    Xu, Kaiqiang
    Wang, Huasheng
    Liu, Xiuping
    Li, Xin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5400 - 5413
  • [48] DR-GAN: Distribution Regularization for Text-to-Image Generation
    Tan, Hongchen
    Liu, Xiuping
    Yin, Baocai
    Li, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 10309 - 10323
  • [49] Prompt Stealing Attacks Against Text-to-Image Generation Models
    Shen, Xinyue
    Qu, Yiting
    Backes, Michael
    Zhang, Yang
    PROCEEDINGS OF THE 33RD USENIX SECURITY SYMPOSIUM, SECURITY 2024, 2024, : 5823 - 5840
  • [50] Determinant Point Process Sampling Method for Text-to-Image Generation
    Li X.
    Li G.
    Zhang E.
    Gu G.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2024, 49 (02): : 246 - 255