Using text-to-image generation for architectural design ideation

被引：21

作者：

Paananen, Ville ^{[1
]}

Oppenlaender, Jonas ^{[2
]}

Visuri, Aku ^{[1
]}

机构：

[1] Univ Oulu, Ctr Ubiquitous Comp, Pentti Kaiteran Katu 1, Oulu 90014, Finland

[2] Elisa Corp, Helsinki, Finland

来源：

INTERNATIONAL JOURNAL OF ARCHITECTURAL COMPUTING | 2024年 / 22卷 / 03期

基金：

芬兰科学院;

关键词：

Architecture; design creativity; generative artificial intelligence; text-to-image generation; CREATIVITY;

D O I：

10.1177/14780771231222783

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Text-to-image generation has become very popular in various domains requiring creativity. This article investigates the potential of text-to-image generators in supporting creativity during the early stages of the architectural design process. We conducted a laboratory study with 17 architecture students, who developed a concept for a culture center using three popular text-to-image generators: Midjourney, Stable Diffusion, and DALL-E. Through standardized questionnaires and group interviews, we found that image generation could be a meaningful part of the design process when design constraints are carefully considered. Generative tools support serendipitous discovery of ideas and an imaginative mindset, enriching the design process. We identified several challenges of image generators and provided considerations for software development and educators to support creativity and emphasize designers' imaginative mindset. By understanding the limitations and potential of text-to-image generators, architects and designers can leverage this technology in their design process and education, facilitating innovation and effective communication of concepts.

引用

页码：458 / 474

页数：17

共 50 条

[41] Text-to-image generation combined with mutual information maximization
Mo J.
Xu K.
Lin L.
Ouyang N.
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (05): : 180 - 188
[42] Emergent Text-to-Image Generation Using Short Neologism Prompts and Negative Prompts
Kanada, Yasusi
2024 NICOGRAPH INTERNATIONAL, NICOINT 2024, 2024, : 86 - 86
[43] EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models
Yang, Jingyuan
Feng, Jiawei
Huang, Hui
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 6358 - 6368
[44] Automated Generation of Lung Cytological Images from Image Findings Using Text-to-Image Technology
Teramoto, Atsushi
Kiriyama, Yuka
Michiba, Ayano
Yazawa, Natsuki
Tsukamoto, Tetsuya
Imaizumi, Kazuyoshi
Fujita, Hiroshi
COMPUTERS, 2024, 13 (11)
[45] Background Layout Generation and Object Knowledge Transfer for Text-to-Image Generation
Chen, Zhuowei
Mao, Zhendong
Fang, Shancheng
Hu, Bo
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4327 - 4335
[46] GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation
Gong, Jingzhi
Li, Sisi
D'Aloisio, Giordano
Ding, Zishuo
Ye, Yulong
Langdon, William B.
Sarro, Federica
SEARCH-BASED SOFTWARE ENGINEERING, SSBSE 2024, 2024, 14767 : 70 - 76
[47] Attention-Bridged Modal Interaction for Text-to-Image Generation
Tan, Hongchen
Yin, Baocai
Xu, Kaiqiang
Wang, Huasheng
Liu, Xiuping
Li, Xin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5400 - 5413
[48] DR-GAN: Distribution Regularization for Text-to-Image Generation
Tan, Hongchen
Liu, Xiuping
Yin, Baocai
Li, Xin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 10309 - 10323
[49] Prompt Stealing Attacks Against Text-to-Image Generation Models
Shen, Xinyue
Qu, Yiting
Backes, Michael
Zhang, Yang
PROCEEDINGS OF THE 33RD USENIX SECURITY SYMPOSIUM, SECURITY 2024, 2024, : 5823 - 5840
[50] Determinant Point Process Sampling Method for Text-to-Image Generation
Li X.
Li G.
Zhang E.
Gu G.
Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2024, 49 (02): : 246 - 255

← 1 2 3 4 5 →