FloorDiffusion: Diffusion model-based conditional floorplan image generation method using parameter-efficient fine-tuning and image inpainting

被引:0
|
作者
Shim, Jonghwa [1 ]
Moon, Jaeuk [1 ]
Kim, Hyeonwoo [1 ]
Hwang, Eenjun [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul 02841, South Korea
来源
基金
新加坡国家研究基金会;
关键词
Deep learning; Conditional floorplan generation; Diffusion models; Parameter-efficient fine-tuning; Image inpainting;
D O I
10.1016/j.jobe.2024.110320
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The conditional generation of high-quality floorplan images using deep-learning methods is challenging because the generated floorplans are required to match specific conditions, such as floorplan silhouettes and spatial layouts. Recently, diffusion models have emerged as alternatives of conditional generative adversarial networks in image generation, offering higher image quality, pairing-free training datasets, and adaptability to various image domains via parameter fine-tuning of pretrained diffusion models. However, diffusion models are rarely used for floorplan generation because when fine-tuning them on image domains that were not learned in pretraining, such as floorplans, the quality of the generated images is poor and tuning takes a long time. This phenomenon arises from the so-called catastrophic forgetting problem, where traditional fine-tuning methods that update all parameters easily destroy the knowledge of pretrained diffusion models. To address this problem, we propose FloorDiffusion, a diffusion model-based conditional floorplan generation method. In this method, only a few key parameters of the pretrained diffusion model are fine-tuned, which allows adaptation to the floorplan domain while retaining its useful knowledge. Then, the fine-tuned diffusion model performs conditional floorplan generation by inpainting the unfinished regions of the input conditional image. Comparative experiments with existing methods demonstrate that our method can produce more architecturally realistic floorplan images with up to 72 % image quality improvement. It can also generate various floorplan images for a single input condition image. Finally, ablation studies show that all components of the proposed method are essential for optimal operation.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Image-Based Hot Pepper Disease and Pest Diagnosis Using Transfer Learning and Fine-Tuning
    Gu, Yeong Hyeon
    Yin, Helin
    Jin, Dong
    Park, Jong-Han
    Yoo, Seong Joon
    FRONTIERS IN PLANT SCIENCE, 2021, 12
  • [32] Segment anything model-based crack segmentation using low-rank adaption fine-tuning
    Guo, Yapeng
    Xu, Yang
    Cui, Hongtao
    Dang, Minghao
    Li, Shunlong
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2024,
  • [33] An adaptive image inpainting method based on the modified Mumford-Shah model and multiscale parameter estimation
    Thanh, D. N. H.
    Prasath, V. B. S.
    Son, N. V.
    Hieu, L. M.
    COMPUTER OPTICS, 2019, 43 (02) : 251 - 257
  • [34] Instance segmentation of mouse brain scanning electron microscopy images based on fine-tuning nature image model
    Cheng, Ao
    Zhao, Guoqiang
    Zhang, Ruobing
    Wang, Lirong
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (18): : 2836 - 2845
  • [35] S-SAM: SVD-Based Fine-Tuning of Segment Anything Model for Medical Image Segmentation
    Paranjape, Jay N.
    Sikder, Shameema
    Vedula, S. Swaroop
    Patel, Vishal M.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XII, 2024, 15012 : 720 - 730
  • [36] Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models to Learn Any Unseen Style
    Lu, Haoming
    Tunanyan, Hazarapet
    Wang, Kai
    Navasardyan, Shant
    Wang, Zhangyang
    Shi, Humphrey
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14267 - 14276
  • [37] A Model-Based Image Steganography Method Using Watson's Visual Model
    Fakhredanesh, Mohammad
    Safabakhsh, Reza
    Rahmati, Mohammad
    ETRI JOURNAL, 2014, 36 (03) : 479 - 489
  • [38] Efficient Framework for Model-Based Tomographic Image Reconstruction Using Wavelet Packets
    Rosenthal, Amir
    Jetzfellner, Thomas
    Razansky, Daniel
    Ntziachristos, Vasilis
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2012, 31 (07) : 1346 - 1357
  • [39] Diffusion model-based image generative method for quality monitoring of direct grain harvesting
    Zhang, Shuohua
    Liu, Lei
    Li, Guorun
    Du, Yuefeng
    Wu, Xiuheng
    Song, Zhenghe
    Li, Xiaoyu
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 233
  • [40] Canny Edge Detection Model in MRI Image Segmentation Using Optimized Parameter Tuning Method
    Radhakrishnan, Meera
    Panneerselvam, Anandan
    Nachimuthu, Nandhagopal
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2020, 26 (06): : 1185 - 1199