Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

被引:0
|
作者
Kwon, Gihyun [1 ,2 ]
Jenni, Simon [2 ]
Li, Dingzeyu [2 ]
Lee, Joon-Young [2 ]
Ye, Jong Chul [1 ]
Heilbron, Fabian Caba [2 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] Adobe, San Jose, CA 95110 USA
关键词
D O I
10.1109/CVPR52733.2024.00848
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While there has been significant progress in customizing text-to-image generation models, generating images that combine multiple personalized concepts remains challenging. In this work, we introduce Concept Weaver, a method for composing customized text-to-image diffusion models at inference time. Specifically, the method breaks the process into two steps: creating a template image aligned with the semantics of input prompts, and then personalizing the template using a concept fusion strategy. The fusion strategy incorporates the appearance of the target concepts into the template image while retaining its structural details. The results indicate that our method can generate multiple custom concepts with higher identity fidelity compared to alternative approaches. Furthermore, the method is shown to seamlessly handle more than two concepts and closely follow the semantic meaning of the input prompt without blending appearances across different subjects.
引用
收藏
页码:8880 / 8889
页数:10
相关论文
共 50 条
  • [1] Multi-Concept Customization of Text-to-Image Diffusion
    Kumari, Nupur
    Zhang, Bingliang
    Zhang, Richard
    Shechtman, Eli
    Zhu, Jun-Yan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1931 - 1941
  • [2] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
    Gong, Chao
    Chen, Kai
    Wei, Zhipeng
    Chen, Jingjing
    Jiang, Yu-Gang
    COMPUTER VISION - ECCV 2024, PT LIII, 2025, 15111 : 73 - 88
  • [3] RMP-adapter: A region-based Multiple Prompt Adapter for multi-concept customization in text-to-image diffusion model
    Jiang, Zeyu
    Po, Lai-Man
    Xu, Xuyuan
    Wang, Yexin
    Wu, Haoxuan
    Liu, Yuyang
    Li, Kun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
  • [4] Bimodal Learning for Multi-concept Image Query
    Xu, HaiJiao
    Pan, Peng
    Lu, YanSheng
    Xu, ChunYan
    Chen, Deng
    2014 TENTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2014, : 205 - 209
  • [5] Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models
    Zhao, Juntu
    Deng, Junyu
    Ye, Yixin
    Li, Chongxuan
    Deng, Zhijie
    Wang, Dequan
    COMPUTER VISION - ECCV 2024, PT LXIX, 2025, 15127 : 318 - 333
  • [6] CONCEPTBED: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models
    Patel, Maitreya
    Gokhale, Tejas
    Baral, Chitta
    Yang, Yezhou
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14554 - 14562
  • [7] Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
    Huang, Chi-Pin
    Chang, Kai-Po
    Tsai, Chung-Ting
    Lai, Yung-Hsuan
    Yang, Fu-En
    Wang, Yu-Chiang Frank
    COMPUTER VISION - ECCV 2024, PT XL, 2025, 15098 : 360 - 376
  • [8] All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models
    Hong, Seunghoo
    Lee, Juhun
    Woo, Simon S.
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21143 - 21151
  • [9] Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
    Huang, Chi-Pin
    Chang, Kai-Po
    Tsai, Chung-Ting
    Lai, Yung-Hsuan
    Yang, Fu-En
    Wang, Yu-Chiang Frank
    arXiv, 2023,
  • [10] Image retrieval based on multi-concept detector and semantic correlation
    Xu HaiJiao
    Huang ChangQin
    Pan Peng
    Zhao GanSen
    Xu ChunYan
    Lu YanSheng
    Chen Deng
    Wu JiYi
    SCIENCE CHINA-INFORMATION SCIENCES, 2015, 58 (12) : 1 - 15