Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

被引：0

作者：

Kwon, Gihyun ^{[1
,2
]}

Jenni, Simon ^{[2
]}

Li, Dingzeyu ^{[2
]}

Lee, Joon-Young ^{[2
]}

Ye, Jong Chul ^{[1
]}

Heilbron, Fabian Caba ^{[2
]}

机构：

[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea

[2] Adobe, San Jose, CA 95110 USA

来源：

2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024 | 2024年

关键词：

D O I：

10.1109/CVPR52733.2024.00848

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While there has been significant progress in customizing text-to-image generation models, generating images that combine multiple personalized concepts remains challenging. In this work, we introduce Concept Weaver, a method for composing customized text-to-image diffusion models at inference time. Specifically, the method breaks the process into two steps: creating a template image aligned with the semantics of input prompts, and then personalizing the template using a concept fusion strategy. The fusion strategy incorporates the appearance of the target concepts into the template image while retaining its structural details. The results indicate that our method can generate multiple custom concepts with higher identity fidelity compared to alternative approaches. Furthermore, the method is shown to seamlessly handle more than two concepts and closely follow the semantic meaning of the input prompt without blending appearances across different subjects.

引用

页码：8880 / 8889

页数：10

共 50 条

[1] Multi-Concept Customization of Text-to-Image Diffusion
Kumari, Nupur
Zhang, Bingliang
Zhang, Richard
Shechtman, Eli
Zhu, Jun-Yan
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1931 - 1941
[2] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Gong, Chao
Chen, Kai
Wei, Zhipeng
Chen, Jingjing
Jiang, Yu-Gang
COMPUTER VISION - ECCV 2024, PT LIII, 2025, 15111 : 73 - 88
[3] RMP-adapter: A region-based Multiple Prompt Adapter for multi-concept customization in text-to-image diffusion model
Jiang, Zeyu
Po, Lai-Man
Xu, Xuyuan
Wang, Yexin
Wu, Haoxuan
Liu, Yuyang
Li, Kun
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
[4] Bimodal Learning for Multi-concept Image Query
Xu, HaiJiao
Pan, Peng
Lu, YanSheng
Xu, ChunYan
Chen, Deng
2014 TENTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2014, : 205 - 209
[5] Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models
Zhao, Juntu
Deng, Junyu
Ye, Yixin
Li, Chongxuan
Deng, Zhijie
Wang, Dequan
COMPUTER VISION - ECCV 2024, PT LXIX, 2025, 15127 : 318 - 333
[6] CONCEPTBED: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models
Patel, Maitreya
Gokhale, Tejas
Baral, Chitta
Yang, Yezhou
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14554 - 14562
[7] Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
Huang, Chi-Pin
Chang, Kai-Po
Tsai, Chung-Ting
Lai, Yung-Hsuan
Yang, Fu-En
Wang, Yu-Chiang Frank
COMPUTER VISION - ECCV 2024, PT XL, 2025, 15098 : 360 - 376
[8] All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models
Hong, Seunghoo
Lee, Juhun
Woo, Simon S.
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21143 - 21151
[9] Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
Huang, Chi-Pin
Chang, Kai-Po
Tsai, Chung-Ting
Lai, Yung-Hsuan
Yang, Fu-En
Wang, Yu-Chiang Frank
arXiv, 2023,
[10] Image retrieval based on multi-concept detector and semantic correlation
Xu HaiJiao
Huang ChangQin
Pan Peng
Zhao GanSen
Xu ChunYan
Lu YanSheng
Chen Deng
Wu JiYi
SCIENCE CHINA-INFORMATION SCIENCES, 2015, 58 (12) : 1 - 15

← 1 2 3 4 5 →