Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

被引：0

作者：

Kwon, Gihyun ^{[1
,2
]}

Jenni, Simon ^{[2
]}

Li, Dingzeyu ^{[2
]}

Lee, Joon-Young ^{[2
]}

Ye, Jong Chul ^{[1
]}

Heilbron, Fabian Caba ^{[2
]}

机构：

[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea

[2] Adobe, San Jose, CA 95110 USA

来源：

2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024 | 2024年

关键词：

D O I：

10.1109/CVPR52733.2024.00848

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While there has been significant progress in customizing text-to-image generation models, generating images that combine multiple personalized concepts remains challenging. In this work, we introduce Concept Weaver, a method for composing customized text-to-image diffusion models at inference time. Specifically, the method breaks the process into two steps: creating a template image aligned with the semantics of input prompts, and then personalizing the template using a concept fusion strategy. The fusion strategy incorporates the appearance of the target concepts into the template image while retaining its structural details. The results indicate that our method can generate multiple custom concepts with higher identity fidelity compared to alternative approaches. Furthermore, the method is shown to seamlessly handle more than two concepts and closely follow the semantic meaning of the input prompt without blending appearances across different subjects.

引用

页码：8880 / 8889

页数：10

共 50 条

[21] Hybrid algorithm for multi-concept acquisition and its application
Chen, Zhaoqian
Liu, Hong
Zhou, Rong
Chen, Shifu
Jisuanji Xuebao/Chinese Journal of Computers, 1996, 19 (10): : 753 - 761
[22] Holistic Evaluation of Text-to-Image Models
Lee, Tony
Yasunaga, Michihiro
Meng, Chenlin
Mai, Yifan
Park, Joon Sung
Gupta, Agrim
Zhang, Yunzhi
Narayanan, Deepak
Teufel, Hannah Benita
Bellagente, Marco
Kang, Minguk
Park, Taesung
Leskovec, Jure
Zhu, Jun-Yan
Li Fei-Fei
Wu, Jiajun
Ermon, Stefano
Liang, Percy
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[23] Debiasing Text-to-Image Diffusion Models
He, Ruifei
Xue, Chuhui
Tan, Haoru
Zhang, Wenqing
Yu, Yingchen
Bai, Song
Qi, Xiaojuan
PROCEEDINGS OF THE 1ST ACM MULTIMEDIA WORKSHOP ON MULTI-MODAL MISINFORMATION GOVERNANCE IN THE ERA OF FOUNDATION MODELS, MIS 2024, 2024, : 29 - 36
[24] MULTI-CONCEPT LEARNING WITH LARGE-SCALE MULTIMEDIA LEXICONS
Xie, Lexing
Yan, Rong
Yang, Jun
2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 2148 - 2151
[25] Multi-Concept Representation Learning for Knowledge Graph Completion
Wang, Jiapu
Wang, Boyue
Gao, Junbin
Hu, Yongli
Yin, Baocai
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (01)
[26] Data complexity measures for classification of a multi-concept dataset
Sowkarthika B.
Gyanchandani M.
Wadhvani R.
Shukla S.
Multimedia Tools and Applications, 2025, 84 (2) : 571 - 602
[27] Multi-Semantic Fusion Generative Adversarial Network for Text-to-Image Generation
Huang, Pingda
Liu, Yedan
Fu, Chunjiang
Zhao, Liang
2023 IEEE 8TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS, ICBDA, 2023, : 159 - 164
[28] Semantic Multi-concept Annotation for Tabular Data in Financial Documents
Nararatwong, Rungsiman
Shi, Yuting
Kertkeidkachorn, Natthawut
Ichise, Ryutaro
NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT I, NLDB 2024, 2024, 14762 : 514 - 529
[29] Decoupling Control in Text-to-Image Diffusion Models
Cao, Shitong
Zhang, Xuejie
Wang, Jin
Zhou, Xiaobing
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VII, ICIC 2024, 2024, 14868 : 312 - 322
[30] A Method of General Multi-Concept Learning Based on Cognitive Model
Zhu, Shisong
Wang, Yunjia
Huang, Xiaobo
SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING: THEORY AND PRACTICE, VOL 1, 2012, 114 : 339 - +

← 1 2 3 4 5 →