Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

Authors
Kwon, Gihyun [1, 2]
Jenni, Simon [2]
Li, Dingzeyu [2]
Lee, Joon-Young [2]
Ye, Jong Chul [1]
Heilbron, Fabian Caba [2]
Affiliations
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] Adobe, San Jose, CA 95110 USA
DOI
10.1109/CVPR52733.2024.00848
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While there has been significant progress in customizing text-to-image generation models, generating images that combine multiple personalized concepts remains challenging. In this work, we introduce Concept Weaver, a method for composing customized text-to-image diffusion models at inference time. Specifically, the method breaks the process into two steps: creating a template image aligned with the semantics of input prompts, and then personalizing the template using a concept fusion strategy. The fusion strategy incorporates the appearance of the target concepts into the template image while retaining its structural details. The results indicate that our method can generate multiple custom concepts with higher identity fidelity compared to alternative approaches. Furthermore, the method is shown to seamlessly handle more than two concepts and closely follow the semantic meaning of the input prompt without blending appearances across different subjects.
Pages: 8880-8889
Page count: 10