ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation

被引:2
|
作者
Chen, Dar-Yen [1 ]
Tennent, Hamish [1 ]
Hsu, Ching-Wen [1 ]
机构
[1] PicCollage, Taipei, Taiwan
关键词
D O I
10.1109/CVPR52733.2024.00823
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work introduces ArtAdapter, a transformative text-to-image (T2I) style transfer framework that transcends traditional limitations of color, brushstrokes, and object shape, capturing high-level style elements such as composition and distinctive artistic expression. The integration of a multi-level style encoder with our proposed explicit adaptation mechanism enables ArtAdapter to achieve unprecedented fidelity in style transfer, ensuring close alignment with textual descriptions. Additionally, the incorporation of an Auxiliary Content Adapter (ACA) effectively separates content from style, alleviating the borrowing of content from style references. Moreover, our novel fast finetuning approach could further enhance zero-shot style representation while mitigating the risk of overfitting. Comprehensive evaluations confirm that ArtAdapter surpasses current state-of-the-art methods.
引用
收藏
页码:8619 / 8628
页数:10
相关论文
共 50 条
  • [1] StyleDrop: Text-to-Image Generation in Any Style
    Sohn, Kihyuk
    Ruiz, Nataniel
    Lee, Kimin
    Chin, Daniel Castro
    Blok, Irina
    Chang, Huiwen
    Barber, Jarred
    Jiang, Lu
    Entis, Glenn
    Li, Yuanzhen
    Hao, Yuan
    Essa, Irfan
    Rubinstein, Michael
    Krishnan, Dilip
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] Generative adversarial text-to-image generation with style image constraint
    Zekang Wang
    Li Liu
    Huaxiang Zhang
    Dongmei Liu
    Yu Song
    Multimedia Systems, 2023, 29 : 3291 - 3303
  • [3] Generative adversarial text-to-image generation with style image constraint
    Wang, Zekang
    Liu, Li
    Zhang, Huaxiang
    Liu, Dongmei
    Song, Yu
    MULTIMEDIA SYSTEMS, 2023, 29 (06) : 3291 - 3303
  • [4] A Multi-Level Encoder for Text Summarization
    Liu, Junshuai
    Xin, Xin
    Li, Li
    Liu, Shaozhuang
    Ma, Xiaoyu
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 952 - 957
  • [5] Device independent layout and style editing using multi-level style sheets
    Dees, Walter
    COMPUTER-AIDED DESIGN OF USER INTERFACES V, 2007, : 183 - 190
  • [6] Interactive Multi-level Stroke Control for Neural Style Transfer
    Reimann, Max
    Buchheim, Benito
    Semmo, Amir
    Doellner, Jurgen
    Trapp, Matthias
    2021 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2021), 2021, : 1 - 8
  • [7] DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models
    Ahn, Namhyuk
    Lee, Junsoo
    Lee, Chunggi
    Kim, Kunhee
    Kim, Daesik
    Nam, Seung-Hun
    Hong, Kibeom
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 674 - 681
  • [8] Application of multi-level adaptive neural network based on optimization algorithm in image style transfer
    Li, Hong-an
    Wang, Lanye
    Liu, Jun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (29) : 73127 - 73149
  • [9] Glaze: Protecting Artists from Style Mimicry by Text-to-Image Models
    Shan, Shawn
    Cryan, Jenna
    Wenger, Emily
    Zheng, Haitao
    Hanocka, Rana
    Zhao, Ben Y.
    PROCEEDINGS OF THE 32ND USENIX SECURITY SYMPOSIUM, 2023, : 2187 - 2204
  • [10] Thangka mural style transfer based on progressive style-attentional network and multi-level loss function
    Fang, Jie
    Li, Hang
    Jia, Ying
    Ji, Liqi
    Chen, Xin
    Wang, Nianyi
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)