ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation

被引:2
|
作者
Chen, Dar-Yen [1 ]
Tennent, Hamish [1 ]
Hsu, Ching-Wen [1 ]
机构
[1] PicCollage, Taipei, Taiwan
关键词
D O I
10.1109/CVPR52733.2024.00823
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work introduces ArtAdapter, a transformative text-to-image (T2I) style transfer framework that transcends traditional limitations of color, brushstrokes, and object shape, capturing high-level style elements such as composition and distinctive artistic expression. The integration of a multi-level style encoder with our proposed explicit adaptation mechanism enables ArtAdapter to achieve unprecedented fidelity in style transfer, ensuring close alignment with textual descriptions. Additionally, the incorporation of an Auxiliary Content Adapter (ACA) effectively separates content from style, alleviating the borrowing of content from style references. Moreover, our novel fast finetuning approach could further enhance zero-shot style representation while mitigating the risk of overfitting. Comprehensive evaluations confirm that ArtAdapter surpasses current state-of-the-art methods.
引用
收藏
页码:8619 / 8628
页数:10
相关论文
共 50 条
  • [21] Story-level Text Style Transfer: A Proposal
    Qian, Yusu
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): STUDENT RESEARCH WORKSHOP, 2020, : 8 - 12
  • [22] SE-DAE: Style-Enhanced Denoising Auto-Encoder for Unsupervised Text Style Transfer
    Li, Jicheng
    Feng, Yang
    Ou, Jiao
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [23] Workspaces: A multi-level architectural style for synchronous groupware
    Phillips, WG
    Graham, TCN
    INTERACTIVE SYSTEMS: DESIGN, SPECIFICATION, AND VERIFICATION, 2003, 2844 : 92 - 106
  • [24] Multi-level network based on transformer encoder for fine-grained image–text matching
    Lei Yang
    Yong Feng
    Mingliang Zhou
    Xiancai Xiong
    Yongheng Wang
    Baohua Qiang
    Multimedia Systems, 2023, 29 : 1981 - 1994
  • [25] Mask-based Style-Controlled Image Synthesis Using a Mask Style Encoder
    Cho, Jaehyeong
    Shimoda, Wataru
    Yanai, Keiji
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5176 - 5183
  • [26] Latent Style: multi-style image transfer via latent style coding and skip connection
    Hu, Jingfei
    Wu, Guang
    Wang, Hua
    Zhang, Jicong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (02) : 359 - 368
  • [27] Latent Style: multi-style image transfer via latent style coding and skip connection
    Jingfei Hu
    Guang Wu
    Hua Wang
    Jicong Zhang
    Signal, Image and Video Processing, 2022, 16 : 359 - 368
  • [28] Multi-style image transfer system using conditional cycleGAN
    Tu, Ching-Ting
    Lin, Hwei Jen
    Tsia, Yihjia
    IMAGING SCIENCE JOURNAL, 2021, 69 (1-4): : 1 - 14
  • [29] Token-level disentanglement for unsupervised text style transfer
    Hu, Yahao
    Tao, Wei
    Xie, Yifei
    Sun, Yi
    Pan, Zhisong
    NEUROCOMPUTING, 2023, 560
  • [30] Evolutionary multi-level network synthesis in given design style
    Luba, T
    Moraga, C
    Yanushkevich, S
    Opoka, M
    Shmerko, V
    30TH IEEE INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC, PROCEEDINGS, 2000, : 253 - 258