ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation

被引:2
|
作者
Chen, Dar-Yen [1 ]
Tennent, Hamish [1 ]
Hsu, Ching-Wen [1 ]
机构
[1] PicCollage, Taipei, Taiwan
关键词
D O I
10.1109/CVPR52733.2024.00823
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work introduces ArtAdapter, a transformative text-to-image (T2I) style transfer framework that transcends traditional limitations of color, brushstrokes, and object shape, capturing high-level style elements such as composition and distinctive artistic expression. The integration of a multi-level style encoder with our proposed explicit adaptation mechanism enables ArtAdapter to achieve unprecedented fidelity in style transfer, ensuring close alignment with textual descriptions. Additionally, the incorporation of an Auxiliary Content Adapter (ACA) effectively separates content from style, alleviating the borrowing of content from style references. Moreover, our novel fast finetuning approach could further enhance zero-shot style representation while mitigating the risk of overfitting. Comprehensive evaluations confirm that ArtAdapter surpasses current state-of-the-art methods.
引用
收藏
页码:8619 / 8628
页数:10
相关论文
共 50 条
  • [41] Foreground and background separated image style transfer with a single text condition
    Yu, Yue
    Wang, Jianming
    Li, Nengli
    IMAGE AND VISION COMPUTING, 2024, 143
  • [42] Bimodal Neural Style Transfer for Image Generation Based on Text Prompts
    Gutierrez, Diego
    Mendoza, Marcelo
    CULTURE AND COMPUTING, C&C 2023, 2023, 14035 : 379 - 390
  • [43] Non-parallel text style transfer with domain adaptation and an attention model
    Mingxuan Hu
    Min He
    Applied Intelligence, 2021, 51 : 4609 - 4622
  • [44] Non-parallel text style transfer with domain adaptation and an attention model
    Hu, Mingxuan
    He, Min
    APPLIED INTELLIGENCE, 2021, 51 (07) : 4609 - 4622
  • [45] Recognition-based character segmentation for multi-level writing style
    Inkeaw, Papangkorn
    Bootkrajang, Jakramate
    Charoenkwan, Phasit
    Marukatat, Sanparith
    Ho, Shinn-Ying
    Chaijaruwanich, Jeerayut
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (1-2) : 21 - 39
  • [46] Recognition-based character segmentation for multi-level writing style
    Papangkorn Inkeaw
    Jakramate Bootkrajang
    Phasit Charoenkwan
    Sanparith Marukatat
    Shinn-Ying Ho
    Jeerayut Chaijaruwanich
    International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 21 - 39
  • [47] MSCap: Multi-Style Image Captioning with Unpaired Stylized Text
    Guo, Longteng
    Liu, Jing
    Yao, Peng
    Li, Jiangwei
    Lu, Hanqing
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4199 - 4208
  • [48] Text Style Transfer based on Multi-factor Disentanglement and Mixture
    Zhu, Anna
    Yin, Zhanhui
    Iwana, Brian Kenji
    Zhou, Xinyu
    Xiong, Shengwu
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2430 - 2440
  • [49] Unsupervised Text Style Transfer using Language Models as Discriminators
    Yang, Zichao
    Hu, Zhiting
    Dyer, Chris
    Xing, Eric P.
    Berg-Kirkpatrick, Taylor
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [50] Text Style Transfer Using Partly-Shared Decoder
    Halevi, Gil
    Wadhawan, Kahini
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1563 - 1567