ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation

Cited by: 2
Authors
Chen, Dar-Yen [1]
Tennent, Hamish [1]
Hsu, Ching-Wen [1]
Affiliations
[1] PicCollage, Taipei, Taiwan
DOI
10.1109/CVPR52733.2024.00823
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work introduces ArtAdapter, a transformative text-to-image (T2I) style transfer framework that transcends traditional limitations of color, brushstrokes, and object shape, capturing high-level style elements such as composition and distinctive artistic expression. The integration of a multi-level style encoder with our proposed explicit adaptation mechanism enables ArtAdapter to achieve unprecedented fidelity in style transfer, ensuring close alignment with textual descriptions. Additionally, the incorporation of an Auxiliary Content Adapter (ACA) effectively separates content from style, alleviating the borrowing of content from style references. Moreover, our novel fast finetuning approach could further enhance zero-shot style representation while mitigating the risk of overfitting. Comprehensive evaluations confirm that ArtAdapter surpasses current state-of-the-art methods.
Pages: 8619-8628
Page count: 10