FeaST: Feature-guided Style Transfer for high-fidelity art synthesis

被引:0
|
作者
Png, Feature-guided Style Wen Hao [1 ]
Aun, Yichiet [1 ]
Gan, Ming Lee [1 ]
机构
[1] Univ Tunku Abdul Rahman UTAR, Fac Informat & Commun Technol, Kampar 31900, Malaysia
来源
COMPUTERS & GRAPHICS-UK | 2024年 / 122卷
关键词
Image style transfer; Img2Img synthesis; Generative art;
D O I
10.1016/j.cag.2024.103975
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Text-conditioned image synthesis methods such as DALLE -2, IMAGEN, and Stable Diffusion are gaining strong attention from deep learning and art communities recently. Meanwhile, Image -to -Image (Img2Img) synthesis applications that emerged from the pioneering Neural Style Transfer (NST) approach have swiftly transitioned towards the feed-forward Automatic Style Transfer (AST) methods, due to numerous constraints inherent in the former method, including inconsistent synthesis outcomes and sluggish optimization-based synthesis process. However, NST holds significant potential yet remains relatively underexplored within this research domain. In this paper, we revisited the original NST method and uncovered its potential to attain image quality comparable to the AST synthesis methods across a diverse range of artistic styles. We propose a two-stage Feature-guided Style Transfer (FeaST) which consists (a) pre-stylization step called Sketching to address the poor initialization issue, and (b) Finetuning to guide the synthesis process based on high -frequency (HF) and low-frequency (LF) guidance channels. By addressing the issues of inconsistent synthesis and slow convergence inherent in the original method, FeaST unlocks the full capabilities of NST and significantly enhances its efficiency.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] High-Fidelity Guided Image Synthesis with Latent Diffusion Models
    Singh, Jaskirat
    Gould, Stephen
    Zheng, Liang
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5997 - 6006
  • [2] Weakly-Supervised High-Fidelity Ultrasound Video Synthesis with Feature Decoupling
    Liang, Jiamin
    Yang, Xin
    Huang, Yuhao
    Liu, Kai
    Zhou, Xinrui
    Hu, Xindi
    Lin, Zehui
    Luo, Huanjia
    Zhang, Yuanji
    Xiong, Yi
    Ni, Dong
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV, 2022, 13434 : 310 - 319
  • [3] A high-fidelity face swapping algorithm based on mutual information-guided feature decoupling
    Xiao, Song
    Liu, ZhiGuo
    Gao, Jian
    Wang, ChangXin
    [J]. VISUAL COMPUTER, 2024, 40 (12): : 8957 - 8972
  • [4] FaceRefiner: High-Fidelity Facial Texture Refinement With Differentiable Rendering-Based Style Transfer
    Li, Chengyang
    Cheng, Baoping
    Cheng, Yao
    Zhang, Haocheng
    Liu, Renshuai
    Zheng, Yinglin
    Liao, Jing
    Cheng, Xuan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7225 - 7236
  • [5] Controllable High-fidelity Facial Performance Transfer
    Xu, Feng
    Chai, Jinxiang
    Liu, Yilong
    Tong, Xin
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (04):
  • [6] State of the art in high-fidelity simulation models for ultrasound-guided regional anaesthesia media
    Willers, J.
    Goosen, L.
    Bisht, L.
    Hariharan, S.
    Mohamed, S.
    [J]. ANAESTHESIA, 2018, 73 : 101 - 101
  • [7] Sketch-Guided Latent Diffusion Model for High-Fidelity Face Image Synthesis
    Peng, Yichen
    Zhao, Chunqi
    Xie, Haoran
    Fukusato, Tsukasa
    Miyata, Kazunori
    [J]. IEEE ACCESS, 2024, 12 : 5770 - 5780
  • [8] High-Fidelity Dynamic Human Synthesis via UV-Guided NeRF with Sparse Views
    Xie, Zhifeng
    Wang, Zhaosheng
    Wang, Sen
    Sun, Yuzhou
    Ma, Lizhuang
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2022, 2022, 13443 : 357 - 368
  • [9] DISENTANGLED FEATURE-GUIDED MULTI-EXPOSURE HIGH DYNAMIC RANGE IMAGING
    Lee, Keuntek
    Jang, Yeong Il
    Cho, Nam Ik
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2465 - 2469
  • [10] Multi-SpectroGAN: High-Diversity and High-Fidelity Spectrogram Generation with Adversarial Style Combination for Speech Synthesis
    Lee, Sang-Hoon
    Yoon, Hyun-Wook
    Noh, Hyeong-Rae
    Kim, Ji-Hoon
    Lee, Seong-Whan
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13198 - 13206