Controllable Sketch-to-Image Translation for Robust Face Synthesis

被引:12
|
作者
Yang, Shuai [1 ]
Wang, Zhangyang [2 ]
Liu, Jiaying [1 ]
Guo, Zongming [1 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100080, Peoples R China
[2] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
基金
中国国家自然科学基金;
关键词
Image edge detection; Adaptation models; Controllability; Facial features; Data models; Painting; Training; Face synthesis; sketch-to-image translation; user control; image editing; COMPLETION;
D O I
10.1109/TIP.2021.3120669
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel controllable sketch-to-image translation framework that allows users to interactively and robustly synthesize and edit face images with hand-drawn sketches. Inspired by the coarse-to-fine painting process of human artists, we propose a novel dilation-based sketch refinement method to refine sketches at varied coarse levels without the need for real sketch training data. We further investigate multi-level refinement that enables users to flexibly define how "reliable" the input sketch should be considered for the final output through a refinement level control parameter, which helps balance between the realism of the output and its structural consistency with the input sketch. It is realized by leveraging scale-aware style transfer to model and adjust the style features of sketches at different coarse levels. Moreover, advanced user controllability in terms of the editing region control, facial attribute editing, and spatially non-uniform refinement is further explored for fine-grained and semantic editing. We demonstrate the effectiveness of the proposed method in terms of visual quality and user controllability through extensive experiments including qualitative and quantitative comparison with state-of-the-art methods, ablation studies and various applications.
引用
收藏
页码:8797 / 8810
页数:14
相关论文
共 50 条
  • [1] Unsupervised Sketch-to-Image Translation Network for Single Category Image Synthesis
    Deng, Wanyu
    Feng, Xiaoting
    Li, Qirui
    Xu, Huijiao
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6616 - 6621
  • [2] Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation
    Ghosh, Arnab
    Zhang, Richard
    Dokania, Puneet K.
    Wang, Oliver
    Efros, Alexei A.
    Torr, Philip H. S.
    Shechtman, Eli
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1171 - 1180
  • [3] Multi-Density Sketch-to-Image Translation Network
    Huang, Jialu
    Jing, Liao
    Tan, Zhifeng
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4002 - 4015
  • [4] Self-Supervised Sketch-to-Image Synthesis
    Liu, Bingchen
    Zhu, Yizhe
    Song, Kunpeng
    Elgammal, Ahmed
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2073 - 2081
  • [5] Sketch-to-image synthesis via semantic masks
    Baraheem, Samah S.
    Tam V. Nguyen
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (10) : 29047 - 29066
  • [6] Sketch-to-image synthesis via semantic masks
    Samah S. Baraheem
    Tam V. Nguyen
    [J]. Multimedia Tools and Applications, 2024, 83 : 29047 - 29066
  • [7] Facial attribute-controlled sketch-to-image translation with generative adversarial networks
    Hu, Mingming
    Guo, Jingtao
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
  • [8] Facial attribute-controlled sketch-to-image translation with generative adversarial networks
    Mingming Hu
    Jingtao Guo
    [J]. EURASIP Journal on Image and Video Processing, 2020
  • [9] WHFL: Wavelet-Domain High Frequency Loss for Sketch-to-Image Translation
    Kim, Min Woo
    Cho, Nam Ik
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 744 - 754
  • [10] Learning semantic priors for texture-realistic sketch-to-image synthesis
    Li, Zeyu
    Deng, Cheng
    Wei, Kun
    Liu, Wei
    Tao, Dacheng
    [J]. NEUROCOMPUTING, 2021, 464 : 130 - 140