Image-to-Image Translation With Disentangled Latent Vectors for Face Editing

被引:14
|
作者
Dalva, Yusuf [1 ]
Pehlivan, Hamza [2 ]
Hatipoglu, Oyku Irmak [3 ]
Moran, Cansu [4 ]
Dundar, Aysegul [5 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Max Planck Inst, D-80539 Munich, Germany
[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[4] Tech Univ Munich, D-80333 Munich, Germany
[5] Bilkent Univ, Dept Comp Sci, TR-06800 Bilkent, Turkiye
关键词
Image translation; generative adversarial net works; latent space manipulation; face attribute editing;
D O I
10.1109/TPAMI.2023.3308102
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength and disentanglement in the representations of attributes to preserve the other attributes during edits. For this goal, inspired by the latent space factorization works of fixed pretrained GANs, we design the attribute editing by latent space factorization, and for each attribute, we learn a linear direction that is orthogonal to the others. We train these directions with orthogonality constraints and disentanglement losses. To project images to semantically organized latent spaces, we set an encoder-decoder architecture with attention-based skip connections. We extensively compare with previous image translation algorithms and editing with pretrained GAN works. Our extensive experiments show that our method significantly improves over the state-of-the-arts.
引用
收藏
页码:14777 / 14788
页数:12
相关论文
共 50 条
  • [21] Disentangling latent space better for few-shot image-to-image translation
    Peng Liu
    Yueyue Wang
    Angang Du
    Liqiang Zhang
    Bin Wei
    Zhaorui Gu
    Xiaodong Wang
    Haiyong Zheng
    Juan Li
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 419 - 427
  • [22] Generative image completion with image-to-image translation
    Shuzhen Xu
    Qing Zhu
    Jin Wang
    Neural Computing and Applications, 2020, 32 : 7333 - 7345
  • [23] Generative image completion with image-to-image translation
    Xu, Shuzhen
    Zhu, Qing
    Wang, Jin
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (11): : 7333 - 7345
  • [24] One-way multimodal image-to-image translation for heterogeneous face recognition
    Ji, Shulin
    Zhai, Xingang
    Liu, Jie
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
  • [25] Vector Quantized Image-to-Image Translation
    Chen, Yu-Jie
    Cheng, Shin-I
    Chiu, Wei-Chen
    Tseng, Hung-Yu
    Lee, Hsin-Ying
    COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 440 - 456
  • [26] Deliberation Learning for Image-to-Image Translation
    He, Tianyu
    Xia, Yingce
    Lin, Jianxin
    Tan, Xu
    He, Di
    Qin, Tao
    Chen, Zhibo
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2484 - 2490
  • [27] Latent-SDE: guiding stochastic differential equations in latent space for unpaired image-to-image translation
    Zhang, Xianjie
    Li, Min
    He, Yujie
    Gou, Yao
    Zhang, Yusen
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (06) : 7765 - 7775
  • [28] Image-to-Image Translation: Methods and Applications
    Pang, Yingxue
    Lin, Jianxin
    Qin, Tao
    Chen, Zhibo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3859 - 3881
  • [29] Toward Multimodal Image-to-Image Translation
    Zhu, Jun-Yan
    Zhang, Richard
    Pathak, Deepak
    Darrell, Trevor
    Efros, Alexei A.
    Wang, Oliver
    Shechtman, Eli
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [30] Multimodal Unsupervised Image-to-Image Translation
    Huang, Xun
    Liu, Ming-Yu
    Belongie, Serge
    Kautz, Jan
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196