Towards Photographic Image Manipulation with Balanced Growing of Generative Autoencoders

Authors
Heljakka, Ari [1, 2]
Solin, Arno [1]
Kannala, Juho [1]
Affiliations
[1] Aalto Univ, Dept Comp Sci, Espoo, Finland
[2] GenMind Ltd, Espoo, Finland
Funding
Academy of Finland
DOI
10.1109/wacv45572.2020.9093375
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a generative autoencoder that provides fast encoding, faithful reconstructions (e.g. retaining the identity of a face), sharp generated/reconstructed samples in high resolutions, and a well-structured latent space that supports semantic manipulation of the inputs. There are no current autoencoder or GAN models that satisfactorily achieve all of these. We build on the progressively growing autoencoder model PIONEER, for which we completely alter the training dynamics based on a careful analysis of recently introduced normalization schemes. We show significantly improved visual and quantitative results for face identity conservation in CELEBA-HQ. Our model achieves state-of-the-art disentanglement of latent space, both quantitatively and via realistic image attribute manipulations. On the LSUN Bedrooms dataset, we improve the disentanglement performance of the vanilla PIONEER, despite having a simpler model. Overall, our results indicate that the PIONEER networks provide a way towards photorealistic face manipulation.
Pages: 3109-3118
Page count: 10