Photo-Realistic Monocular Gaze Redirection Using Generative Adversarial Networks

被引:21
|
作者
He, Zhe [1 ,2 ,3 ]
Spurr, Adrian [1 ]
Zhang, Xucong [1 ]
Hilliges, Otmar [1 ]
机构
[1] Swiss Fed Inst Technol, AIT Lab, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Inst Neuroinformat, Zurich, Switzerland
[3] Univ Zurich, Zurich, Switzerland
关键词
D O I
10.1109/ICCV.2019.00703
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gaze redirection is the task of changing the gaze to a desired direction for a given monocular eye patch image. Many applications such as videoconferencing, films, games, and generation of training data for gaze estimation require redirecting the gaze, without distorting the appearance of the area surrounding the eye and while producing photo-realistic images. Existing methods lack the ability to generate perceptually plausible images. In this work, we present a novel method to alleviate this problem by leveraging generative adversarial training to synthesize an eye image conditioned on a target gaze direction. Our method ensures perceptual similarity and consistency of synthesized images to the real images. Furthermore, a gaze estimation loss is used to control the gaze direction accurately. To attain highquality images, we incorporate perceptual and cycle consistency losses into our architecture. In extensive evaluations we show that the proposed method outperforms state-of-the-art approaches in terms of both image quality and redirection precision. Finally, we show that generated images can bring significant improvement for the gaze estimation task if used to augment real training data.
引用
收藏
页码:6931 / 6940
页数:10
相关论文
共 50 条
  • [31] Generation of Realistic Synthetic Validation Healthcare Datasets Using Generative Adversarial Networks
    Ozyigit, Eda Bilici
    Arvanitis, Theodoros N.
    Despotou, George
    [J]. IMPORTANCE OF HEALTH INFORMATICS IN PUBLIC HEALTH DURING A PANDEMIC, 2020, 272 : 322 - 325
  • [32] Realistic image generation using adversarial generative networks combined with depth information
    Yu, Qi
    Yu, Lan
    Li, Guangju
    Jin, Dehu
    Qi, Meng
    [J]. DIGITAL SIGNAL PROCESSING, 2023, 143
  • [33] CORRGAN: SAMPLING REALISTIC FINANCIAL CORRELATION MATRICES USING GENERATIVE ADVERSARIAL NETWORKS
    Marti, Gautier
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8459 - 8463
  • [34] WGANVO: monocular visual odometry based on generative adversarial networks
    Cremona, Javier
    Uzal, Lucas
    Pire, Taihu
    [J]. REVISTA IBEROAMERICANA DE AUTOMATICA E INFORMATICA INDUSTRIAL, 2022, 19 (02): : 144 - 153
  • [35] Towards Realistic Market Simulations: a Generative Adversarial Networks Approach
    Coletta, Andrea
    Prata, Matteo
    Conti, Michele
    Mercanti, Emanuele
    Bartolini, Novella
    Moulin, Aymeric
    Vyetrenko, Svitlana
    Balch, Tucker
    [J]. ICAIF 2021: THE SECOND ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, 2021,
  • [36] Towards Generating Structurally Realistic Models by Generative Adversarial Networks
    Rahimi, Abbas
    Tisi, Massimo
    Rahimi, Shekoufeh Kolahdouz
    Berardinelli, Luca
    [J]. 2023 ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION, MODELS-C, 2023, : 597 - 604
  • [37] Gaze in the Dark: Gaze Estimation in a Low-Light Environment with Generative Adversarial Networks
    Kim, Jung-Hwa
    Jeong, Jin-Woo
    [J]. SENSORS, 2020, 20 (17) : 1 - 20
  • [38] Generate Realistic Traffic Sign Image Using Deep Convolutional Generative Adversarial Networks
    Liu, Yan-Ting
    Chen, Rung-Ching
    Dewi, Christine
    [J]. 2021 IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (DSC), 2021,
  • [39] Structured Coupled Generative Adversarial Networks for Unsupervised Monocular Depth Estimation
    Puscas, Mihai Marian
    Xu, Dan
    Pilzer, Andrea
    Sebe, Niculae
    [J]. 2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 18 - 26
  • [40] Photo-Realistic Emoticon Generation Using Multi-Modal Input
    Mittal, Paritosh
    Aggarwal, Kunal
    Sahu, Pragya Paramita
    Vatsalya, Vishal
    Mitra, Soumyajit
    Singh, Vikrant
    Veera, Viswanath
    Venkatesan, Shankar M.
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2020, 2020, : 254 - 258