Photo-Realistic Monocular Gaze Redirection Using Generative Adversarial Networks

被引:21
|
作者
He, Zhe [1 ,2 ,3 ]
Spurr, Adrian [1 ]
Zhang, Xucong [1 ]
Hilliges, Otmar [1 ]
机构
[1] Swiss Fed Inst Technol, AIT Lab, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Inst Neuroinformat, Zurich, Switzerland
[3] Univ Zurich, Zurich, Switzerland
关键词
D O I
10.1109/ICCV.2019.00703
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gaze redirection is the task of changing the gaze to a desired direction for a given monocular eye patch image. Many applications such as videoconferencing, films, games, and generation of training data for gaze estimation require redirecting the gaze, without distorting the appearance of the area surrounding the eye and while producing photo-realistic images. Existing methods lack the ability to generate perceptually plausible images. In this work, we present a novel method to alleviate this problem by leveraging generative adversarial training to synthesize an eye image conditioned on a target gaze direction. Our method ensures perceptual similarity and consistency of synthesized images to the real images. Furthermore, a gaze estimation loss is used to control the gaze direction accurately. To attain highquality images, we incorporate perceptual and cycle consistency losses into our architecture. In extensive evaluations we show that the proposed method outperforms state-of-the-art approaches in terms of both image quality and redirection precision. Finally, we show that generated images can bring significant improvement for the gaze estimation task if used to augment real training data.
引用
收藏
页码:6931 / 6940
页数:10
相关论文
共 50 条
  • [1] Photo-realistic dehazing via contextual generative adversarial networks
    Zhang, Shengdong
    He, Fazhi
    Ren, Wenqi
    [J]. MACHINE VISION AND APPLICATIONS, 2020, 31 (05)
  • [2] Photo-realistic dehazing via contextual generative adversarial networks
    Shengdong Zhang
    Fazhi He
    Wenqi Ren
    [J]. Machine Vision and Applications, 2020, 31
  • [3] SIMGAN: PHOTO-REALISTIC SEMANTIC IMAGE MANIPULATION USING GENERATIVE ADVERSARIAL NETWORKS
    Yu, Simiao
    Dong, Hao
    Liang, Felix
    Mo, Yuanhan
    Wu, Chao
    Guo, Yike
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 734 - 738
  • [4] Towards Photo-Realistic Visible Watermark Removal with Conditional Generative Adversarial Networks
    Li, Xiang
    Lu, Chan
    Cheng, Danni
    Li, Wei-Hong
    Cao, Mei
    Liu, Bo
    Ma, Jiechao
    Zheng, Wei-Shi
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 345 - 356
  • [5] SEMANTICGAN: GENERATIVE ADVERSARIAL NETWORKS FOR SEMANTIC IMAGE TO PHOTO-REALISTIC IMAGE TRANSLATION
    Liu, Junling
    Zou, Yuexian
    Yang, Dongming
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2528 - 2532
  • [6] StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks
    Zhang, Han
    Xu, Tao
    Li, Hongsheng
    Zhang, Shaoting
    Wang, Xiaogang
    Huang, Xiaolei
    Metaxas, Dimitris
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5908 - 5916
  • [7] Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
    Ledig, Christian
    Theis, Lucas
    Huszar, Ferenc
    Caballero, Jose
    Cunningham, Andrew
    Acosta, Alejandro
    Aitken, Andrew
    Tejani, Alykhan
    Totz, Johannes
    Wang, Zehan
    Shi, Wenzhe
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 105 - 114
  • [8] Photo-realistic face age progression/regression using a single generative adversarial network
    Zeng, Jiangfeng
    Ma, Xiao
    Zhou, Ke
    [J]. NEUROCOMPUTING, 2019, 366 : 295 - 304
  • [9] Conditional Adversarial Networks for Multimodal Photo-Realistic Point Cloud Rendering
    Peters, Torben
    Brenner, Claus
    [J]. PFG-JOURNAL OF PHOTOGRAMMETRY REMOTE SENSING AND GEOINFORMATION SCIENCE, 2020, 88 (3-4): : 257 - 269
  • [10] Conditional Adversarial Networks for Multimodal Photo-Realistic Point Cloud Rendering
    Torben Peters
    Claus Brenner
    [J]. PFG – Journal of Photogrammetry, Remote Sensing and Geoinformation Science, 2020, 88 : 257 - 269