Facial expression morphing: enhancing visual fidelity and preserving facial details in CycleGAN-based expression synthesis

被引:0
|
作者
Sub-r-pa, Chayanon [1 ]
Chen, Rung-Ching [1 ]
Fan, Ming-Zhong [1 ]
机构
[1] Chaoyang Univ Technol, Dept Informat Management, Taichung, Taiwan
关键词
Facial expression synthesis; GANs; Image-to-image; CycleGAN; Image processing; Image translation; RECOGNITION; DEEP;
D O I
10.7717/peerj-cs.2438
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advancements in facial expression synthesis using deep learning, particularly with Cycle-Consistent Adversarial Networks (CycleGAN), have led to impressive results. However, a critical challenge persists: the generated expressions often lack the sharpness and fine details of the original face, such as freckles, moles, or birthmarks. To address this issue, we introduce the Facial Expression Morphing (FEM) algorithm, a novel post-processing method designed to enhance the visual fidelity of CycleGANbased outputs. The FEM method blends the input image with the generated expression, prioritizing the preservation of crucial facial details. We experimented with our method on the Radboud Faces Database (RafD) and evaluated employing the Fr & eacute;chet Inception Distance (FID) standard benchmark for image-to-image translation and introducing a new metric, FSD (Facial Similarity Distance), to specifically measure the similarity between translated and real images. Our comprehensive analysis of CycleGAN, UNet Vision Transformer cycle-consistent GAN versions 1 (UVCGANv1) and 2 (UVCGANv2) reveals a substantial enhancement in image clarity and preservation of intricate details. The average FID score of 31.92 achieved by our models represents a remarkable 50% reduction compared to the previous state-of-the-art model's score of 63.82, showcasing the significant advancements made in this domain. This substantial enhancement in image quality is further supported by our proposed FSD metric, which shows a closer resemblance between FEM-processed images and the original faces.
引用
收藏
页数:33
相关论文
共 50 条
  • [41] Facial Expression Recognition using Anatomy Based Facial Graph
    Mohseni, Sina
    Zarei, Niloofar
    Ramazani, Saba
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3715 - 3719
  • [42] Facial expression recognition based on facial part attention mechanism
    Zhong, Qiubo
    Fang, Baofu
    Wei, Shenbin
    Wang, Zaijun
    Zhang, Haoxiang
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
  • [43] Two Ways to Facial Expression Recognition? Motor and Visual Information Have Different Effects on Facial Expression Recognition
    de la Rosa, Stephan
    Fademrecht, Laura
    Buelthoff, Heinrich H.
    Giese, Martin A.
    Curio, Cristobal
    PSYCHOLOGICAL SCIENCE, 2018, 29 (08) : 1257 - 1269
  • [44] Facial expression recognition based on Electroencephalogram and facial landmark localization
    Li, Dahua
    Wang, Zhe
    Gao, Qiang
    Song, Yu
    Yu, Xiao
    Wang, Chuhan
    TECHNOLOGY AND HEALTH CARE, 2019, 27 (04) : 373 - 387
  • [45] Emotiongan: Facial Expression Synthesis Based on Pre Trained Generator
    Ning, Xin
    Xu, Shaohui
    Zong, Yixin
    Tian, Weijuan
    Sun, Linjun
    Dong, Xiaoli
    2020 4TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2020), 2020, 1518
  • [46] Parameterized facial expression synthesis based on MPEG-4
    Raouzaiou, A
    Tsapatsoulis, N
    Karpouzis, K
    Kollias, S
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2002, 2002 (10) : 1021 - 1038
  • [47] ApprGAN: appearance-based GAN for facial expression synthesis
    Peng, Yao
    Yin, Hujun
    IET IMAGE PROCESSING, 2019, 13 (14) : 2706 - 2715
  • [48] Parameterized Facial Expression Synthesis Based on MPEG-4
    Amaryllis Raouzaiou
    Nicolas Tsapatsoulis
    Kostas Karpouzis
    Stefanos Kollias
    EURASIP Journal on Advances in Signal Processing, 2002
  • [49] ORTHOGONAL DISCRIMINANT NEIGHBORHOOD PRESERVING EMBEDDING FOR FACIAL EXPRESSION RECOGNITION
    Liu, Shuai
    Ruan, Qiuqi
    Ni, Rongrong
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2757 - 2760
  • [50] FACIAL EXPRESSION PRESERVING PRIVACY PROTECTION USING IMAGE MELDING
    Nakashima, Yuta
    Koyama, Tatsuya
    Yokoya, Naokazu
    Babaguchi, Noboru
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,