EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

被引:1
|
作者
Drobyshev, Nikita [1 ]
Casademunt, Antoni Bigata [1 ]
Vougioukas, Konstantinos [1 ]
Landgraf, Zoe [1 ]
Petridis, Stavros [1 ]
Pantic, Maja [1 ]
机构
[1] Imperial Coll London, London, England
关键词
D O I
10.1109/CVPR52733.2024.00812
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Head avatars animated by visual signals have gained popularity, particularly in cross-driving synthesis where the driver differs from the animated character, a challenging but highly practical approach. The recently presented MegaPortraits model has demonstrated state-of-the-art results in this domain. We conduct a deep examination and evaluation of this model, with a particular focus on its latent space for facial expression descriptors, and uncover several limitations with its ability to express intense face motions. To address these limitations, we propose substantial changes in both training pipeline and model architecture, to introduce our EMOPortraits model, where we: Enhance the model's capability to faithfully support in-tense, asymmetric face expressions, setting a new state-of-the-art result in the emotion transfer task, surpassing previous methods in both metrics and quality. Incorporate speech-driven mode to our model, achieving top-tier performance in audio-driven facial animation, making it possible to drive source identity through diverse modalities, including visual signal, audio, or a blend of both. Furthermore, we propose a novel multi-view video dataset featuring a wide range of intense and asymmetric facial expressions, filling the gap with absence of such data in existing datasets.
引用
收藏
页码:8498 / 8507
页数:10
相关论文
共 41 条
  • [1] MegaPortraits: One-shot Megapixel Neural Head Avatars
    Drobyshev, Nikita
    Chelishev, Jenya
    Khakhulin, Taras
    Ivakhnenko, Aleksei
    Lempitsky, Victor
    Zakharov, Egor
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2663 - 2671
  • [2] Realistic One-Shot Mesh-Based Head Avatars
    Khakhulin, Taras
    Sklyarova, Vanessa
    Lempitsky, Victor
    Zakharov, Egor
    COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 345 - 362
  • [3] Rapid One-Shot Acquisition of Dynamic VR Avatars
    Malleson, Charles
    Kosek, Maggie
    Klaudiny, Martin
    Huerta, Ivan
    Bazin, Jean-Charles
    Sorkine-Hornung, Alexander
    Mine, Mark
    Mitchell, Kenny
    2017 IEEE VIRTUAL REALITY (VR), 2017, : 131 - 140
  • [4] Demonstration: Rapid One-Shot Acquisition of Dynamic VR Avatars
    Malleson, Charles
    Kosek, Maggie
    Klaudiny, Martin
    Huerta, Ivan
    Bazin, Jean-Charles
    Sorkine-Hornung, Alexander
    Mine, Mark
    Mitchell, Kenny
    2017 IEEE VIRTUAL REALITY (VR), 2017, : 447 - 448
  • [5] MULTIMODAL ONE-SHOT LEARNING OF SPEECH AND IMAGES
    Eloff, Ryan
    Engelbrecht, Herman A.
    Kamper, Herman
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8623 - 8627
  • [6] DINAR: Diffusion Inpainting of Neural Textures for One-Shot Human Avatars
    Svitov, David
    Gudkov, Dmitrii
    Bashirov, Renat
    Lempitsky, Victor
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7039 - 7049
  • [7] One-shot Implicit Animatable Avatars with Model-based Priors
    Huang, Yangyi
    Yi, Hongwei
    Liu, Weiyang
    Wang, Haofan
    Wu, Boxi
    Wang, Wenxiao
    Lin, Binbin
    Zhang, Debing
    Cai, Deng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 8940 - 8951
  • [8] HeadGAN: One-shot Neural Head Synthesis and Editing
    Doukas, Michail Christos
    Zafeiriou, Stefanos
    Sharmanska, Viktoriia
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14378 - 14387
  • [9] Adaptive Super Resolution For One-Shot Talking-Head Generation
    Song, Luchuan
    Liu, Pinxin
    Yin, Guojun
    Xu, Chenliang
    arXiv, 1600,
  • [10] EmoStyle: One-Shot Facial Expression Editing Using Continuous Emotion Parameters
    Azari, Bita
    Lim, Angelica
    2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, 2024, : 6373 - 6382