Identity-Aware Variational Autoencoder for Face Swapping

被引:0
|
作者
Li, Zonglin [1 ]
Zhang, Zhaoxin [1 ]
He, Shengfeng [2 ]
Meng, Quanling [1 ]
Zhang, Shengping [1 ]
Zhong, Bineng [3 ,4 ]
Ji, Rongrong [5 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Weihai 264209, Peoples R China
[2] Singapore Management Univ, Sch Comp & Informat Syst, Singapore 188065, Singapore
[3] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin 541004, Peoples R China
[4] Guangxi Normal Univ, Sch Software, Guilin 541004, Peoples R China
[5] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
关键词
Faces; Face recognition; Training; Three-dimensional displays; Decoding; Task analysis; Shape; Face swapping; variational autoencoder; weak-supervised training; UNIFIED FRAMEWORK; IMAGE; MODEL;
D O I
10.1109/TCSVT.2024.3349909
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Face swapping aims to transfer the identity of a source face to a target face image while preserving the target attributes (e.g., facial expression, head pose, illumination, and background). Most existing methods use a face recognition model to extract global features from the source face and directly fuse them with the target to generate a swapping result. However, identity-irrelevant attributes (e.g., hairstyle and facial appearances) contribute a lot to the recognition task, and thus swapping this task-specific feature inevitably interfuses source attributes with target ones. In this paper, we propose an identity-aware variational autoencoder (ID-VAE) based face swapping framework, dubbed VAFSwap, which learns disentangled identity and attribute representations for high-fidelity face swapping. In particular, we overcome the unpaired training barrier of VAE and impose a proxy identity on the latent space by exploiting the weak supervision from an auxiliary image set whose identity is averaged from multiple collected face images. To explicitly guide the identity fusion, we further devise an identity-associated matrix that corresponds different face regions with their identity representations to perform identity-related feature interactions. Finally, we incorporate spatial dimensions into the latent space and exploit the generative priors of a pre-trained face generator, allowing the effective elimination of noticeable swapping artifacts. Extensive experiments on the FaceForensics++ and CelebA-HQ datasets demonstrate that our method outperforms the state-of-the-art significantly.
引用
收藏
页码:5466 / 5479
页数:14
相关论文
共 50 条
  • [41] Adaptive Deep Metric Learning for Identity-Aware Facial Expression Recognition
    Liu, Xiaofeng
    Kumar, B. V. K. Vijaya
    You, Jane
    Jia, Ping
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 522 - 531
  • [42] Expressive Multi-Agent Communication via Identity-Aware Learning
    Du, Wei
    Ding, Shifei
    Guo, Lili
    Zhang, Jian
    Ding, Ling
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17354 - 17361
  • [43] Enabling Identity-Aware Tracking via Fusion of Visual and Inertial Features
    Tsai, Richard Yi-Chia
    Ke, Hans Ting-Yuan
    Lin, Kate Ching-Ju
    Tseng, Yu-Chee
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 2260 - 2266
  • [44] Reconstructing Neutral Face Expressions with Disentangled Variational Autoencoder
    Wiem, Grina
    Ali, Douik
    ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT II, 2024, 14496 : 83 - 94
  • [45] Face Swapping Consistency Transfer with Neural Identity Carrier
    Liu, Kunlin
    Wang, Ping
    Zhou, Wenbo
    Zhang, Zhenyu
    Ge, Yanhao
    Liu, Honggu
    Zhang, Weiming
    Yu, Nenghai
    FUTURE INTERNET, 2021, 13 (11)
  • [46] Implicit Identity Driven Deepfake Face Swapping Detection
    Huang, Baojin
    Wang, Zhongyuan
    Yang, Jifan
    Ai, Jiaxin
    Zou, Qin
    Wang, Qian
    Ye, Dengpan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4490 - 4499
  • [47] UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement
    You, Sisi
    Yao, Hantao
    Bao, Bing-Kun
    Xu, Changsheng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21876 - 21886
  • [48] Mutual information regularized identity-aware facial expression recognition in compressed video
    Liu, Xiaofeng
    Jin, Linghao
    Han, Xu
    You, Jane
    PATTERN RECOGNITION, 2021, 119
  • [49] Identity-Aware Textual-Visual Matching with Latent Co-attention
    Li, Shuang
    Xiao, Tong
    Li, Hongsheng
    Yang, Wei
    Wang, Xiaogang
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1908 - 1917
  • [50] Review-aware heterogeneous variational autoencoder recommendation model
    Liu S.
    Zhang J.
    Chen X.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2022, 62 (01): : 88 - 97