Accurate 3D Face Reconstruction with Facial Component Tokens

被引:5
|
作者
Zhang, Tianke [1 ,2 ]
Chu, Xuangeng [2 ]
Liu, Yunfei [2 ]
Lin, Lijian [2 ]
Yang, Zhendong [2 ]
Xu, Zhengzhuo [1 ,2 ]
Cao, Chengkun [1 ,2 ]
Yu, Fei [3 ]
Zhou, Changyin [3 ]
Yuan, Chun [1 ]
Li, Yu [2 ]
机构
[1] Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] IDEA, Shenzhen, Peoples R China
[3] Vistring Inc, Hong Kong, Peoples R China
基金
国家重点研发计划;
关键词
MORPHABLE MODEL;
D O I
10.1109/ICCV51070.2023.00829
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurately reconstructing 3D faces from monocular images and videos is crucial for various applications, such as digital avatar creation. However, the current deep learning-based methods face significant challenges in achieving accurate reconstruction with disentangled facial parameters and ensuring temporal stability in single-frame methods for 3D face tracking on video data. In this paper, we propose TokenFace, a transformer-based monocular 3D face reconstruction model. TokenFace uses separate tokens for different facial components to capture information about different facial parameters and employs temporal transformers to capture temporal information from video data. This design can naturally disentangle different facial components and is flexible to both 2D and 3D training data. Trained on hybrid 2D and 3D data, our model shows its power in accurately reconstructing faces from images and producing stable results for video data. Experimental results on popular benchmarks NoW and Stirling demonstrate that TokenFace achieves state-of-the-art performance, outperforming existing methods on all metrics by a large margin.
引用
收藏
页码:8999 / 9008
页数:10
相关论文
共 50 条
  • [31] Biometrics for Human Face Reconstruction in 3D
    Robert-Inacio, Fredrique
    Caudal, Frederic
    Rousset, Cederic
    2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 608 - +
  • [32] Research and Prospect on 3D Face Reconstruction
    Yao, Yonghong
    Ni, Rongrong
    PROCEEDINGS OF 2010 CROSS-STRAIT CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY, 2010, : 148 - 152
  • [33] Efficient 3D reconstruction for face recognition
    Jiang, DL
    Hu, YX
    Yan, SC
    Zhang, L
    Zhang, HJ
    Gao, W
    PATTERN RECOGNITION, 2005, 38 (06) : 787 - 798
  • [34] A 3D face matching framework for facial curves
    ter Haar, Frank B.
    Veltkamp, Remco C.
    GRAPHICAL MODELS, 2009, 71 (1-6) : 77 - 91
  • [35] Effects on facial expression in 3D face recognition
    Chang, K
    Bowyer, K
    Flynn, P
    BIOMETRIC TECHNOLOGY FOR HUMAN IDENTIFICATION II, 2005, 5779 : 132 - 143
  • [36] Facial features detection for 3D Face Modeling
    Rozinaj, G
    Mistral, FL
    2003 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2003, : 951 - 954
  • [37] A3FD: Accurate 3D face detection
    Anisetti, Marco
    Bellandi, Valerio
    Damiani, Ernesto
    Arnone, Luigi
    Rat, Benoit
    SIGNAL PROCESSING FOR IMAGE ENHANCEMENT AND MULTIMEDIA PROCESSING, 2008, : 155 - +
  • [38] Multimodal Facial Expression Recognition Based on 3D Face Reconstruction from 2D Images
    Moeini, Ali
    Moeini, Hossein
    FACE AND FACIAL EXPRESSION RECOGNITION FROM REAL WORLD VIDEOS, 2015, 8912 : 46 - 57
  • [39] 3D Modeling system of human face and full 3D facial caricaturing
    Fujiwara, T
    Koshimizu, H
    Fujimura, K
    Fujita, G
    Noguchi, Y
    Ishikawa, N
    VSMM 2001: SEVENTH INTERNATIONAL CONFERENCE ON VIRTUAL SYSTEMS AND MULTIMEDIA, PROCEEDINGS: ENHANCED REALITIES: AUGMENTED AND UNPLUGGED, 2001, : 625 - 633
  • [40] 3D modeling system of human face and full 3D facial caricaturing
    Fujiwara, T
    Koshimizu, H
    Fujimura, K
    Kihara, H
    Noguchi, Y
    Ishikawa, N
    THIRD INTERNATIONAL CONFERENCE ON 3-D DIGITAL IMAGING AND MODELING, PROCEEDINGS, 2001, : 385 - 392