Self-supervised learning for fine-grained monocular 3D face reconstruction in the wild

被引:0
|
作者
Huang, Dongjin [1 ]
Shi, Yongsheng [1 ]
Liu, Jinhua [1 ]
Tang, Wen [2 ]
机构
[1] Shanghai Univ, Shanghai Film Acad, Shanghai 200072, Peoples R China
[2] Bournemouth Univ, Dept Creat Technol, Poole BH12 5BB, England
关键词
3D face reconstruction; Monocular image; 3DMM; Self-supervised learning; Coarse-to-fine model; ALIGNMENT; SHAPE;
D O I
10.1007/s00530-024-01436-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reconstructing 3D face from monocular images is a challenging computer vision task, due to the limitations of traditional 3DMM (3D Morphable Model) and the lack of high-fidelity 3D facial scanning data. To solve this issue, we propose a novel coarse-to-fine self-supervised learning framework for reconstructing fine-grained 3D faces from monocular images in the wild. In the coarse stage, face parameters extracted from a single image are used to reconstruct a coarse 3D face through a 3DMM. In the refinement stage, we design a wavelet transform perception model to extract facial details in different frequency domains from an input image. Furthermore, we propose a depth displacement module based on the wavelet transform perception model to generate a refined displacement map from the unwrapped UV textures of the input image and rendered coarse face, which can be used to synthesize detailed 3D face geometry. Moreover, we propose a novel albedo map module based on the wavelet transform perception model to capture high-frequency texture information and generate a detailed albedo map consistent with face illumination. The detailed face geometry and albedo map are used to reconstruct a fine-grained 3D face without any labeled data. We have conducted extensive experiments that demonstrate the superiority of our method over existing state-of-the-art approaches for 3D face reconstruction on four public datasets including CelebA, LS3D, LFW, and NoW benchmark. The experimental results indicate that our method achieved higher accuracy and robustness, particularly of under the challenging conditions such as occlusion, large poses, and varying illuminations.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Self-Supervised Learning of Detailed 3D Face Reconstruction
    Chen, Yajing
    Wu, Fanzi
    Wang, Zeyu
    Song, Yibing
    Ling, Yonggen
    Bao, Linchao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8696 - 8705
  • [2] Siamese self-supervised learning for fine-grained visual classification
    Ji, Ruyi
    Li, Jiaying
    Zhang, Libo
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 229
  • [3] Enhancing Face Recognition with Self-Supervised 3D Reconstruction
    He, Mingjie
    Zhang, Jie
    Shan, Shiguang
    Chen, Xilin
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4052 - 4061
  • [4] CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild
    Wandt, Bastian
    Rudolph, Marco
    Zell, Petrissa
    Rhodin, Helge
    Rosenhahn, Bodo
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13289 - 13299
  • [5] LOW-FREQUENCY GUIDED SELF-SUPERVISED LEARNING FOR HIGH-FIDELITY 3D FACE RECONSTRUCTION IN THE WILD
    Wang, Pengrui
    Lin, Chunze
    Xu, Bo
    Che, Wujun
    Wang, Quan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [6] SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields
    Cao, Anh-Quan
    de Charette, Raoul
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9353 - 9364
  • [7] Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation
    Jung, Hyunyoung
    Park, Eunhyeok
    Yoo, Sungjoo
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12622 - 12632
  • [8] Self-Supervised 3D Face Reconstruction via Conditional Estimation
    Wen, Yandong
    Liu, Weiyang
    Raj, Bhiksha
    Singh, Rita
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13269 - 13278
  • [9] Fine-Grained Self-Supervised Learning with Jigsaw puzzles for medical image classification
    Department of Software, Ajou University, Korea, Republic of
    不详
    [J]. Comput. Biol. Med, 2024,
  • [10] 3D Guided Fine-Grained Face Manipulation
    Geng, Zhenglin
    Cao, Chen
    Tulyakov, Sergey
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9813 - 9822