Self-supervised learning for fine-grained monocular 3D face reconstruction in the wild

被引：0

作者：

Huang, Dongjin ^{[1
]}

Shi, Yongsheng ^{[1
]}

Liu, Jinhua ^{[1
]}

Tang, Wen ^{[2
]}

机构：

[1] Shanghai Univ, Shanghai Film Acad, Shanghai 200072, Peoples R China

[2] Bournemouth Univ, Dept Creat Technol, Poole BH12 5BB, England

来源：

MULTIMEDIA SYSTEMS | 2024年 / 30卷 / 04期

关键词：

3D face reconstruction; Monocular image; 3DMM; Self-supervised learning; Coarse-to-fine model; ALIGNMENT; SHAPE;

D O I：

10.1007/s00530-024-01436-3

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reconstructing 3D face from monocular images is a challenging computer vision task, due to the limitations of traditional 3DMM (3D Morphable Model) and the lack of high-fidelity 3D facial scanning data. To solve this issue, we propose a novel coarse-to-fine self-supervised learning framework for reconstructing fine-grained 3D faces from monocular images in the wild. In the coarse stage, face parameters extracted from a single image are used to reconstruct a coarse 3D face through a 3DMM. In the refinement stage, we design a wavelet transform perception model to extract facial details in different frequency domains from an input image. Furthermore, we propose a depth displacement module based on the wavelet transform perception model to generate a refined displacement map from the unwrapped UV textures of the input image and rendered coarse face, which can be used to synthesize detailed 3D face geometry. Moreover, we propose a novel albedo map module based on the wavelet transform perception model to capture high-frequency texture information and generate a detailed albedo map consistent with face illumination. The detailed face geometry and albedo map are used to reconstruct a fine-grained 3D face without any labeled data. We have conducted extensive experiments that demonstrate the superiority of our method over existing state-of-the-art approaches for 3D face reconstruction on four public datasets including CelebA, LS3D, LFW, and NoW benchmark. The experimental results indicate that our method achieved higher accuracy and robustness, particularly of under the challenging conditions such as occlusion, large poses, and varying illuminations.

引用

页数：18

共 50 条

[1] Self-Supervised Learning of Detailed 3D Face Reconstruction
Chen, Yajing
Wu, Fanzi
Wang, Zeyu
Song, Yibing
Ling, Yonggen
Bao, Linchao
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8696 - 8705
[2] Siamese self-supervised learning for fine-grained visual classification
Ji, Ruyi
Li, Jiaying
Zhang, Libo
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 229
[3] Enhancing Face Recognition with Self-Supervised 3D Reconstruction
He, Mingjie
Zhang, Jie
Shan, Shiguang
Chen, Xilin
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4052 - 4061
[4] CanonPose: Self-Supervised Monocular 3D Human Pose Estimation in the Wild
Wandt, Bastian
Rudolph, Marco
Zell, Petrissa
Rhodin, Helge
Rosenhahn, Bodo
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13289 - 13299
[5] LOW-FREQUENCY GUIDED SELF-SUPERVISED LEARNING FOR HIGH-FIDELITY 3D FACE RECONSTRUCTION IN THE WILD
Wang, Pengrui
Lin, Chunze
Xu, Bo
Che, Wujun
Wang, Quan
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[6] SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields
Cao, Anh-Quan
de Charette, Raoul
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9353 - 9364
[7] Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation
Jung, Hyunyoung
Park, Eunhyeok
Yoo, Sungjoo
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12622 - 12632
[8] Self-Supervised 3D Face Reconstruction via Conditional Estimation
Wen, Yandong
Liu, Weiyang
Raj, Bhiksha
Singh, Rita
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13269 - 13278
[9] Fine-Grained Self-Supervised Learning with Jigsaw puzzles for medical image classification
Department of Software, Ajou University, Korea, Republic of
不详
[J]. Comput. Biol. Med, 2024,
[10] 3D Guided Fine-Grained Face Manipulation
Geng, Zhenglin
Cao, Chen
Tulyakov, Sergey
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9813 - 9822

← 1 2 3 4 5 →