Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement

Cited by: 1

Authors:

Qin, Xujia [1]

Li, Xinyu [1]

Li, Mengjia [1]

Zheng, Hongbo [1]

Xu, Xiaogang [2,3]

Affiliations:

[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Peoples R China

[2] Zhejiang Lab, Inst Artificial Intelligence, Hangzhou, Peoples R China

[3] Zhejiang Gongshang Univ, Coll Comp & Informat Engn, Hangzhou, Peoples R China

Source:

VISUAL COMPUTER | 2024 / Vol. 41 / Issue 1

Funding:

National Natural Science Foundation of China;

Keywords:

3D face reconstruction; Attention mechanism; Self-supervised; Attribute refinement; Deep learning; SHAPE;

DOI:

10.1007/s00371-024-03319-0

Chinese Library Classification (CLC) number:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Single-view 3D face reconstruction refers to recovering 3D information of a face, such as shape and texture, from a single image. With the wide application of deep learning in the image field, a number of studies have used it to learn the 3D shape and texture of a face from image information. In this paper, we propose a self-supervised single-image 3D face reconstruction method based on the attention mechanism and attribute refinement. The method incorporates the attention mechanism into the network structure, allowing feature extraction to fuse information from the channel domain and the spatial domain and thereby enhancing the feature extraction capability. Jointly applying 2D image-level supervision and supervision between 3D attributes allows the 3D model of the face to be learned better. On top of traditional 2D image supervision, we design a variety of loss functions that combine cycle consistency, interpolation consistency, and landmark consistency to realize 3D attribute-level supervision. To strengthen the representation of facial details, we propose an attribute refinement network that improves the model's ability to reconstruct details and makes the reconstruction results more realistic. Based on the symmetry of the face, we construct a deep learning network model that decouples the 3D information directly from the image, and finally realizes unsupervised 3D face reconstruction from a single image.

Pages: 209-227

Page count: 19
