Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement

Cited by: 1
Authors
Qin, Xujia [1]
Li, Xinyu [1]
Li, Mengjia [1]
Zheng, Hongbo [1]
Xu, Xiaogang [2,3]
Affiliations
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Zhejiang Lab, Inst Artificial Intelligence, Hangzhou, Peoples R China
[3] Zhejiang Gongshang Univ, Coll Comp & Informat Engn, Hangzhou, Peoples R China
Source
VISUAL COMPUTER | 2024 / Vol. 41 / Issue 1
Funding
National Natural Science Foundation of China
Keywords
3D face reconstruction; Attention mechanism; Self-supervised; Attribute refinement; Deep learning; SHAPE
DOI
10.1007/s00371-024-03319-0
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
Single-view 3D face reconstruction recovers 3D information about a face, such as shape and texture, from a single image. With deep learning now widely applied to images, many studies have used it to learn the 3D shape and texture of a face from image information. In this paper, we propose a self-supervised single-image 3D face reconstruction method based on an attention mechanism and attribute refinement. The network incorporates an attention mechanism so that feature extraction fuses channel-domain and spatial-domain information, which strengthens its feature extraction capability. Combining 2D image-level supervision with supervision between 3D attributes helps the network learn the 3D face model better. On top of conventional 2D image supervision, we design several loss functions that combine cycle consistency, interpolation consistency, and landmark consistency to provide 3D attribute-level supervision. To better capture facial detail, we propose an attribute refinement network that improves the model's ability to reconstruct details and makes the results more realistic. Exploiting the symmetry of the face, we construct a deep learning network that decouples 3D information directly from the image and ultimately achieves unsupervised 3D face reconstruction from a single image.
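The abstract says the network fuses channel-domain and spatial-domain information but does not specify the attention module. As a rough illustration of the general idea only, the sketch below applies a parameter-free channel gate followed by a spatial gate to a small feature map. The function name `channel_spatial_attention`, the pooling choices, and the absence of learned weights are all assumptions made for this example; they are not taken from the paper.

```python
import math


def sigmoid(x):
    """Logistic function, used to squash pooled statistics into (0, 1) gates."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_spatial_attention(feat):
    """Reweight a feature map given as a nested list shaped [C][H][W].

    Hypothetical, simplified variant: real attention modules usually learn
    the gates with small networks; here the gates come from plain averages.
    """
    C, H, W = len(feat), len(feat[0]), len(feat[0][0])

    # Channel domain: one gate per channel, from that channel's global average.
    ch_gate = [
        sigmoid(sum(sum(row) for row in feat[c]) / (H * W)) for c in range(C)
    ]
    gated = [
        [[feat[c][i][j] * ch_gate[c] for j in range(W)] for i in range(H)]
        for c in range(C)
    ]

    # Spatial domain: one gate per location, from the mean across channels.
    sp_gate = [
        [sigmoid(sum(gated[c][i][j] for c in range(C)) / C) for j in range(W)]
        for i in range(H)
    ]
    return [
        [[gated[c][i][j] * sp_gate[i][j] for j in range(W)] for i in range(H)]
        for c in range(C)
    ]
```

Because both gates lie strictly between 0 and 1, the output keeps the shape and sign of the input while shrinking each value according to how strongly its channel and its location respond.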
Pages: 209-227
Page count: 19
Related papers
50 in total
  • [41] Consistent 3D Hand Reconstruction in Video via Self-Supervised Learning
    Tu, Zhigang
    Huang, Zhisheng
    Chen, Yujin
    Kang, Di
    Bao, Linchao
    Yang, Bisheng
    Yuan, Junsong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9469 - 9485
  • [42] Attention-based 3D Object Reconstruction from a Single Image
    Salvi, Andrey
    Gavenski, Nathan
    Pooch, Eduardo
    Tasoniero, Felipe
    Barros, Rodrigo
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [43] IMAGE BASED 3D FACE RECONSTRUCTION: A SURVEY
    Stylianou, Georgios
    Lanitis, Andreas
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2009, 9 (02) : 217 - 250
  • [44] A Geometric Constraints Based Single-image Reconstruction Method
    Wan, Fang
    Yang, R.
    ADVANCES IN MECHATRONICS, AUTOMATION AND APPLIED INFORMATION TECHNOLOGIES, PTS 1 AND 2, 2014, 846-847 : 1320 - 1325
  • [45] Curriculum Self-Supervised Learning for 3D CT Cardiac Image Segmentation
    Taher, Mohammad Reza Hosseinzadeh
    Ikuta, Masaki
    Soni, Ravi
    MACHINE LEARNING FOR HEALTH, ML4H, VOL 225, 2023, 225 : 145 - 156
  • [46] 3D Face Reconstruction from a Single 2D Face Image
    Park, Sung Won
    Heo, Jingu
    Savvides, Marios
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 1280 - 1287
  • [47] LOW-FREQUENCY GUIDED SELF-SUPERVISED LEARNING FOR HIGH-FIDELITY 3D FACE RECONSTRUCTION IN THE WILD
    Wang, Pengrui
    Lin, Chunze
    Xu, Bo
    Che, Wujun
    Wang, Quan
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [48] Motion Guided Attention Learning for Self-Supervised 3D Human Action Recognition
    Yang, Yang
    Liu, Guangjun
    Gao, Xuehao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8623 - 8634
  • [49] Attention-guided mask learning for self-supervised 3D action recognition
    Zhang, Haoyuan
    COMPLEX & INTELLIGENT SYSTEMS, 2024, : 7487 - 7496
  • [50] Improving ultrasound tongue image reconstruction from lip images using self-supervised learning and attention mechanism
    Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan
    Unknown
    arXiv,