3D hand mesh reconstruction from a monocular RGB image

Cited by: 9
Authors
Peng, Hao [1]
Xian, Chuhua [1]
Zhang, Yunbo [2]
Affiliations
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Rochester Inst Technol, Dept Ind & Syst Engn, New York, NY USA
Source
VISUAL COMPUTER | 2020 / Vol. 36 / Issue 10-12
Keywords
Image-based modeling; 3D hand mesh reconstruction; Hand dataset; Hand pose estimation;
DOI
10.1007/s00371-020-01908-3
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Most existing methods for 3D hand analysis from RGB images focus on estimating hand keypoints or poses, and therefore cannot capture the geometric details of the 3D hand shape. In this work, we propose a novel method to reconstruct a 3D hand mesh from a single monocular RGB image. Unlike current parameter-based or pose-based methods, our method directly estimates the 3D hand mesh with a graph convolutional neural network (GCN). Our network consists of two modules: a hand localization and mask generation module, and a 3D hand mesh reconstruction module. The first module, a VGG16-based network, localizes the hand region in the input image and generates a binary mask of the hand. The second module takes the high-order features from the first and uses a GCN-based network to estimate the coordinates of each vertex of the hand mesh and reconstruct the 3D hand shape. To achieve better accuracy, we propose a novel loss based on the differential properties of the discrete mesh. We also use professional software to create a large synthetic dataset that contains both ground-truth 3D hand meshes and poses for training. To handle real-world data, we use the CycleGAN network to transform real-world images into the domain of our synthetic dataset. We demonstrate that our method produces accurate 3D hand meshes and runs efficiently enough for real-time applications.
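The abstract does not spell out the GCN layer or the differential-property loss, so the following is only an illustrative sketch under stated assumptions, not the authors' implementation. It assumes a standard symmetric-normalized graph convolution over the mesh vertices and a uniform-weight (umbrella) Laplacian as one possible differential property of a discrete mesh. All function names are hypothetical, and every vertex is assumed to belong to at least one face.

```python
import numpy as np

def adjacency_from_faces(num_vertices, faces):
    """Build a symmetric 0/1 vertex adjacency matrix from triangle faces."""
    A = np.zeros((num_vertices, num_vertices))
    for a, b, c in faces:
        for i, j in ((a, b), (b, c), (c, a)):
            A[i, j] = 1.0
            A[j, i] = 1.0
    return A

def normalized_adjacency(A):
    """Assumed GCN propagation matrix: D^-1/2 (A + I) D^-1/2."""
    A_tilde = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_tilde.sum(axis=1))
    return A_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def graph_conv(X, A_hat, W):
    """One graph convolution: mix neighbor features, project, apply ReLU."""
    return np.maximum(A_hat @ X @ W, 0.0)

def laplacian_coords(V, A):
    """Uniform Laplacian: each vertex minus the mean of its neighbors."""
    deg = A.sum(axis=1, keepdims=True)  # assumes no isolated vertices
    return V - (A @ V) / deg

def laplacian_loss(V_pred, V_gt, A):
    """Mean squared difference between predicted and ground-truth
    Laplacian coordinates (an assumed stand-in for the paper's loss)."""
    diff = laplacian_coords(V_pred, A) - laplacian_coords(V_gt, A)
    return float(np.mean(np.sum(diff ** 2, axis=1)))

# Toy mesh: a tetrahedron with 4 vertices and 4 triangular faces.
faces = [(0, 1, 2), (0, 1, 3), (0, 2, 3), (1, 2, 3)]
V_gt = np.array([[0, 0, 0], [1, 0, 0], [0, 1, 0], [0, 0, 1]], dtype=float)
A = adjacency_from_faces(4, faces)
A_hat = normalized_adjacency(A)

# Per-vertex features (here just the coordinates) mapped to 8 channels.
rng = np.random.default_rng(0)
features = graph_conv(V_gt, A_hat, rng.normal(size=(3, 8)))
loss = laplacian_loss(V_gt * 1.1, V_gt, A)
```

A Laplacian-style term compares local surface shape rather than absolute position, so it is typically combined with a per-vertex coordinate loss; whether the paper does so is not stated in the abstract.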
Pages: 2227-2239
Page count: 13
Related papers
50 in total
  • [21] 3D Hand Shape and Pose Estimation from a Single RGB Image
    Ge, Liuhao
    Ren, Zhou
    Li, Yuncheng
    Xue, Zehao
    Wang, Yingying
    Cai, Jianfei
    Yuan, Junsong
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10825 - 10834
  • [22] 3D Reconstruction of Indoor Scenes Using RGB-D Monocular Vision
    Liu, Sanmao
    Zhu, Wenqiu
    Zhang, Canqing
    Sun, Wenjing
    [J]. 2016 INTERNATIONAL CONFERENCE ON ROBOTS & INTELLIGENT SYSTEM (ICRIS), 2016, : 1 - 7
  • [23] Physically Plausible 3D Human-Scene Reconstruction From Monocular RGB Image Using an Adversarial Learning Approach
    Biswas, Sandika
    Li, Kejie
    Banerjee, Biplab
    Chaudhuri, Subhasis
    Rezatofighi, Hamid
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6227 - 6234
  • [24] RGB-Fusion: Monocular 3D reconstruction with learned depth prediction
    Duan, ZhiMin
    Chen, YingWen
    Yu, HuJie
    Hu, BoWen
    Chen, Chen
    [J]. DISPLAYS, 2021, 70
  • [25] 3D Reconstruction of a Smooth Articulated Trajectory from a Monocular Image Sequence
    Park, Hyun Soo
    Sheikh, Yaser
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 201 - 208
  • [26] Latent Distribution-Based 3D Hand Pose Estimation From Monocular RGB Images
    Li, Moran
    Wang, Jialong
    Sang, Nong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4883 - 4894
  • [27] Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image
    Huang, Siyuan
    Qi, Siyuan
    Zhu, Yixin
    Xiao, Yinxue
    Xu, Yuanlu
    Zhu, Song-Chun
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 194 - 211
  • [28] RGB2Hands: Real-Time Tracking of 3D Hand Interactions from Monocular RGB Video
    Wang, Jiayi
    Mueller, Franziska
    Bernard, Florian
    Sorli, Suzanne
    Sotnychenko, Oleksandr
    Qian, Neng
    Otaduy, Miguel A.
    Casas, Dan
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (06):
  • [29] From D-RGB-based reconstruction toward a mesh deformation model for monocular reconstruction of isometric surfaces
    Hosseini, S. Jafar
    Araujo, Helder
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2016,
  • [30] From D-RGB-based reconstruction toward a mesh deformation model for monocular reconstruction of isometric surfaces
    S. Jafar Hosseini
    Helder Araujo
    [J]. EURASIP Journal on Image and Video Processing, 2016