Skeleton Transformer Networks: 3D Human Pose and Skinned Mesh from Single RGB Image

Cited by: 8
Authors
Yoshiyasu, Yusuke [1, 2]
Sagawa, Ryusuke [1, 2]
Ayusawa, Ko [1, 2]
Murai, Akihiko [1]
Affiliations
[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan
[2] AIST, CNRS, UMI3218, JRL,RL, Tsukuba, Ibaraki, Japan
Source
Keywords
Convolutional neural networks; 3D human pose; Skeleton
DOI
10.1007/978-3-030-20870-7_30
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
In this paper, we present Skeleton Transformer Networks (SkeletonNet), an end-to-end framework that predicts not only 3D joint positions but also the 3D angular pose (bone rotations) of a human skeleton from a single color image. This in turn allows us to generate skinned mesh animations. We propose a two-step regression approach. The first step regresses bone rotations to obtain an initial solution that takes the skeleton structure into account. The second step refines this solution with a heatmap regressor, using a 3D pose representation called the cross heatmap, which stacks heatmaps of the xy and zy coordinates. By training the network on the proposed 3D human pose dataset, which comprises images annotated with 3D skeletal angular poses, we show that SkeletonNet can predict a full 3D human pose (joint positions and bone rotations) from a single in-the-wild image.
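As an illustration of the cross heatmap idea mentioned in the abstract, the following is a minimal sketch of how a 3D joint location could be read from an xy heatmap and a zy heatmap. It is not the authors' implementation: the function names `decode_cross_heatmap` and `make_gaussian`, the 64 x 64 grid size, the argmax decoding, and the averaging of the two y estimates are all assumptions made for this example.

```python
import numpy as np

def make_gaussian(shape, center_row, center_col, sigma=1.5):
    """Hypothetical helper: a synthetic 2D Gaussian heatmap peaked at (center_row, center_col)."""
    rows, cols = np.indices(shape)
    return np.exp(-((rows - center_row) ** 2 + (cols - center_col) ** 2) / (2 * sigma ** 2))

def decode_cross_heatmap(h_xy, h_zy):
    """Read (x, y, z) grid indices for one joint from two stacked heatmaps.

    h_xy: array indexed [y, x]; h_zy: array indexed [y, z].
    Argmax decoding is an assumption of this sketch, not taken from the paper.
    """
    y_from_xy, x = np.unravel_index(np.argmax(h_xy), h_xy.shape)
    y_from_zy, z = np.unravel_index(np.argmax(h_zy), h_zy.shape)
    # The y axis is shared by both maps; averaging the two estimates is an assumption.
    y = 0.5 * (y_from_xy + y_from_zy)
    return float(x), float(y), float(z)

# Synthetic joint at x=33, y=20, z=10 on an assumed 64 x 64 grid.
h_xy = make_gaussian((64, 64), center_row=20, center_col=33)
h_zy = make_gaussian((64, 64), center_row=20, center_col=10)
cross = np.stack([h_xy, h_zy])  # the two maps stacked as channels
print(decode_cross_heatmap(cross[0], cross[1]))  # (33.0, 20.0, 10.0)
```

Because both maps share the y axis, two 2D heatmaps per joint suffice to locate it in 3D, which avoids the memory cost of a dense volumetric heatmap.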
Pages: 485-500
Number of pages: 16
Related papers
50 in total
  • [41] Dense 3D Face Reconstruction from a Single RGB Image
    Mao, Jianxu
    Zhang, Yifeng
    Liu, Caiping
    Tao, Ziming
    Yi, Junfei
    Wang, Yaonan
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING, CSE, 2022, : 24 - 31
  • [43] 3D hand pose estimation from a single RGB image through semantic decomposition of VAE latent space
    Guo, Xinru
    Xu, Song
    Lin, Xiangbo
    Sun, Yi
    Ma, Xiaohong
    PATTERN ANALYSIS AND APPLICATIONS, 2022, 25 (01) : 157 - 167
  • [44] Panoptic 3D Scene Reconstruction From a Single RGB Image
    Dahnert, Manuel
    Hou, Ji
    Niessner, Matthias
    Dai, Angela
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [45] VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera
    Mehta, Dushyant
    Sridhar, Srinath
    Sotnychenko, Oleksandr
    Rhodin, Helge
    Shafiei, Mohammad
    Seidel, Hans-Peter
    Xu, Weipeng
    Casas, Dan
    Theobalt, Christian
    ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
  • [46] POSE-HMR: HEURISTIC TRANSFORMER WITH POSTURAL PRIOR CONSTRAINTS FOR 3D HUMAN MESH RECONSTRUCTION
    Pan, Songqi
    Liu, Sheng
    Feng, Yuan
    Zhang, Yineng
    Tian, Xiaopeng
    Yang, Jiantao
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2835 - 2839
  • [47] Learning Joint Twist Rotation for 3D Human Pose Estimation from a Single Image
    Nakatsuka, Chihiro
    Xu, Jianfeng
    Tasaka, Kazuyuki
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 379 - 386
  • [48] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image
    Bogo, Federica
    Kanazawa, Angjoo
    Lassner, Christoph
    Gehler, Peter
    Romero, Javier
    Black, Michael J.
    COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 561 - 578
  • [49] 3D human pose and shape estimation with dense correspondence from a single depth image
    Wang, Kangkan
    Zhang, Guofeng
    Yang, Jian
    VISUAL COMPUTER, 2023, 39 (01) : 429 - 441
  • [50] Accurate 3D Pose Estimation From a Single Depth Image
    Ye, Mao
    Wang, Xianwang
    Yang, Ruigang
    Ren, Liu
    Pollefeys, Marc
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 731 - 738