Skeleton Transformer Networks: 3D Human Pose and Skinned Mesh from Single RGB Image

被引:8
|
作者
Yoshiyasu, Yusuke [1 ,2 ]
Sagawa, Ryusuke [1 ,2 ]
Ayusawa, Ko [1 ,2 ]
Murai, Akihiko [1 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan
[2] AIST, CNRS, UMI3218, JRL,RL, Tsukuba, Ibaraki, Japan
来源
关键词
Convolutional neural networks; 3D human pose; Skeleton;
D O I
10.1007/978-3-030-20870-7_30
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we present Skeleton Transformer Networks (SkeletonNet), an end-to-end framework that can predict not only 3D joint positions but also 3D angular pose (bone rotations) of a human skeleton from a single color image. This in turn allows us to generate skinned mesh animations. Here, we propose a two-step regression approach. The first step regresses bone rotations in order to obtain an initial solution by considering skeleton structure. The second step performs refinement based on heatmap regressor using a 3D pose representation called cross heatmap which stacks heatmaps of xy and zy coordinates. By training the network using the proposed 3D human pose dataset that is comprised of images annotated with 3D skeletal angular poses, we showed that SkeletonNet can predict a full 3D human pose (joint positions and bone rotations) from a single image in-the-wild.
引用
收藏
页码:485 / 500
页数:16
相关论文
共 50 条
  • [1] 3D Hand Shape and Pose Estimation from a Single RGB Image
    Ge, Liuhao
    Ren, Zhou
    Li, Yuncheng
    Xue, Zehao
    Wang, Yingying
    Cai, Jianfei
    Yuan, Junsong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10825 - 10834
  • [2] Deep 3D Pose Dictionary: 3D Human Pose Estimation from Single RGB Image Using Deep Convolutional Neural Network
    Elbasiony, Reda
    Gomaa, Walid
    Ogata, Tetsuya
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 310 - 320
  • [3] Accurate 3D hand mesh recovery from a single RGB image
    Pemasiri A.
    Nguyen K.
    Sridharan S.
    Fookes C.
    Scientific Reports, 12 (1)
  • [4] 3D interacting hand pose and shape estimation from a single RGB image
    Gao, Chengying
    Yang, Yujia
    Li, Wensheng
    NEUROCOMPUTING, 2022, 474 : 25 - 36
  • [5] Reweighted sparse representation with residual compensation for 3D human pose estimation from a single RGB image
    Jiang, Mengxi
    Yu, Zhuliang
    Zhang, Yan
    Wang, Qicong
    Li, Cuihua
    Lei, Yunqi
    NEUROCOMPUTING, 2019, 358 : 332 - 343
  • [6] 3D Human Skeleton Estimation from Monocular Single RGB Image based on Multiple Virtual-View Skeleton Generation
    Lie, Wen-Nung
    Vann, Veasna
    Aing, Lee
    Chiang, Jui-Chiu
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [7] A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image
    Jiang, Changlong
    Xiao, Yang
    Wu, Cunlin
    Zhang, Mingyang
    Zheng, Jinghong
    Cao, Zhiguo
    Zhou, Joey Tianyi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8846 - 8855
  • [8] Creating a 3D Mesh in A-pose from a Single Image for Character Rigging
    Lee, Seunghwan
    Liu, C. Karen
    ACM SIGGRAPH / EUROGRAPHICS SYMPOSIUM OF COMPUTER ANIMATION 2024, 2024,
  • [9] A hybrid network for estimating 3D interacting hand pose from a single RGB image
    Bao, Wenxia
    Gao, Qiuyue
    Yang, Xianjun
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3801 - 3814
  • [10] A hybrid network for estimating 3D interacting hand pose from a single RGB image
    Wenxia Bao
    Qiuyue Gao
    Xianjun Yang
    Signal, Image and Video Processing, 2024, 18 : 3801 - 3814