Skeleton Transformer Networks: 3D Human Pose and Skinned Mesh from Single RGB Image

Cited by: 8
Authors
Yoshiyasu, Yusuke [1, 2]
Sagawa, Ryusuke [1, 2]
Ayusawa, Ko [1, 2]
Murai, Akihiko [1 ]
Affiliations
[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan
[2] AIST, CNRS, UMI3218, JRL,RL, Tsukuba, Ibaraki, Japan
Keywords
Convolutional neural networks; 3D human pose; Skeleton
DOI
10.1007/978-3-030-20870-7_30
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
In this paper, we present Skeleton Transformer Networks (SkeletonNet), an end-to-end framework that predicts not only 3D joint positions but also the 3D angular pose (bone rotations) of a human skeleton from a single color image. This in turn allows us to generate skinned mesh animations. We propose a two-step regression approach. The first step regresses bone rotations to obtain an initial solution that respects the skeleton structure. The second step refines this solution with a heatmap regressor, using a 3D pose representation called the cross heatmap, which stacks heatmaps of the xy and zy coordinates. By training the network on the proposed 3D human pose dataset, which consists of images annotated with 3D skeletal angular poses, we show that SkeletonNet can predict a full 3D human pose (joint positions and bone rotations) from a single in-the-wild image.
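As a rough illustration of the cross heatmap representation mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes a single joint given in integer grid coordinates, Gaussian-shaped heatmaps, and argmax decoding; the function names (`gaussian_map`, `encode_cross_heatmap`, `decode_cross_heatmap`), the grid size, and `sigma` are all hypothetical choices made for this example.

```python
import math


def gaussian_map(size, center_col, center_row, sigma=1.0):
    """Return a size x size Gaussian heatmap peaked at (center_col, center_row)."""
    return [
        [
            math.exp(-((c - center_col) ** 2 + (r - center_row) ** 2) / (2 * sigma ** 2))
            for c in range(size)
        ]
        for r in range(size)
    ]


def encode_cross_heatmap(joint, size, sigma=1.0):
    """Encode one joint (x, y, z) as two stacked 2D heatmaps (assumed layout).

    Channel 0 is the xy heatmap (columns = x, rows = y).
    Channel 1 is the zy heatmap (columns = z, rows = y).
    """
    x, y, z = joint
    return [gaussian_map(size, x, y, sigma), gaussian_map(size, z, y, sigma)]


def argmax_2d(heatmap):
    """Return (col, row) of the largest value in a 2D list."""
    best_val, best_col, best_row = float("-inf"), 0, 0
    for r, row in enumerate(heatmap):
        for c, val in enumerate(row):
            if val > best_val:
                best_val, best_col, best_row = val, c, r
    return best_col, best_row


def decode_cross_heatmap(stack):
    """Recover (x, y, z) from the stacked xy and zy heatmaps.

    y appears in both channels, so the two estimates are averaged
    (an assumption of this sketch, not a detail from the paper).
    """
    xy_map, zy_map = stack
    x, y_from_xy = argmax_2d(xy_map)
    z, y_from_zy = argmax_2d(zy_map)
    return x, (y_from_xy + y_from_zy) / 2, z


if __name__ == "__main__":
    stack = encode_cross_heatmap((3, 5, 7), size=16)
    print(decode_cross_heatmap(stack))
```

Under these assumptions, two 2D maps stand in for a full 3D volume: storage grows with the square of the grid size rather than its cube, while the shared y axis ties the two views together.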
Pages: 485-500
Page count: 16
Related papers
50 in total
  • [31] Learning to Estimate 3D Human Pose and Shape from a Single Color Image
    Pavlakos, Georgios
    Zhu, Luyang
    Zhou, Xiaowei
    Daniilidis, Kostas
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 459 - 468
  • [32] 3D human pose estimation from a single image via exemplar augmentation
    Yang, Jingjing
    Wan, Lili
    Xu, Wanru
    Wang, Shenghui
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 371 - 379
  • [33] Deformable Mesh Transformer for 3D Human Mesh Recovery
    Yoshiyasu, Yusuke
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17006 - 17015
  • [34] Joint-wise 2D to 3D lifting for hand pose estimation from a single RGB image
    Chen, Zheng
    Sun, Yi
    APPLIED INTELLIGENCE, 2023, 53 (06) : 6421 - 6431
  • [35] Hand Pose Estimation from a Single RGB-D Image
    Kuznetsova, Alina
    Rosenhahn, Bodo
    ADVANCES IN VISUAL COMPUTING, PT II, 2013, 8034 : 592 - 602
  • [36] Joint-wise 2D to 3D lifting for hand pose estimation from a single RGB image
    Zheng Chen
    Yi Sun
    Applied Intelligence, 2023, 53 : 6421 - 6431
  • [37] Template based Human Pose and Shape Estimation from a Single RGB-D Image
    Li, Zhongguo
    Heyden, Anders
    Oskarsson, Magnus
    ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 574 - 581
  • [38] Review on 3D Hand Pose Estimation Based on a RGB Image
    Xiao Y.
    Liu Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (02) : 161 - 172
  • [39] 3D Human Skeleton Estimation Based on RGB Image Sequence and Graph Convolution Network
    Lie, Wen-Nung
    Yang, Pei-Hsuan
    Vann, Veasna
    Chiang, Jui-Chiu
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [40] 3D Room Layout Estimation From a Single RGB Image
    Yan, Chenggang
    Shao, Biyao
    Zhao, Hao
    Ning, Ruixin
    Zhang, Yongdong
    Xu, Feng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 3014 - 3024