Skeleton Transformer Networks: 3D Human Pose and Skinned Mesh from Single RGB Image

被引：8

作者：

Yoshiyasu, Yusuke ^{[1
,2
]}

Sagawa, Ryusuke ^{[1
,2
]}

Ayusawa, Ko ^{[1
,2
]}

Murai, Akihiko ^{[1
]}

机构：

[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan

[2] AIST, CNRS, UMI3218, JRL,RL, Tsukuba, Ibaraki, Japan

来源：

COMPUTER VISION - ACCV 2018, PT IV | 2019年 / 11364卷

关键词：

Convolutional neural networks; 3D human pose; Skeleton;

D O I：

10.1007/978-3-030-20870-7_30

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, we present Skeleton Transformer Networks (SkeletonNet), an end-to-end framework that can predict not only 3D joint positions but also 3D angular pose (bone rotations) of a human skeleton from a single color image. This in turn allows us to generate skinned mesh animations. Here, we propose a two-step regression approach. The first step regresses bone rotations in order to obtain an initial solution by considering skeleton structure. The second step performs refinement based on heatmap regressor using a 3D pose representation called cross heatmap which stacks heatmaps of xy and zy coordinates. By training the network using the proposed 3D human pose dataset that is comprised of images annotated with 3D skeletal angular poses, we showed that SkeletonNet can predict a full 3D human pose (joint positions and bone rotations) from a single image in-the-wild.

引用

页码：485 / 500

页数：16

共 50 条

[41] Dense 3D Face Reconstruction from a Single RGB Image
Mao, Jianxu
Zhang, Yifeng
Liu, Caiping
Tao, Ziming
Yi, Junfei
Wang, Yaonan
2022 IEEE 25TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING, CSE, 2022, : 24 - 31
[42] 3D hand pose estimation from a single RGB image through semantic decomposition of VAE latent space
Xinru Guo
Song Xu
Xiangbo Lin
Yi Sun
Xiaohong Ma
Pattern Analysis and Applications, 2022, 25 : 157 - 167
[43] 3D hand pose estimation from a single RGB image through semantic decomposition of VAE latent space
Guo, Xinru
Xu, Song
Lin, Xiangbo
Sun, Yi
Ma, Xiaohong
PATTERN ANALYSIS AND APPLICATIONS, 2022, 25 (01) : 157 - 167
[44] Panoptic 3D Scene Reconstruction From a Single RGB Image
Dahnert, Manuel
Hou, Ji
Niessner, Matthias
Dai, Angela
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[45] VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera
Mehta, Dushyant
Sridhar, Srinath
Sotnychenko, Oleksandr
Rhodin, Helge
Shafiei, Mohammad
Seidel, Hans-Peter
Xu, Weipeng
Casas, Dan
Theobalt, Christian
ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):
[46] POSE-HMR: HEURISTIC TRANSFORMER WITH POSTURAL PRIOR CONSTRAINTS FOR 3D HUMAN MESH RECONSTRUCTION
Pan, Songqi
Liu, Sheng
Feng, Yuan
Zhang, Yineng
Tian, Xiaopeng
Yang, Jiantao
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2835 - 2839
[47] Learning Joint Twist Rotation for 3D Human Pose Estimation from a Single Image
Nakatsuka, Chihiro
Xu, Jianfeng
Tasaka, Kazuyuki
VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 379 - 386
[48] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image
Bogo, Federica
Kanazawa, Angjoo
Lassner, Christoph
Gehler, Peter
Romero, Javier
Black, Michael J.
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 561 - 578
[49] 3D human pose and shape estimation with dense correspondence from a single depth image
Wang, Kangkan
Zhang, Guofeng
Yang, Jian
VISUAL COMPUTER, 2023, 39 (01): : 429 - 441
[50] Accurate 3D Pose Estimation From a Single Depth Image
Ye, Mao
Wang, Xianwang
Yang, Ruigang
Ren, Liu
Pollefeys, Marc
2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 731 - 738

← 1 2 3 4 5 →