Skeleton Transformer Networks: 3D Human Pose and Skinned Mesh from Single RGB Image

Cited by: 8

Authors:

Yoshiyasu, Yusuke [1,2]

Sagawa, Ryusuke [1,2]

Ayusawa, Ko [1,2]

Murai, Akihiko [1]

Affiliations:

[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan

[2] AIST, CNRS, UMI3218, JRL,RL, Tsukuba, Ibaraki, Japan

Source:

COMPUTER VISION - ACCV 2018, PT IV | 2019, Vol. 11364

Keywords:

Convolutional neural networks; 3D human pose; Skeleton

DOI:

10.1007/978-3-030-20870-7_30

Chinese Library Classification (CLC):

TP31 [Computer software]

Discipline classification codes:

081202; 0835

Abstract:

In this paper, we present Skeleton Transformer Networks (SkeletonNet), an end-to-end framework that predicts not only 3D joint positions but also the 3D angular pose (bone rotations) of a human skeleton from a single color image. This in turn allows us to generate skinned mesh animations. We propose a two-step regression approach. The first step regresses bone rotations, taking the skeleton structure into account, to obtain an initial solution. The second step refines this solution with a heatmap regressor that uses a 3D pose representation called the cross heatmap, which stacks heatmaps of the xy and zy coordinates. By training the network on the proposed 3D human pose dataset, which consists of images annotated with 3D skeletal angular poses, we show that SkeletonNet can predict a full 3D human pose (joint positions and bone rotations) from a single in-the-wild image.
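The abstract relates bone rotations to 3D joint positions through the skeleton structure. As a rough illustration of that relationship only, the following sketch applies standard forward kinematics to a joint chain. It is not the authors' network or code; the function names, the joint ordering, and the convention that each offset is expressed in the parent joint's frame are all assumptions made for this example.

```python
import math

def rot_z(theta):
    """3x3 rotation matrix about the z axis (helper for the example)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def matvec(a, v):
    """Product of a 3x3 matrix and a 3-vector."""
    return [sum(a[i][k] * v[k] for k in range(3)) for i in range(3)]

def forward_kinematics(parents, offsets, local_rots):
    """Compute 3D joint positions from per-joint local rotations.

    parents[i]    index of the parent joint, -1 for the root
                  (assumed: every parent appears before its children)
    offsets[i]    rest-pose offset of joint i, in its parent's frame
    local_rots[i] 3x3 rotation of joint i relative to its parent
    """
    n = len(parents)
    global_rots = [None] * n
    positions = [None] * n
    for i in range(n):
        p = parents[i]
        if p < 0:
            # Root joint: its offset is its position.
            global_rots[i] = local_rots[i]
            positions[i] = list(offsets[i])
        else:
            # Accumulate rotations down the chain, then place the joint
            # by rotating its offset with the parent's global rotation.
            global_rots[i] = matmul(global_rots[p], local_rots[i])
            step = matvec(global_rots[p], offsets[i])
            positions[i] = [positions[p][k] + step[k] for k in range(3)]
    return positions

# Hypothetical three-joint chain with unit-length bones along x.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
chain = forward_kinematics(
    parents=[-1, 0, 1],
    offsets=[[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [1.0, 0.0, 0.0]],
    local_rots=[rot_z(math.pi / 2), identity, identity],
)
```

Rotating the root by 90 degrees about z swings the whole chain onto the y axis, so the end joint lands near (0, 2, 0) up to floating-point error. This shows why rotations, unlike joint positions alone, are enough to drive a skinned mesh.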


Pages: 485-500

Page count: 16
