Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video

被引:241
|
作者
Zhou, Xiaowei [1 ]
Zhu, Menglong [1 ]
Leonardos, Spyridon [1 ]
Derpanis, Konstantinos G. [2 ]
Daniilidis, Kostas [1 ]
机构
[1] Univ Penn, Philadelphia, PA 19104 USA
[2] Ryerson Univ, Toronto, ON, Canada
基金
美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
RECONSTRUCTION;
D O I
10.1109/CVPR.2016.537
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the challenge of 3D full-body human pose estimation from a monocular image sequence. Here, two cases are considered: (i) the image locations of the human joints are provided and (ii) the image locations of joints are unknown. In the former case, a novel approach is introduced that integrates a sparsity-driven 3D geometric prior and temporal smoothness. In the latter case, the former case is extended by treating the image locations of the joints as latent variables to take into account considerable uncertainties in 2D joint locations. A deep fully convolutional network is trained to predict the uncertainty maps of the 2D joint locations. The 3D pose estimates are realized via an Expectation-Maximization algorithm over the entire sequence, where it is shown that the 2D joint location uncertainties can be conveniently marginalized out during inference. Empirical evaluation on the Human3.6M dataset shows that the proposed approaches achieve greater 3D pose estimation accuracy over state-of-the-art baselines. Further, the proposed approach outperforms a publicly available 2D pose estimation baseline on the challenging PennAction dataset.
引用
收藏
页码:4966 / 4975
页数:10
相关论文
共 50 条
  • [1] Uncertainty-Aware 3D Human Pose Estimation from Monocular Video
    Zhang, Jinlu
    Chen, Yujin
    Tu, Zhigang
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5102 - 5113
  • [2] A survey on monocular 3D human pose estimation
    Ji, Xiaopeng
    Fang, Qi
    Dong, Junting
    Shuai, Qing
    Jiang, Wen
    Zhou, Xiaowei
    [J]. Virtual Reality and Intelligent Hardware, 2020, 2 (06): : 471 - 500
  • [3] MONOCULAR 3D HUMAN POSE ESTIMATION BY CLASSIFICATION
    Greif, Thomas
    Lienhart, Rainer
    Sengupta, Debabrata
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [4] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Liu, Shuangjun
    Sehgal, Naveen
    Ostadabbas, Sarah
    [J]. APPLIED INTELLIGENCE, 2022, 52 (12) : 14491 - 14506
  • [5] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Shuangjun Liu
    Naveen Sehgal
    Sarah Ostadabbas
    [J]. Applied Intelligence, 2022, 52 : 14491 - 14506
  • [6] Model-Based 3D Hand Pose Estimation from Monocular Video
    de La Gorce, Martin
    Fleet, David J.
    Paragios, Nikos
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (09) : 1793 - 1805
  • [7] Generalizing Monocular 3D Human Pose Estimation in the Wild
    Wang, Luyang
    Chen, Yan
    Guo, Zhenhua
    Qian, Keyuan
    Lin, Mude
    Li, Hongsheng
    Ren, Jimmy S.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4024 - 4033
  • [8] HPOF: 3D Human Pose Recovery from Monocular Video with Optical Flow
    Ji, Bin
    Yang, Chen
    Yao, Shunyu
    Pan, Ye
    [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 144 - 154
  • [9] Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video
    Wei, Wen-Li
    Lin, Jen-Chun
    Liu, Tyng-Luh
    Liao, Hong-Yuan Mark
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13201 - 13210
  • [10] Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video
    Institute of Information Science, Academia Sinica, Taiwan
    不详
    [J]. Proc IEEE Comput Soc Conf Comput Vision Pattern Recognit, (13201-13210):