Neural Monocular 3D Human Motion Capture with Physical Awareness

Cited by: 50
Authors
Shimada, Soshi [1]
Golyanik, Vladislav [1]
Xu, Weipeng [2]
Perez, Patrick [3]
Theobalt, Christian [1]
Affiliations
[1] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
[2] Facebook Real Labs, Pittsburgh, PA USA
[3] Valeoai, Paris, France
Source
ACM TRANSACTIONS ON GRAPHICS | 2021 / Vol. 40 / No. 04
Keywords
Monocular 3D Human Motion Capture; Physical Awareness; Global; 3D; Physionical Approach; POSE;
DOI
10.1145/3450626.3459825
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
We present a new trainable system for physically plausible markerless 3D human motion capture, which achieves state-of-the-art results in a broad range of challenging scenarios. Unlike most neural methods for human motion capture, our approach, which we dub "physionical", is aware of physical and environmental constraints. It combines in a fully-differentiable way several key innovations, i.e., 1) a proportional-derivative controller, with gains predicted by a neural network, that reduces delays even in the presence of fast motions, 2) an explicit rigid body dynamics model and 3) a novel optimisation layer that prevents physically implausible foot-floor penetration as a hard constraint. The inputs to our system are 2D joint keypoints, which are canonicalised in a novel way so as to reduce the dependency on intrinsic camera parameters, both at training and test time. This enables more accurate global translation estimation without generalisability loss. Our model can be finetuned only with 2D annotations when the 3D annotations are not available. It produces smooth and physically-principled 3D motions at an interactive frame rate in a wide variety of challenging scenes, including newly recorded ones. Its advantages are especially noticeable on in-the-wild sequences that significantly differ from common 3D pose estimation benchmarks such as Human3.6M and MPI-INF-3DHP. Qualitative results are provided in the supplementary video.
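Note (not part of the abstract): the proportional-derivative controller named in item 1 can be illustrated with a textbook PD law on a single 1-DoF joint. This is only a minimal sketch under stated assumptions, not the paper's implementation. The gains `kp` and `kd` are fixed constants here, whereas the abstract says they are predicted by a neural network; the function names, the unit inertia, and the integration scheme are all hypothetical choices made for illustration.

```python
def pd_torque(q_target, q, q_dot, kp, kd):
    """Textbook PD law: tau = kp * (q_target - q) - kd * q_dot."""
    return kp * (q_target - q) - kd * q_dot


def simulate(q_target, kp, kd, steps=2000, dt=0.001, inertia=1.0):
    """Drive one 1-DoF joint toward q_target (assumed toy setup).

    Uses semi-implicit Euler integration; gains are constants here,
    not network outputs as in the abstract's description.
    """
    q, q_dot = 0.0, 0.0
    for _ in range(steps):
        tau = pd_torque(q_target, q, q_dot, kp, kd)
        q_dot += (tau / inertia) * dt  # integrate acceleration into velocity
        q += q_dot * dt                # integrate updated velocity into position
    return q


if __name__ == "__main__":
    # kp=100, kd=20 with unit inertia gives a critically damped response.
    print(simulate(q_target=1.0, kp=100.0, kd=20.0))
```

Higher gains make the joint follow the target more tightly but can overshoot or become unstable for a given time step, which is one reason per-frame gain selection matters for fast motions.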
Pages: 15
Related papers
50 in total
  • [1] Joint 3D Human Motion Capture and Physical Analysis from Monocular Videos
    Zell, Petrissa
    Wandt, Bastian
    Rosenhahn, Bodo
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017: 17-26
  • [2] MoCapDeform: Monocular 3D Human Motion Capture in Deformable Scenes
    Li, Zhi
    Shimada, Soshi
    Schiele, Bernt
    Theobalt, Christian
    Golyanik, Vladislav
    [J]. 2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022: 1-11
  • [3] 3D Human Motion Capture from Monocular Image Sequences
    Wandt, Bastian
    Ackermann, Hanno
    Rosenhahn, Bodo
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015
  • [4] Neural Capture of Animatable 3D Human from Monocular Video
    Te, Gusi
    Li, Xiu
    Li, Xiao
    Wang, Jinglu
    Hu, Wei
    Lu, Yan
    [J]. COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666: 275-291
  • [5] PhysCap: Physically Plausible Monocular 3D Motion Capture in Real Time
    Shimada, Soshi
    Golyanik, Vladislav
    Xu, Weipeng
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39(06)
  • [6] Efficient 3D recovery of human motion in monocular video
    Chen, Cheng
    Xiao, Jun
    Zhuang, Yueting
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2009, 21(08): 1118-1126
  • [7] 3D Human Motion Reconstruction in Unity with Monocular Camera
    Chen, Tai-Wei
    Lin, Wei-Liang
    [J]. 2020 17TH INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC 2020), 2020: 191-192
  • [8] Research on 3D Human Motion Capture Algorithm for Online Physical Education Teaching
    Li, Weiguo
    Yang, Yongli
    Zhou, Jing
    Li, Zhipeng
    [J]. IEIE Transactions on Smart Processing and Computing, 2023, 12(02): 97-106
  • [10] Dense 3D Motion Capture for Human Faces
    Furukawa, Yasutaka
    Ponce, Jean
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009: 1674-+