PhysCap: Physically Plausible Monocular 3D Motion Capture in Real Time

被引:106
|
作者
Shimada, Soshi [1 ]
Golyanik, Vladislav [1 ]
Xu, Weipeng [2 ]
Theobalt, Christian [1 ]
机构
[1] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
[2] Facebook Real Labs, Pittsburgh, PA USA
来源
ACM TRANSACTIONS ON GRAPHICS | 2020年 / 39卷 / 06期
基金
欧洲研究理事会; 欧盟地平线“2020”;
关键词
Monocular Motion Capture; Physics-Based Constraints; Real Time; Human Body; Global; 3D;
D O I
10.1145/3414685.3417877
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Marker-less 3D human motion capture from a single colour camera has seen significant progress. However, it is a very challenging and severely ill-posed problem. In consequence, even the most accurate state-of-the-art approaches have significant limitations. Purely kinematic formulations on the basis of individual joints or skeletons, and the frequent frame-wise reconstruction in state-of-the-art methods greatly limit 3D accuracy and temporal stability compared to multi-view or marker-based motion capture. Further, captured 3D poses are often physically incorrect and biomechanically implausible, or exhibit implausible environment interactions (floor penetration, foot skating, unnatural body leaning and strong shifting in depth), which is problematic for any use case in computer graphics. We, therefore, present PhysCap, the first algorithm for physically plausible, real-time and marker-less human 3D motion capture with a single colour camera at 25 fps. Our algorithm first captures 3D human poses purely kinematically. To this end, a CNN infers 2D and 3D joint positions, and subsequently, an inverse kinematics step finds space-time coherent joint angles and global 3D pose. Next, these kinematic reconstructions are used as constraints in a real-time physics-based pose optimiser that accounts for environment constraints (e.g., collision handling and floor placement), gravity, and biophysical plausibility of human postures. Our approach employs a combination of ground reaction force and residual force for plausible root control, and uses a trained neural network to detect foot contact events in images. Our method captures physically plausible and temporally stable global 3D human motion, without physically implausible postures, floor penetrations or foot skating, from video in real time and in general scenes. PhysCap achieves state-of-the-art accuracy on established pose benchmarks, and we propose new metrics to demonstrate the improved physical plausibility and temporal stability.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Monocular 3D Object Detection with Depth from Motion
    Wang, Tai
    Pang, Jiangmiao
    Lin, Dahua
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 386 - 403
  • [42] 3D Motion and Skeleton Construction from Monocular Video
    Azmi, Nik Mohammad Wafiy
    Albakri, Ikmal Faiq
    Suaib, Norhaida Mohd
    Rahim, Mohd Shafry Mohd
    Yu, Hongchuan
    COMPUTATIONAL SCIENCE AND TECHNOLOGY (ICCST 2019), 2020, 603 : 75 - 84
  • [43] Monocular lie algebra approach for 3D motion estimation
    Ortegon-Aguilar, J
    Bayro-Corrochano, E
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 200 - 203
  • [44] Low-cost 3D motion capture system using passive optical markers and monocular vision
    Lee, Yeonkyung
    Yoo, Hoon
    OPTIK, 2017, 130 : 1397 - 1407
  • [45] Marker-Less 3D Human Motion Capture with Monocular Image Sequence and Height-Maps
    Du, Yu
    Wong, Yongkang
    Liu, Yonghao
    Han, Feilin
    Gui, Yilin
    Wang, Zhen
    Kankanhalli, Mohan
    Geng, Weidong
    COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 20 - 36
  • [46] SportsCap: Monocular 3D Human Motion Capture and Fine-Grained Understanding in Challenging Sports Videos
    Xin Chen
    Anqi Pang
    Wei Yang
    Yuexin Ma
    Lan Xu
    Jingyi Yu
    International Journal of Computer Vision, 2021, 129 : 2846 - 2864
  • [47] TravelNet: Self-supervised Physically Plausible Hand Motion Learning from Monocular Color Images
    Zhao, Zimeng
    Zhao, Xi
    Wang, Yangang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11646 - 11656
  • [48] Real-time motion tracking using 3D ultrasound
    Xu, Sheng
    Kruecker, Jochen
    Settlemier, Scott
    Wood, Bradford J.
    MEDICAL IMAGING 2007: VISUALIZATION AND IMAGE-GUIDED PROCEDURES, PTS 1 AND 2, 2007, 6509
  • [49] Development of real-time motion capture system for 3D on-line games linked with virtual character
    Kim, JH
    Ryu, YK
    Cho, HS
    MACHINE VISION AND ITS OPTOMECHATRONIC APPLICATIONS, 2004, 5603 : 243 - 252
  • [50] Invariant Trajectory Indexing for Real Time 3D Motion Recognition
    Yang, Jianyu
    Li, Y. F.
    Wang, Keyi
    2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011, : 3440 - 3445