MOVIN: Real-time Motion Capture using a Single LiDAR

被引:0
|
作者
Jang, Deok-Kyeong [1 ,2 ]
Yang, Dongseok [1 ,2 ]
Jang, Deok-Yun [1 ,3 ]
Choi, Byeoli [1 ,2 ]
Jin, Taeil [2 ]
Lee, Sung-Hee [2 ]
机构
[1] MOVIN Inc, Santa Clara, CA USA
[2] Korea Adv Inst Sci & Technol KAIST, Daejeon, South Korea
[3] Gwangju Inst Sci & Technol GIST, Gwangju, South Korea
关键词
<bold>CCS Concepts</bold>; center dot <bold>Computing methodologies</bold> -> <bold>Motion capture</bold>; <bold>Motion processing</bold>; <bold>Neural networks</bold>; HUMAN POSE ESTIMATION;
D O I
10.1111/cgf.14961
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recent advancements in technology have brought forth new forms of interactive applications, such as the social metaverse, where end users interact with each other through their virtual avatars. In such applications, precise full-body tracking is essential for an immersive experience and a sense of embodiment with the virtual avatar. However, current motion capture systems are not easily accessible to end users due to their high cost, the requirement for special skills to operate them, or the discomfort associated with wearable devices. In this paper, we present MOVIN, the data-driven generative method for real-time motion capture with global tracking, using a single LiDAR sensor. Our autoregressive conditional variational autoencoder (CVAE) model learns the distribution of pose variations conditioned on the given 3D point cloud from LiDAR. As a central factor for high-accuracy motion capture, we propose a novel feature encoder to learn the correlation between the historical 3D point cloud data and global, local pose features, resulting in effective learning of the pose prior. Global pose features include root translation, rotation, and foot contacts, while local features comprise joint positions and rotations. Subsequently, a pose generator takes into account the sampled latent variable along with the features from the previous frame to generate a plausible current pose. Our framework accurately predicts the performer's 3D global information and local joint details while effectively considering temporally coherent movements across frames. We demonstrate the effectiveness of our architecture through quantitative and qualitative evaluations, comparing it against state-of-the-art methods. Additionally, we implement a real-time application to showcase our method in real-world scenarios.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Experimental Investigations into Using Motion Capture State Feedback for Real-Time Control of a Humanoid Robot
    Popescu, Mihaela
    Mronga, Dennis
    Bergonzani, Ivan
    Kumar, Shivesh
    Kirchner, Frank
    SENSORS, 2022, 22 (24)
  • [32] A specialized motion capture system for real-time analysis of mandibular movements using infrared cameras
    Daniel Antônio Furtado
    Adriano Alves Pereira
    Adriano de Oliveira Andrade
    Douglas Peres Bellomo
    Marlete Ribeiro da Silva
    BioMedical Engineering OnLine, 12
  • [33] Real-time and markerless 3D human motion capture using multiple views
    Michoud, Brice
    Guillou, Erwan
    Bouakaz, Saieda
    HUMAN MOTION - UNDERSTANDING, MODELING, CAPTURE AND ANIMATION, PROCEEDINGS, 2007, 4814 : 88 - +
  • [34] Virtual Reality-Based Gymnastics Visualization Using Real-Time Motion Capture Suit
    Artlip, Michael
    Chen, Jiangong
    Li, Bin
    2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022), 2022, : 728 - 729
  • [35] Real-time marker-free motion capture system using blob feature analysis
    Park, CJ
    Kim, SE
    Kim, HS
    Lee, IH
    Real-Time Imaging IX, 2005, 5671 : 237 - 246
  • [36] Comparing Real-time Human Motion Capture System using Inertial Sensors with Microsoft Kinect
    Xiang, Chengkai
    Hsu, Hui-Huang
    Hwang, Wu-Yuin
    Ma, Jianhua
    2014 7TH INTERNATIONAL CONFERENCE ON UBI-MEDIA COMPUTING AND WORKSHOPS (UMEDIA), 2014, : 53 - 58
  • [37] Real-time marker prediction and CoR estimation in optical motion capture
    Aristidou, Andreas
    Lasenby, Joan
    VISUAL COMPUTER, 2013, 29 (01): : 7 - 26
  • [38] Multiresolution coding of motion capture data for real-time multimedia applications
    Murtaza Ali Khan
    Multimedia Tools and Applications, 2017, 76 : 16683 - 16698
  • [39] A constrained inverse kinematics technique for real-time motion capture animation
    Tang, W
    Cavazza, M
    Mountain, D
    Earnshaw, R
    VISUAL COMPUTER, 1999, 15 (7-8): : 413 - 425
  • [40] A constrained inverse kinematics technique for real-time motion capture animation
    Wen Tang
    Marc Cavazza
    Dale Mountain
    Rae Earnshaw
    The Visual Computer, 1999, 15 : 413 - 425