A generic diffusion-based approach for 3D human pose prediction in the wild

被引:5
|
作者
Saadatnejad, Saeed [1 ]
Rasekh, Ali [1 ]
Mofayezi, Mohammadreza [1 ]
Medghalchi, Yasamin [1 ]
Rajahzadeh, Sara [1 ]
Mordan, Taylor [1 ]
Alahi, Alexandre [1 ]
机构
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
D O I
10.1109/ICRA48891.2023.10160399
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are considered as a single sequence containing missing elements (whether in the observation or prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizon, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state-of-the-art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model as a preprocessing step to repair their inputs and a post-processing step to refine their outputs. The code is available online: https://github.com/vita- epfl/DePOSit.
引用
收藏
页码:8246 / 8253
页数:8
相关论文
共 50 条
  • [21] Adapted human pose: monocular 3D human pose estimation with zero real 3D pose data
    Shuangjun Liu
    Naveen Sehgal
    Sarah Ostadabbas
    Applied Intelligence, 2022, 52 : 14491 - 14506
  • [22] Estimating Egocentric 3D Human Pose in the Wild with External Weak Supervision
    Wang, Jian
    Liu, Lingjie
    Xu, Weipeng
    Sarkar, Kripasindhu
    Luvizon, Diogo
    Theobalt, Christian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13147 - 13156
  • [23] RFID-based 3D human pose tracking: A subject generalization approach
    Yang, Chao
    Wang, Xuyu
    Mao, Shiwen
    DIGITAL COMMUNICATIONS AND NETWORKS, 2022, 8 (03) : 278 - 288
  • [24] A Bayesian Part-based Approach to 3D Human Pose and Camera Estimation
    Brau, Ernesto
    Jiang, Hao
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1762 - 1767
  • [25] RFID-based 3D human pose tracking:A subject generalization approach
    Chao Yang
    Xuyu Wang
    Shiwen Mao
    Digital Communications and Networks, 2022, 8 (03) : 278 - 288
  • [26] EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild
    Kaufmann, Manuel
    Song, Jie
    Guo, Chen
    Shen, Kaiyue
    Jiang, Tianjian
    Tang, Chengcheng
    Zarate, Juan Jose
    Hilliges, Otmar
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14586 - 14597
  • [27] 3D Pose Estimation and 3D Model Retrieval for Objects in the Wild
    Grabner, Alexander
    Roth, Peter M.
    Lepetit, Vincent
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3022 - 3031
  • [28] An approach to 3D pose determination
    Ezquerra, N
    Mullick, R
    ACM TRANSACTIONS ON GRAPHICS, 1996, 15 (02): : 99 - 120
  • [29] Diff3DHPE: A Diffusion Model for 3D Human Pose Estimation
    Zhou, Jieming
    Zhang, Tong
    Hayder, Zeeshan
    Petersson, Lars
    Harandi, Mehrtash
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2084 - 2094
  • [30] 3D generic object categorization, localization and pose estimation
    Savarese, Silvio
    Fei-Fei, Li
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 1245 - 1252