A generic diffusion-based approach for 3D human pose prediction in the wild

Cited by: 19
Authors
Saadatnejad, Saeed [1]
Rasekh, Ali [1]
Mofayezi, Mohammadreza [1]
Medghalchi, Yasamin [1]
Rajabzadeh, Sara [1]
Mordan, Taylor [1]
Alahi, Alexandre [1]
Affiliations
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
DOI
10.1109/ICRA48891.2023.10160399
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are considered as a single sequence containing missing elements (whether in the observation or prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizons, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state-of-the-art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model as a preprocessing step to repair their inputs and a post-processing step to refine their outputs. The code is available online: https://github.com/vita-epfl/DePOSit.
Pages: 8246-8253
Number of pages: 8