A generic diffusion-based approach for 3D human pose prediction in the wild

Cited by: 5
Authors
Saadatnejad, Saeed [1]
Rasekh, Ali [1]
Mofayezi, Mohammadreza [1]
Medghalchi, Yasamin [1]
Rajahzadeh, Sara [1]
Mordan, Taylor [1]
Alahi, Alexandre [1]
Affiliations
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
DOI
10.1109/ICRA48891.2023.10160399
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can make predictions from noisy observations. We frame the prediction task as a denoising problem, where observation and prediction are treated as a single sequence containing missing elements (whether in the observation or the prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizons, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state of the art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model, as a preprocessing step to repair its inputs and as a post-processing step to refine its outputs. The code is available online: https://github.com/vita-epfl/DePOSit.
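The "single sequence with missing elements" framing in the abstract can be illustrated with a small sketch. It is not taken from the DePOSit code: the function name `build_masked_sequence`, its parameters, and the flat per-frame coordinate layout are all assumptions made for illustration, and no diffusion model is included. It only shows how observed frames and an empty prediction horizon might be joined into one sequence, with a mask marking which entries are known and Gaussian noise standing in for the missing ones.

```python
import random


def build_masked_sequence(observed, horizon, occluded=(), seed=0):
    """Join observed frames and an empty prediction horizon into one sequence.

    Hypothetical helper, not the authors' implementation.
    observed: list of frames, each a list of floats (flattened joint coordinates).
    horizon:  number of future frames to be predicted.
    occluded: (frame, coordinate) index pairs missing from the observation.
    Returns (sequence, mask); mask[t][d] is 1 for known entries, 0 for missing.
    """
    rng = random.Random(seed)
    n_obs = len(observed)
    dim = len(observed[0])
    occluded = set(occluded)

    sequence, mask = [], []
    for t in range(n_obs + horizon):
        frame, frame_mask = [], []
        for d in range(dim):
            # An entry is known only if it lies in the observation and is not occluded.
            known = t < n_obs and (t, d) not in occluded
            # Missing entries start as Gaussian noise; a conditional denoiser
            # would refine them while the known entries act as conditioning.
            frame.append(observed[t][d] if known else rng.gauss(0.0, 1.0))
            frame_mask.append(1 if known else 0)
        sequence.append(frame)
        mask.append(frame_mask)
    return sequence, mask


# Two observed frames of a toy 3-coordinate pose, one occluded entry, three future frames.
obs = [[0.1, 0.2, 0.3], [0.2, 0.3, 0.4]]
seq, mask = build_masked_sequence(obs, horizon=3, occluded={(1, 2)})
```

Under this framing, an occluded joint in the past and an unseen frame in the future are handled identically: both are masked entries for the denoiser to fill in.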
Pages: 8246-8253
Page count: 8