A generic diffusion-based approach for 3D human pose prediction in the wild

Cited by: 19

Authors:

Saadatnejad, Saeed [1]

Rasekh, Ali [1]

Mofayezi, Mohammadreza [1]

Medghalchi, Yasamin [1]

Rajahzadeh, Sara [1]

Mordan, Taylor [1]

Alahi, Alexandre [1]

Affiliations:

[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

Source:

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023) | 2023

DOI:

10.1109/ICRA48891.2023.10160399

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are considered as a single sequence containing missing elements (whether in the observation or prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizons, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state-of-the-art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model as a preprocessing step to repair their inputs and a post-processing step to refine their outputs. The code is available online: https://github.com/vita-epfl/DePOSit.
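
The framing described in the abstract can be illustrated with a small sketch: observed and future frames are joined into one sequence, a mask records which values are known, and every missing value starts out as Gaussian noise. This is not the DePOSit implementation. The function name `build_denoising_input`, the use of `None` for an occluded value, and the flat per-frame layout are all assumptions made for illustration.

```python
import random
from typing import List, Optional, Tuple

# One pose frame as a flat list of coordinates.
# None marks a missing (e.g. occluded) value; this encoding is an assumption.
Frame = List[Optional[float]]


def build_denoising_input(
    observed: List[Frame],
    horizon: int,
    seed: int = 0,
) -> Tuple[List[List[float]], List[List[int]]]:
    """Return (sequence, mask) covering observation plus prediction frames.

    mask[t][d] == 1 where the value is known (usable as conditioning),
    0 where it is missing and has been filled with standard Gaussian noise.
    """
    rng = random.Random(seed)
    dim = len(observed[0])
    # Future frames are entirely missing, so they join the sequence as all-None frames.
    full = observed + [[None] * dim for _ in range(horizon)]

    sequence: List[List[float]] = []
    mask: List[List[int]] = []
    for frame in full:
        values, known = [], []
        for x in frame:
            if x is None:
                values.append(rng.gauss(0.0, 1.0))  # missing value -> noise to be denoised
                known.append(0)
            else:
                values.append(float(x))
                known.append(1)
        sequence.append(values)
        mask.append(known)
    return sequence, mask


# Two observed frames of a toy 3-value pose (one value occluded), two frames to predict.
obs = [[0.1, 0.2, 0.3], [0.1, None, 0.4]]
seq, mask = build_denoising_input(obs, horizon=2)
```

In a conditional diffusion setup of this kind, a denoiser would typically update only the positions where the mask is 0 and keep the known values fixed as conditioning. The denoiser itself is outside the scope of this sketch.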

Pages: 8246-8253

Number of pages: 8
