Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations

被引：0

作者：

Shen, Tianyu ^{[1
]}

Li, Deqi ^{[1
]}

Wang, Fei-Yue ^{[2
,3
]}

Huang, Hua ^{[1
]}

机构：

[1] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China

[2] Chinese Acad Sci, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China

[3] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

关键词：

Three-dimensional displays; Pose estimation; Feature extraction; Location awareness; Cameras; Semantics; Solid modeling; Human depth perceiving; multi-person 3d pose estimation; multi-scale representation; occlusion handling;

D O I：

10.1109/TMM.2022.3233251

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Estimating absolute 3D poses of multiple people from monocular image is challenging due to the presence of occlusions and the scale variation among different persons. Among the existing methods, the top-down paradigms are highly dependent on human detection which is prone to the influence from inter-person occlusions, while the bottom-up paradigms suffer from the difficulties in keypoint feature extraction caused by scale variation and unreliable joint grouping caused by occlusions. To address these challenges, we introduce a novel multi-person 3D pose estimation framework, aided by multi-scale feature representations and human depth perceiving. Firstly, a waterfall-based architecture is incorporated for multi-scale feature representations to achieve a more accurate estimation of occluded joints with a better detection of human shapes. Then the global and local representations are fused for handling the effects of inter-person occlusion and scale variation in depth perceiving and keypoint feature extraction. Finally, with the guidance of the fused multi-scale representations, a depth-aware model is exploited for better 2D joint grouping and 3D pose recovering. Quantitative and qualitative evaluations on benchmark datasets of MuCo-3DHP and MuPoTS-3D prove the effectiveness of our proposed method. Furthermore, we produce an occluded MuPoTS-3D dataset and the experiments on it validate the superiority of our method for overcoming the occlusions.

引用

页码：1439 / 1451

页数：13

共 50 条

[1] Multi-person 3D Pose Estimation and Tracking in Sports
Bridgeman, Lewis
Volino, Marco
Guillemaut, Jean-Yves
Hilton, Adrian
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2487 - 2496
[2] Multi-person Absolute 3D Human Pose Estimation with Weak Depth Supervision
Veges, Marton
Lorincz, Andras
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 : 258 - 270
[3] Multi-Person 3D Pose Estimation With Occlusion Reasoning
Chen, Xipeng
Zhang, Junzheng
Wang, Keze
Wei, Pengxu
Lin, Liang
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 878 - 889
[4] AnimePose: Multi-person 3D pose estimation and animation
Kumarapu, Laxman
Mukherjee, Prerana
[J]. PATTERN RECOGNITION LETTERS, 2021, 147 : 16 - 24
[5] Direct Multi-view Multi-person 3D Pose Estimation
Wang, Tao
Zhang, Jianfeng
Cai, Yujun
Yan, Shuicheng
Feng, Jiashi
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[6] Multi-Person Hierarchical 3D Pose Estimation in Natural Videos
Gu, Renshu
Wang, Gaoang
Jiang, Zhongyu
Hwang, Jenq-Neng
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 4245 - 4257
[7] Dynamic Graph Reasoning for Multi-person 3D Pose Estimation
Qiu, Zhongwei
Yang, Qiansheng
Wang, Jian
Fu, Dongmei
[J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3521 - 3529
[8] Multi-person 3D pose estimation from unlabelled data
Daniel Rodriguez-Criado
Pilar Bachiller-Burgos
George Vogiatzis
Luis J. Manso
[J]. Machine Vision and Applications, 2024, 35
[9] Multi-person 3D pose estimation from unlabelled data
Rodriguez-Criado, Daniel
Bachiller-Burgos, Pilar
Vogiatzis, George
Manso, Luis J.
[J]. MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
[10] Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation
Fabbri, Matteo
Lanzi, Fabio
Calderara, Simone
Alletto, Stefano
Cucchiara, Rita
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7202 - 7211

← 1 2 3 4 5 →