Skeleton Cluster Tracking for robust multi-view multi-person 3D human pose estimation

被引:1
|
作者
Niu, Zehai [1 ]
Lu, Ke [1 ,2 ]
Xue, Jian [1 ]
Wang, Jinbao [3 ,4 ]
机构
[1] Univ Chinese Acad Sci, Sch Engn Sci, 19A Yuquan Rd, Beijing 100049, Peoples R China
[2] Peng Cheng Lab, Vanke Cloud City Phase I Bldg 8,Xili St, Shenzhen 518055, Guangdong, Peoples R China
[3] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
[4] Guangdong Prov Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China
关键词
3D human pose estimation; Motion capture; Deep learning;
D O I
10.1016/j.cviu.2024.104059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The multi -view 3D human pose estimation task relies on 2D human pose estimation for each view; however, severe occlusion, truncation, and human interaction lead to incorrect 2D human pose estimation for some views. The traditional "Matching-Lifting-Tracking"paradigm amplifies the incorrect 2D human pose into an incorrect 3D human pose, which significantly challenges the robustness of multi -view 3D human pose estimation. In this paper, we propose a novel method that tackles the inherent difficulties of the traditional paradigm. This method is rooted in the newly devised "Skeleton Pooling -Clustering -Tracking (SPCT)"paradigm. It initiates a 2D human pose estimation for each perspective. Then a symmetrical dilated network is created for skeleton pool estimation. Upon clustering the skeleton pool, we introduce and implement an innovative tracking method that is explicitly designed for the SPCT paradigm. The tracking method refines and filters the skeleton clusters, thereby enhancing the robustness of the multi -person 3D human pose estimation results. By coupling the skeleton pool with the tracking refinement process, our method obtains high -quality multi -person 3D human pose estimation results despite severe occlusions that produce erroneous 2D and 3D estimates. By employing the proposed SPCT paradigm and a computationally efficient network architecture, our method outperformed existing approaches regarding robustness on the Shelf, 4D Association, and CMU Panoptic datasets, and could be applied in practical scenarios such as markerless motion capture and animation production.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Direct Multi-view Multi-person 3D Pose Estimation
    Wang, Tao
    Zhang, Jianfeng
    Cai, Yujun
    Yan, Shuicheng
    Feng, Jiashi
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo
    Lin, Jiahao
    Lee, Gim Hee
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11881 - 11890
  • [3] VTP: volumetric transformer for multi-view multi-person 3D pose estimation
    Yuxing Chen
    Renshu Gu
    Ouhan Huang
    Gangyong Jia
    [J]. Applied Intelligence, 2023, 53 : 26568 - 26579
  • [4] VTP: volumetric transformer for multi-view multi-person 3D pose estimation
    Chen, Yuxing
    Gu, Renshu
    Huang, Ouhan
    Jia, Gangyong
    [J]. APPLIED INTELLIGENCE, 2023, 53 (22) : 26568 - 26579
  • [5] RF-based Multi-view Pose Machine for Multi-Person 3D Pose Estimation
    Xie, Chunyang
    Zhang, Dongheng
    Wu, Zhi
    Yu, Cong
    Hu, Yang
    Sun, Qibin
    Chen, Yan
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2669 - 2674
  • [6] Unsupervised Multi-view Multi-person 3D Pose Estimation Using Reprojection Error
    de Franca Silva, Diogenes Wallis
    Do Monte Lima, Joao Paulo Silva
    Macedo, David
    Zanchettin, Cleber
    Thomas, Diego Gabriel Francis
    Uchiyama, Hideaki
    Teichrieb, Veronica
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 482 - 494
  • [7] Multi-person 3D Pose Estimation and Tracking in Sports
    Bridgeman, Lewis
    Volino, Marco
    Guillemaut, Jean-Yves
    Hilton, Adrian
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2487 - 2496
  • [8] VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild
    Zhang, Yifu
    Wang, Chunyu
    Wang, Xinggang
    Liu, Wenyu
    Zeng, Wenjun
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 2613 - 2626
  • [10] Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images
    Wu, Size
    Jin, Sheng
    Liu, Wentao
    Bai, Lei
    Qian, Chen
    Liu, Dong
    Ouyang, Wanli
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11128 - 11137