Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats

被引:8
|
作者
Sarandi, Istvan [1 ]
Hermans, Alexander [1 ]
Leibe, Bastian [1 ]
机构
[1] Rhein Westfal TH Aachen, Aachen, Germany
关键词
D O I
10.1109/WACV56688.2023.00297
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning-based 3D human pose estimation performs best when trained on large amounts of labeled data, making combined learning from many datasets an important research direction. One obstacle to this endeavor are the different skeleton formats provided by different datasets, i.e., they do not label the same set of anatomical landmarks. There is little prior research on how to best supervise one model with such discrepant labels. We show that simply using separate output heads for different skeletons results in inconsistent depth estimates and insufficient information sharing across skeletons. As a remedy, we propose a novel affine-combining autoencoder (ACAE) method to perform dimensionality reduction on the number of landmarks. The discovered latent 3D points capture the redundancy among skeletons, enabling enhanced information sharing when used for consistency regularization. Our approach scales to an extreme multi-dataset regime, where we use 28 3D human pose datasets to supervise one model, which outperforms prior work on a range of benchmarks, including the challenging 3D Poses in the Wild (3DPW) dataset. Our code and models are available for research purposes.(1)
引用
收藏
页码:2955 / 2965
页数:11
相关论文
共 50 条
  • [21] REAL-TIME 3D HEAD POSE ESTIMATION USING BOTH GEOMETRY AND LEARNING
    Raytchev, Bisser
    Kimura, Yusuke
    Yoda, Ikushi
    Sakaue, Katsuhiko
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1525 - 1528
  • [22] 3D Human Pose Estimation in the Wild by Adversarial Learning
    Yang, Wei
    Ouyang, Wanli
    Wang, Xiaolong
    Ren, Jimmy
    Li, Hongsheng
    Wang, Xiaogang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5255 - 5264
  • [23] 3D Human Pose Estimation from RGB-D Images Using Deep Learning Method
    Chun, Junchul
    Park, Seohee
    Ji, Myunggeun
    2018 INTERNATIONAL CONFERENCE ON SENSORS, SIGNAL AND IMAGE PROCESSING (SSIP 2018), 2018, : 51 - 55
  • [24] Correction to: Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation
    Shengping Zhang
    Chenyang Wang
    Liqiang Nie
    Hongxun Yao
    Qingming Huang
    Qi Tian
    International Journal of Computer Vision, 2023, 131 : 3119 - 3119
  • [25] Hourglass-GCN for 3D Human Pose Estimation Using Skeleton Structure and View Correlation
    Chen, Ange
    Wu, Chengdong
    Leng, Chuanjiang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (01): : 173 - 191
  • [26] 2D and 3D Human Pose Estimation and Analysis Using Deep Learning
    Yadav, Anju
    Saxena, Rahul
    Bhattacharya, Anubhav
    Pal, Vipin
    Pathak, Nitish
    ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY AND COMPUTING, AICTC 2021, 2022, 392 : 133 - 143
  • [27] Occlusion-Aware Networks for 3D Human Pose Estimation in Video
    Cheng, Yu
    Yang, Bo
    Wang, Bo
    Yan, Wending
    Tan, Robby T.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 723 - 732
  • [28] An Articulated Structure-aware Network for 3D Human Pose Estimation
    Tang, Zhenhua
    Zhang, Xiaoyan
    Hou, Junhui
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 48 - 63
  • [29] 3D Human Pose Estimation from Static Images Using Local Features and Discriminative Learning
    Sedai, Suman
    Flitti, Farid
    Bennamoun, Mohammed
    Huynh, Du
    IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2009, 5627 : 327 - 336
  • [30] View consistency aware holistic triangulation for 3D human pose estimation
    Wan, Xiaoyue
    Chen, Zhuo
    Zhao, Xu
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 236