Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats

被引:8
|
作者
Sarandi, Istvan [1 ]
Hermans, Alexander [1 ]
Leibe, Bastian [1 ]
机构
[1] Rhein Westfal TH Aachen, Aachen, Germany
关键词
D O I
10.1109/WACV56688.2023.00297
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning-based 3D human pose estimation performs best when trained on large amounts of labeled data, making combined learning from many datasets an important research direction. One obstacle to this endeavor are the different skeleton formats provided by different datasets, i.e., they do not label the same set of anatomical landmarks. There is little prior research on how to best supervise one model with such discrepant labels. We show that simply using separate output heads for different skeletons results in inconsistent depth estimates and insufficient information sharing across skeletons. As a remedy, we propose a novel affine-combining autoencoder (ACAE) method to perform dimensionality reduction on the number of landmarks. The discovered latent 3D points capture the redundancy among skeletons, enabling enhanced information sharing when used for consistency regularization. Our approach scales to an extreme multi-dataset regime, where we use 28 3D human pose datasets to supervise one model, which outperforms prior work on a range of benchmarks, including the challenging 3D Poses in the Wild (3DPW) dataset. Our code and models are available for research purposes.(1)
引用
收藏
页码:2955 / 2965
页数:11
相关论文
共 50 条
  • [1] Geometry-aware 3D pose transfer using transformer autoencoder
    Liu, Shanghuan
    Gai, Shaoyan
    Da, Feipeng
    Waris, Fazal
    COMPUTATIONAL VISUAL MEDIA, 2024, 10 (6) : 1063 - 1078
  • [2] Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation
    Rhodin, Helge
    Salzmann, Mathieu
    Fua, Pascal
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 765 - 782
  • [3] Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation
    Chen, Xipeng
    Lin, Kwan-Yee
    Liu, Wentao
    Qian, Chen
    Lin, Liang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10887 - 10896
  • [4] Stacked Capsule Graph Autoencoders for geometry-aware 3D head pose estimation
    Hong, Chaoqun
    Chen, Liang
    Liang, Yuxin
    Zeng, Zhiqiang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 208
  • [5] Stacked Capsule Graph Autoencoders for geometry-aware 3D head pose estimation
    Hong, Chaoqun
    Chen, Liang
    Liang, Yuxin
    Zeng, Zhiqiang
    Computer Vision and Image Understanding, 2021, 208-209
  • [6] PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective Crop Layers
    Yu, Frank
    Salzmann, Mathieu
    Fua, Pascal
    Rhodin, Helge
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9060 - 9069
  • [7] Excavator 3D pose estimation using deep learning and hybrid datasets
    Assadzadeh, Amin
    Arashpour, Mehrdad
    Li, Heng
    Hosseini, Reza
    Elghaish, Faris
    Baduge, Shanaka
    ADVANCED ENGINEERING INFORMATICS, 2023, 55
  • [8] Geometry-Aware 3D Hand-Object Pose Estimation Under Occlusion via Hierarchical Feature Decoupling
    Cai, Yuting
    Pan, Huimin
    Yang, Jiayi
    Liu, Yichen
    Gao, Quanli
    Wang, Xihan
    ELECTRONICS, 2025, 14 (05):
  • [9] 3D Scene Geometry-Aware Constraint for Camera Localization with Deep Learning
    Tian, Mi
    Nie, Qiong
    Shen, Hao
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4211 - 4217
  • [10] 3D Vehicle Pose Estimation from an Image Using Geometry
    Stojanovic, Nikola
    Pantic, Vasilije
    Damjanovic, Vladan
    Vukmirovic, Srdan
    2022 21ST INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2022,