Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats

被引:8
|
作者
Sarandi, Istvan [1 ]
Hermans, Alexander [1 ]
Leibe, Bastian [1 ]
机构
[1] Rhein Westfal TH Aachen, Aachen, Germany
关键词
D O I
10.1109/WACV56688.2023.00297
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning-based 3D human pose estimation performs best when trained on large amounts of labeled data, making combined learning from many datasets an important research direction. One obstacle to this endeavor are the different skeleton formats provided by different datasets, i.e., they do not label the same set of anatomical landmarks. There is little prior research on how to best supervise one model with such discrepant labels. We show that simply using separate output heads for different skeletons results in inconsistent depth estimates and insufficient information sharing across skeletons. As a remedy, we propose a novel affine-combining autoencoder (ACAE) method to perform dimensionality reduction on the number of landmarks. The discovered latent 3D points capture the redundancy among skeletons, enabling enhanced information sharing when used for consistency regularization. Our approach scales to an extreme multi-dataset regime, where we use 28 3D human pose datasets to supervise one model, which outperforms prior work on a range of benchmarks, including the challenging 3D Poses in the Wild (3DPW) dataset. Our code and models are available for research purposes.(1)
引用
收藏
页码:2955 / 2965
页数:11
相关论文
共 50 条
  • [31] Boosting Monocular 3D Human Pose Estimation With Part Aware Attention
    Xue, Youze
    Chen, Jiansheng
    Gu, Xiangming
    Ma, Huimin
    Ma, Hongbing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4278 - 4291
  • [32] Learning 6-DOF Grasping Interaction via Deep Geometry-aware 3D Representations
    Yan, Xinchen
    Hsu, Jasmine
    Khansari, Mohammad
    Bai, Yunfei
    Pathak, Arkanath
    Gupta, Abhinav
    Davidson, James
    Lee, Honglak
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 3766 - 3773
  • [33] PoseGU: 3D human pose estimation with novel human pose generator and unbiased learning
    Guan, Shannan
    Lu, Haiyan
    Zhu, Linchao
    Fang, Gengfa
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 233
  • [34] A Novel Skeleton-based Model with Spine for 3D Human Pose Estimation
    Li, Zhaoxu
    Liu, Sheng
    Bai, Jue
    Peng, Chenglei
    Li, Yang
    Du, Sidan
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 501 - 506
  • [35] A monocular 3D human pose estimation approach for virtual character skeleton retargeting
    Yang A.
    Liu G.
    Naeem W.
    Wu D.
    Zhou Y.
    Chen L.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (07) : 9563 - 9574
  • [36] Learning Pose Grammar to Encode Human Body Configuration for 3D Pose Estimation
    Fang, Hao-Shu
    Xu, Yuanlu
    Wang, Wenguan
    Liu, Xiaobai
    Zhu, Song-Chun
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6821 - 6828
  • [37] Estimation of 3D human pose using prior knowledge
    Zhang, Lei
    Chen, Shu
    Zou, Beiji
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (04)
  • [38] GoPose: 3D Human Pose Estimation Using WiFi
    Ren, Yili
    Wang, Zi
    Wang, Yichao
    Tan, Sheng
    Chen, Yingying
    Yang, Jie
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2022, 6 (02):
  • [39] Absolute 3D Human Pose Estimation Using Noise-Aware Radial Distance Predictions
    Chang, Inho
    Park, Min-Gyu
    Kim, Je Woo
    Yoon, Ju Hong
    SYMMETRY-BASEL, 2023, 15 (01):
  • [40] Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition
    Chen, Tianlang
    Fang, Chen
    Shen, Xiaohui
    Zhu, Yiheng
    Chen, Zhili
    Luo, Jiebo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 198 - 209