HEMlets PoSh: Learning Part-Centric Heatmap Triplets for 3D Human Pose and Shape Estimation

Cited by: 13
Authors
Zhou, Kun [1]
Han, Xiaoguang [2]
Jiang, Nianjuan [1]
Jia, Kui [3, 4, 5]
Lu, Jiangbo [6, 7]
Affiliations
[1] SmartMore Corp Ltd, Shenzhen, Guangdong, Peoples R China
[2] Chinese Univ Hong Kong, Shenzhen Inst Big Data, Shenzhen, Peoples R China
[3] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510641, Peoples R China
[4] Pazhou Lab, Guangzhou 510335, Peoples R China
[5] Peng Cheng Lab, Shenzhen 518005, Peoples R China
[6] SmartMore Corp Ltd, Shenzhen, Peoples R China
[7] South China Univ Technol, Guangzhou 510641, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Three-dimensional displays; Two dimensional displays; Heating systems; Pose estimation; Task analysis; Training; Shape; 3D human pose estimation; deep learning; heatmaps; human body mesh recovery;
DOI
10.1109/TPAMI.2021.3051173
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Estimating 3D human pose from a single image is a challenging task. This work addresses the uncertainty of lifting detected 2D joints to 3D space by introducing an intermediate state, Part-Centric Heatmap Triplets (HEMlets), which narrows the gap between the 2D observation and the 3D interpretation. HEMlets use three joint heatmaps to represent the relative depth of the end joints of each skeletal body part. In our approach, a Convolutional Network (ConvNet) is first trained to predict HEMlets from the input image, followed by a volumetric joint-heatmap regression. We use the integral operation to extract the joint locations from the volumetric heatmaps, which keeps the pipeline trainable end to end. Despite the simplicity of the network design, quantitative comparisons show a significant performance improvement over the best-performing prior methods (e.g., 20 percent on Human3.6M). The proposed method naturally supports training with "in-the-wild" images, where only weakly annotated relative depth information of skeletal joints is available. This further improves the generalization ability of our model, as validated by qualitative comparisons on outdoor images. Building on the strength of the HEMlets pose estimation, we further design and append a shallow yet effective network module to regress the SMPL parameters of body pose and shape. We term the entire HEMlets-based human pose and shape recovery pipeline HEMlets PoSh. Extensive quantitative and qualitative experiments on existing human body recovery benchmarks demonstrate the state-of-the-art results obtained with our HEMlets PoSh approach.
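The integral operation mentioned in the abstract is commonly implemented as a soft-argmax: the volumetric heatmap is normalized with a softmax, and the joint location is taken as the expected voxel coordinate under that distribution. The following is a minimal sketch under that assumption, not the authors' implementation. The function name `soft_argmax_3d`, the nested-list layout (depth, then rows, then columns), and the use of voxel indices as coordinates are all illustrative choices that do not come from the record.

```python
import math


def soft_argmax_3d(volume):
    """Return the expected (x, y, z) voxel coordinate of one joint.

    `volume` is a nested list of raw scores indexed as volume[z][y][x].
    This layout is an assumption made for illustration only.
    """
    # Subtract the maximum score so that exp() cannot overflow.
    peak = max(v for plane in volume for row in plane for v in row)

    total = 0.0
    ex = ey = ez = 0.0
    for z, plane in enumerate(volume):
        for y, row in enumerate(plane):
            for x, score in enumerate(row):
                w = math.exp(score - peak)  # unnormalized softmax weight
                total += w
                ex += w * x
                ey += w * y
                ez += w * z

    # Dividing by the sum of weights turns the sums into expectations.
    return ex / total, ey / total, ez / total


# A uniform 2 x 2 x 2 volume has its expectation at the centre.
uniform = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
print(soft_argmax_3d(uniform))  # (0.5, 0.5, 0.5)
```

Because the result is a weighted average and not a hard argmax, it varies smoothly with the scores, which is the property that lets gradients flow from a joint-coordinate loss back into the heatmap prediction.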
Pages: 3000-3014
Number of pages: 15