HEMlets PoSh: Learning Part-Centric Heatmap Triplets for 3D Human Pose and Shape Estimation

Cited by: 13

Authors:

Zhou, Kun [1]

Han, Xiaoguang [2]

Jiang, Nianjuan [1]

Jia, Kui [3,4,5]

Lu, Jiangbo [6,7]

Affiliations:

[1] SmartMore Corp Ltd, Shenzhen, Guangdong, Peoples R China

[2] Chinese Univ Hong Kong, Shenzhen Inst Big Data, Shenzhen, Peoples R China

[3] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510641, Peoples R China

[4] Pazhou Lab, Guangzhou 510335, Peoples R China

[5] Peng Cheng Lab, Shenzhen 518005, Peoples R China

[6] SmartMore Corp Ltd, Shenzhen, Peoples R China

[7] South China Univ Technol, Guangzhou 510641, Peoples R China

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2022, Vol. 44, No. 6

Funding:

National Natural Science Foundation of China;

Keywords:

Three-dimensional displays; Two dimensional displays; Heating systems; Pose estimation; Task analysis; Training; Shape; 3D human pose estimation; deep learning; heatmaps; human body mesh recovery;

DOI:

10.1109/TPAMI.2021.3051173

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Estimating 3D human pose from a single image is a challenging task. This work attempts to address the uncertainty of lifting the detected 2D joints to the 3D space by introducing an intermediate state, Part-Centric Heatmap Triplets (HEMlets), which shortens the gap between the 2D observation and the 3D interpretation. The HEMlets utilize three joint-heatmaps to represent the relative depth information of the end-joints for each skeletal body part. In our approach, a Convolutional Network (ConvNet) is first trained to predict HEMlets from the input image, followed by a volumetric joint-heatmap regression. We leverage the integral operation to extract the joint locations from the volumetric heatmaps, guaranteeing end-to-end learning. Despite the simplicity of the network design, the quantitative comparisons show a significant performance improvement over the best-of-grade methods (e.g., 20 percent on Human3.6M). The proposed method naturally supports training with "in-the-wild" images, where only weakly-annotated relative depth information of skeletal joints is available. This further improves the generalization ability of our model, as validated by qualitative comparisons on outdoor images. Leveraging the strength of the HEMlets pose estimation, we further design and append a shallow yet effective network module to regress the SMPL parameters of the body pose and shape. We term the entire HEMlets-based human pose and shape recovery pipeline HEMlets PoSh. Extensive quantitative and qualitative experiments on the existing human body recovery benchmarks justify the state-of-the-art results obtained with our HEMlets PoSh approach.
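
The integral operation mentioned in the abstract is commonly realized as a soft-argmax: the volumetric heatmap is normalized into a probability distribution and the joint location is taken as its expected coordinate, which keeps the step differentiable. The sketch below illustrates that general idea in NumPy for a single joint. The function name `soft_argmax_3d`, the `(depth, height, width)` layout, and the softmax normalization are assumptions made for illustration and are not taken from the paper's implementation.

```python
import numpy as np


def soft_argmax_3d(volume):
    """Return the expected (x, y, z) voxel coordinate of one joint.

    `volume` is an array of raw heatmap scores with the assumed
    layout (depth, height, width).
    """
    d, h, w = volume.shape

    # Softmax over all voxels (shifted by the max for numerical stability).
    scores = volume.reshape(-1) - volume.max()
    prob = np.exp(scores)
    prob /= prob.sum()
    prob = prob.reshape(d, h, w)

    # Marginal distributions along each axis.
    pz = prob.sum(axis=(1, 2))
    py = prob.sum(axis=(0, 2))
    px = prob.sum(axis=(0, 1))

    # Expected coordinate = sum of index * probability along each axis.
    x = float((px * np.arange(w)).sum())
    y = float((py * np.arange(h)).sum())
    z = float((pz * np.arange(d)).sum())
    return np.array([x, y, z])


if __name__ == "__main__":
    vol = np.zeros((8, 8, 8))
    vol[2, 3, 5] = 50.0  # sharp peak at z=2, y=3, x=5
    print(soft_argmax_3d(vol))
```

Because the result is a weighted average rather than a hard `argmax`, gradients flow back to every voxel of the heatmap, which is what makes end-to-end training possible.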

Pages: 3000-3014

Page count: 15
