Learning a Robust Part-Aware Monocular 3D Human Pose Estimator via Neural Architecture Search

被引：2

作者：

Chen, Zerui ^{[1
,2
]}

Huang, Yan ^{[1
]}

Yu, Hongyuan ^{[1
,2
]}

Wang, Liang ^{[1
,2
,3
,4
]}

机构：

[1] CASIA, Ctr Res Intelligent Percept & Comp, NLPR, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China

[3] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China

[4] Chinese Acad Sci, Artificial Intelligence Res, Beijing, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2022年 / 130卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Monocular 3D human pose estimation; Heterogeneous human body parts; Neural architecture search;

D O I：

10.1007/s11263-021-01525-0

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Even though most existing monocular 3D human pose estimation methods achieve very competitive performance, they are limited in estimating heterogeneous human body parts with the same decoder architecture. In this work, we present an approach to build a part-aware 3D human pose estimator to better deal with these heterogeneous human body parts. Our proposed method consists of two learning stages: (1) searching suitable decoder architectures for specific parts and (2) training the part-aware 3D human pose estimator built with these optimized neural architectures. Consequently, our searched model is very efficient and compact and can automatically select a suitable decoder architecture to estimate each human body part. In comparison with previous state-of-the-art models built with ResNet-50 network, our method can achieve better performance and reduce 64.4% parameters and 8.5% FLOPs (multiply-adds). We validate the robustness and stability of our searched models by conducting extensive and rigorous ablation experiments. Our method can advance state-of-the-art accuracy on both the single-person and multi-person 3D human pose estimation benchmarks with affordable computational cost.

引用

页码：56 / 75

页数：20

共 50 条

[1] Learning a Robust Part-Aware Monocular 3D Human Pose Estimator via Neural Architecture Search
Zerui Chen
Yan Huang
Hongyuan Yu
Liang Wang
International Journal of Computer Vision, 2022, 130 : 56 - 75
[2] PSPDNet: Part-aware shape and pose disentanglement neural network for 3D human animating meshes
Li, Guiqing
Zeng, Juncheng
Zeng, Fanzhong
Yao, Chenhao
Kuang, Bixia
Nie, Yongwei
COMPUTER AIDED GEOMETRIC DESIGN, 2023, 104
[3] Boosting Monocular 3D Human Pose Estimation With Part Aware Attention
Xue, Youze
Chen, Jiansheng
Gu, Xiangming
Ma, Huimin
Ma, Hongbing
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4278 - 4291
[4] Part-aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking
Chu, Hau
Lee, Jia-Hong
Lee, Yao-Chih
Hsu, Ching-Hsien
Li, Jia-Da
Chen, Chu-Song
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1472 - 1481
[5] Limb Pose Aware Networks for Monocular 3D Pose Estimation
Wu, Lele
Yu, Zhenbo
Liu, Yijiang
Liu, Qingshan
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 906 - 917
[6] ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection
Xu, Zhenbo
Zhang, Wei
Ye, Xiaoqing
Tan, Xiao
Yang, Wei
Wen, Shilei
Ding, Errui
Meng, Ajin
Huang, Liusheng
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12557 - 12564
[7] Robust 3D Human Pose Estimation via Dual Dictionaries Learning
Ji, Hao
Su, Fei
2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3370 - 3373
[8] 3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation
Zhang, Yi
Ji, Pengliang
Wang, Angtian
Mei, Jieru
Kortylewski, Adam
Yuille, Alan
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9365 - 9376
[9] Generating Part-Aware Editable 3D Shapes without 3D Supervision
Tertikas, Konstantinos
Paschalidou, Despoina
Pan, Boxiao
Park, Jeong Joon
Uy, Mikaela Angelina
Emiris, Ioannis
Avrithis, Yannis
Guibas, Leonidas
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4466 - 4478
[10] LEARNING MONOCULAR 3D HUMAN POSE ESTIMATION WITH SKELETAL INTERPOLATION
Chen, Ziyi
Sugimoto, Akihiro
Lai, Shang-Hong
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4218 - 4222

← 1 2 3 4 5 →