Learning a Robust Part-Aware Monocular 3D Human Pose Estimator via Neural Architecture Search

被引:2
|
作者
Chen, Zerui [1 ,2 ]
Huang, Yan [1 ]
Yu, Hongyuan [1 ,2 ]
Wang, Liang [1 ,2 ,3 ,4 ]
机构
[1] CASIA, Ctr Res Intelligent Percept & Comp, NLPR, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[3] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
[4] Chinese Acad Sci, Artificial Intelligence Res, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Monocular 3D human pose estimation; Heterogeneous human body parts; Neural architecture search;
D O I
10.1007/s11263-021-01525-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Even though most existing monocular 3D human pose estimation methods achieve very competitive performance, they are limited in estimating heterogeneous human body parts with the same decoder architecture. In this work, we present an approach to build a part-aware 3D human pose estimator to better deal with these heterogeneous human body parts. Our proposed method consists of two learning stages: (1) searching suitable decoder architectures for specific parts and (2) training the part-aware 3D human pose estimator built with these optimized neural architectures. Consequently, our searched model is very efficient and compact and can automatically select a suitable decoder architecture to estimate each human body part. In comparison with previous state-of-the-art models built with ResNet-50 network, our method can achieve better performance and reduce 64.4% parameters and 8.5% FLOPs (multiply-adds). We validate the robustness and stability of our searched models by conducting extensive and rigorous ablation experiments. Our method can advance state-of-the-art accuracy on both the single-person and multi-person 3D human pose estimation benchmarks with affordable computational cost.
引用
收藏
页码:56 / 75
页数:20
相关论文
共 50 条
  • [1] Learning a Robust Part-Aware Monocular 3D Human Pose Estimator via Neural Architecture Search
    Zerui Chen
    Yan Huang
    Hongyuan Yu
    Liang Wang
    International Journal of Computer Vision, 2022, 130 : 56 - 75
  • [2] PSPDNet: Part-aware shape and pose disentanglement neural network for 3D human animating meshes
    Li, Guiqing
    Zeng, Juncheng
    Zeng, Fanzhong
    Yao, Chenhao
    Kuang, Bixia
    Nie, Yongwei
    COMPUTER AIDED GEOMETRIC DESIGN, 2023, 104
  • [3] Boosting Monocular 3D Human Pose Estimation With Part Aware Attention
    Xue, Youze
    Chen, Jiansheng
    Gu, Xiangming
    Ma, Huimin
    Ma, Hongbing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4278 - 4291
  • [4] Part-aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking
    Chu, Hau
    Lee, Jia-Hong
    Lee, Yao-Chih
    Hsu, Ching-Hsien
    Li, Jia-Da
    Chen, Chu-Song
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1472 - 1481
  • [5] Limb Pose Aware Networks for Monocular 3D Pose Estimation
    Wu, Lele
    Yu, Zhenbo
    Liu, Yijiang
    Liu, Qingshan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 906 - 917
  • [6] ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection
    Xu, Zhenbo
    Zhang, Wei
    Ye, Xiaoqing
    Tan, Xiao
    Yang, Wei
    Wen, Shilei
    Ding, Errui
    Meng, Ajin
    Huang, Liusheng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12557 - 12564
  • [7] Robust 3D Human Pose Estimation via Dual Dictionaries Learning
    Ji, Hao
    Su, Fei
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3370 - 3373
  • [8] 3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation
    Zhang, Yi
    Ji, Pengliang
    Wang, Angtian
    Mei, Jieru
    Kortylewski, Adam
    Yuille, Alan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9365 - 9376
  • [9] Generating Part-Aware Editable 3D Shapes without 3D Supervision
    Tertikas, Konstantinos
    Paschalidou, Despoina
    Pan, Boxiao
    Park, Jeong Joon
    Uy, Mikaela Angelina
    Emiris, Ioannis
    Avrithis, Yannis
    Guibas, Leonidas
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 4466 - 4478
  • [10] LEARNING MONOCULAR 3D HUMAN POSE ESTIMATION WITH SKELETAL INTERPOLATION
    Chen, Ziyi
    Sugimoto, Akihiro
    Lai, Shang-Hong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4218 - 4222