Hierarchical Contextual Refinement Networks for Human Pose Estimation

被引:45
|
作者
Nie, Xuecheng [1 ]
Feng, Jiashi [1 ]
Xing, Junliang [2 ]
Xiao, Shengtao [3 ]
Yan, Shuicheng [1 ,3 ]
机构
[1] Natl Univ Singapore, ECE Dept, Learning & Vis Lab, Singapore 117583, Singapore
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[3] Qihoo 360 AI Inst, Beijing 100016, Peoples R China
关键词
Human pose estimation; joint complexity-aware; hierarchical contextual refinement network; PICTORIAL STRUCTURES; FLEXIBLE MIXTURES;
D O I
10.1109/TIP.2018.2872628
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting human pose in the wild is a challenging problem due to high flexibility of joints and possible occlusion. Existing approaches generally tackle the difficulties either by holistic prediction or multi-stage processing, which suffer from poor performance for locating challenging joints or high computational cost. In this paper, we propose a new hierarchical contextual refinement network (HCRN) to robustly predict human poses in an efficient manner, where human body joints of different complexities are processed at different layers in a context hierarchy. Different from existing approaches, our proposed model predicts positions of joints from easy to difficult in a single stage through effectively exploiting informative contexts provided in the previous layer. Such approach offers two appealing advantages over state-of-the-arts: 1) more accurate than predicting all the joints together and 2) more efficient than multi-stage processing methods. We design a contextual refinement unit (CRU) to implement the proposed model, which enables auto-diffusion of joint detection results to effectively transfer informative context from easy joints to difficult ones. In this way, difficult joints can be reliably detected even in presence of occlusion or severe distracting factors. Multiple CRUs are organized into a tree-structured hierarchy which is end-to-end trainable and does not require processing joints for multiple iterations. Comprehensive experiments evaluate the efficacy and efficiency of the proposed HCRN model to improve well-established baselines and achieve the new state-of-the-art on multiple human pose estimation benchmarks.
引用
收藏
页码:924 / 936
页数:13
相关论文
共 50 条
  • [1] Deep Refinement Convolutional Networks for Human Pose Estimation
    Marras, Ioannis
    Palasek, Petar
    Patras, Ioannis
    [J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 446 - 453
  • [2] Hierarchical Dynamic Programming Module for Human Pose Refinement
    Xie, Chunyang
    Zhang, Dongheng
    Hu, Yang
    Chen, Yan
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6214 - 6226
  • [3] Efficient Human Pose Estimation in Hierarchical Context
    Zhang, Feng
    Zhu, Xiatian
    Ye, Mao
    [J]. IEEE ACCESS, 2019, 7 : 29365 - 29373
  • [4] Hierarchical Adversarial Network for Human Pose Estimation
    Radwan, Ibrahim
    Moustafa, Nour
    Keating, Byron
    Choo, Kim-Kmang Raymond
    Goecke, Roland
    [J]. IEEE ACCESS, 2019, 7 : 103619 - 103628
  • [5] Hierarchical pose estimation for human gait analysis
    Spehr, Jens
    Winkelbach, Simon
    Wahl, Friedrich M.
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2012, 106 (02) : 104 - 113
  • [6] FaSRnet: a feature and semantics refinement network for human pose estimation
    Zhong, Yuanhong
    Xu, Qianfeng
    Zhong, Daidi
    Yang, Xun
    Wang, Shanshan
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2024, 25 (04) : 513 - 526
  • [7] Hierarchical Graph Neural Network for Human Pose Estimation
    Zheng, Guanghua
    Zhao, Zhongqiu
    Zhang, Zhao
    Yang, Yi
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2663 - 2668
  • [8] Human Pose Estimation Using Exemplars and Part Based Refinement
    Su, Yanchao
    Ai, Haizhou
    Yamashita, Takayoshi
    Lao, Shihong
    [J]. COMPUTER VISION - ACCV 2010, PT II, 2011, 6493 : 174 - +
  • [9] Hybrid Refinement-Correction Heatmaps for Human Pose Estimation
    Kamel, Aouaidjia
    Sheng, Bin
    Li, Ping
    Kim, Jinman
    Feng, David Dagan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1330 - 1342
  • [10] Dilated Hourglass Networks for Human Pose Estimation
    Zhang, Yudong
    Liu, Jing
    Huang, Kaiyu
    [J]. 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 2597 - 2602