HKE-GCN: Heatmaps-guided Keypoints Encoder and Graph Convolutional Network for Human Pose Estimation

被引:4
|
作者
Xia, Han [1 ]
Wang, Yiran [2 ]
Wang, Xiaoru [1 ]
Xiong, Songkai [1 ]
Yu, Zhihong [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Beijing Forestry Univ, Beijing, Peoples R China
[3] Intel China Res Ctr, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Human Pose Estimation; Heatmaps-guided Keypoints Encoder; Graph Convolutional Network;
D O I
10.1109/IJCNN55064.2022.9892251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-person pose estimation is a challenging task which aims to locate keypoints for multiple persons. Graph convolutional network can effectively capture the semantic relationship among keypoints according to the kinematic structure of the human body, which is beneficial to locate keypoints but is the lack of ability of most CNN-based models. However, existing GCN-based methods mostly flatten the 2D features directly to obtain 1D embeddings, leading to the redundant information in keypoints embeddings, large size of keypoints embeddings, and high computation cost. To address these problems, we propose a two-stage framework based on Heatmaps-guided Keypoints Encoder and graph convolutional network, called HKE-GCN. The first stage uses a heatmaps-based network to predict the heatmaps of keypoints, then the second stage refines the prediction of the first stage. The second stage consists of two modules: Heatmaps-guided Keypoints Encoder (HKE) and Graph-based Refinement Module (GRM), which are used to generate keypoints embeddings according to the guidance of heatmaps and explicitly learn the relationship among keypoints based on GCN, respectively. Experiments show our framework is model-agnostic and our proposed modules are effective and lightweight. Our best model achieves state-of-the-art 76.4AP on COCO test-dev.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Improved Graph Convolutional Neural Network for Dance Tracking and Pose Estimation
    Zhang, Liangliang
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [22] Improved Graph Convolutional Neural Network for Dance Tracking and Pose Estimation
    Zhang, Liangliang
    Computational Intelligence and Neuroscience, 2022, 2022
  • [23] Hierarchical Graph Neural Network for Human Pose Estimation
    Zheng, Guanghua
    Zhao, Zhongqiu
    Zhang, Zhao
    Yang, Yi
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2663 - 2668
  • [24] Robust 3D Human Pose Estimation Guided by Filtered Subsets of Body Keypoints
    Makris, Alexandros
    Argyros, Antonis
    PROCEEDINGS OF MVA 2019 16TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2019,
  • [25] Global Relation Reasoning Graph Convolutional Networks for Human Pose Estimation
    Wang, Rui
    Huang, Chenyang
    Wang, Xiangyang
    IEEE ACCESS, 2020, 8 : 38472 - 38480
  • [26] Structure-aware human pose estimation with graph convolutional networks
    Bin, Yanrui
    Chen, Zhao-Min
    Wei, Xiu-Shen
    Chen, Xinya
    Gao, Changxin
    Sang, Nong
    PATTERN RECOGNITION, 2020, 106
  • [27] Simplified-attention Enhanced Graph Convolutional Network for 3D human pose estimation
    Wang, Tianfeng
    Zhang, Xiaoxu
    NEUROCOMPUTING, 2022, 501 : 231 - 243
  • [28] TAA-GCN: A temporally aware Adaptive Graph Convolutional Network for age estimation
    Korban, Matthew
    Youngs, Peter
    Acton, Scott T.
    PATTERN RECOGNITION, 2023, 134
  • [29] An adversarial human pose estimation network injected with graph structure
    Tian, Lei
    Wang, Peng
    Liang, Guoqiang
    Shen, Chunhua
    PATTERN RECOGNITION, 2021, 115
  • [30] A Graph Attention Spatio-temporal Convolutional Network for 3D Human Pose Estimation in Video
    The Biomimetic and Intelligent Robotics Lab , School of Electromechanical Engineering, Guangdong University of Technology, Guangzhou
    510006, China
    不详
    不详
    Proc IEEE Int Conf Rob Autom, 2021, (3374-3380): : 3374 - 3380