HKE-GCN: Heatmaps-guided Keypoints Encoder and Graph Convolutional Network for Human Pose Estimation

被引:4
|
作者
Xia, Han [1 ]
Wang, Yiran [2 ]
Wang, Xiaoru [1 ]
Xiong, Songkai [1 ]
Yu, Zhihong [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Beijing Forestry Univ, Beijing, Peoples R China
[3] Intel China Res Ctr, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Human Pose Estimation; Heatmaps-guided Keypoints Encoder; Graph Convolutional Network;
D O I
10.1109/IJCNN55064.2022.9892251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-person pose estimation is a challenging task which aims to locate keypoints for multiple persons. Graph convolutional network can effectively capture the semantic relationship among keypoints according to the kinematic structure of the human body, which is beneficial to locate keypoints but is the lack of ability of most CNN-based models. However, existing GCN-based methods mostly flatten the 2D features directly to obtain 1D embeddings, leading to the redundant information in keypoints embeddings, large size of keypoints embeddings, and high computation cost. To address these problems, we propose a two-stage framework based on Heatmaps-guided Keypoints Encoder and graph convolutional network, called HKE-GCN. The first stage uses a heatmaps-based network to predict the heatmaps of keypoints, then the second stage refines the prediction of the first stage. The second stage consists of two modules: Heatmaps-guided Keypoints Encoder (HKE) and Graph-based Refinement Module (GRM), which are used to generate keypoints embeddings according to the guidance of heatmaps and explicitly learn the relationship among keypoints based on GCN, respectively. Experiments show our framework is model-agnostic and our proposed modules are effective and lightweight. Our best model achieves state-of-the-art 76.4AP on COCO test-dev.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Multibranch Attention Graph Convolutional Networks for 3-D Human Pose Estimation
    Yin, Yanfang
    Liu, Ming
    Zhu, Qigang
    Zhang, Shuaishuai
    Hussien, Naseer Ali
    Fan, Yong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [42] Motion-Guided Graph Convolutional Network for Human Action Recognition
    Li, Jingjing
    Huang, Zhangjin
    Zou, Lu
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (07): : 1077 - 1086
  • [43] 3D Human Pose Estimation Using Mobius Graph Convolutional Networks
    Azizi, Niloofar
    Possegger, Horst
    Rodola, Emanuele
    Bischof, Horst
    COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 160 - 178
  • [44] Context-Guided Adaptive Network for Efficient Human Pose Estimation
    Zhao, Lei
    Wen, Jun
    Wang, Pengfei
    Zhen, Nenggan
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3492 - 3499
  • [45] Optimal Deep Convolutional Neural Network with Pose Estimation for Human Activity Recognition
    Nandagopal, S.
    Karthy, G.
    Oliver, A. Sheryl
    Subha, M.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (02): : 1719 - 1733
  • [46] Human Pose Estimation via Multi-resolution Convolutional Neural Network
    Zhu, Aichun
    Jin, Jing
    Wang, Tian
    Zhu, Qiurong
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 700 - 705
  • [47] Squirrel Search Optimization with Deep Convolutional Neural Network for Human Pose Estimation
    Ishwarya, K.
    Nithya, A. Alice
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 6081 - 6099
  • [48] Adversarial PoseNet: A Structure-aware Convolutional Network for Human Pose Estimation
    Chen, Yu
    Shen, Chunhua
    Wei, Xiu-Shen
    Liu, Lingqiao
    Yang, Jian
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1221 - 1230
  • [49] Spatial-Temporal-Geometric Graph Convolutional Network for 3-D Human Pose Estimation From Multiview Video
    Dong, Kaiwen
    Zhou, Yu
    Riou, Kevin
    Yun, Xiao
    Sun, Yanjing
    Subrin, Kevin
    Le Callet, Patrick
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [50] SMGNFORMER: Fusion Mamba-graph transformer network for human pose estimation
    Li, Yi
    Wang, Zan
    Niu, Weiran
    IET COMPUTER VISION, 2025, 19 (01)