HKE-GCN: Heatmaps-guided Keypoints Encoder and Graph Convolutional Network for Human Pose Estimation

被引:4
|
作者
Xia, Han [1 ]
Wang, Yiran [2 ]
Wang, Xiaoru [1 ]
Xiong, Songkai [1 ]
Yu, Zhihong [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] Beijing Forestry Univ, Beijing, Peoples R China
[3] Intel China Res Ctr, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Human Pose Estimation; Heatmaps-guided Keypoints Encoder; Graph Convolutional Network;
D O I
10.1109/IJCNN55064.2022.9892251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-person pose estimation is a challenging task which aims to locate keypoints for multiple persons. Graph convolutional network can effectively capture the semantic relationship among keypoints according to the kinematic structure of the human body, which is beneficial to locate keypoints but is the lack of ability of most CNN-based models. However, existing GCN-based methods mostly flatten the 2D features directly to obtain 1D embeddings, leading to the redundant information in keypoints embeddings, large size of keypoints embeddings, and high computation cost. To address these problems, we propose a two-stage framework based on Heatmaps-guided Keypoints Encoder and graph convolutional network, called HKE-GCN. The first stage uses a heatmaps-based network to predict the heatmaps of keypoints, then the second stage refines the prediction of the first stage. The second stage consists of two modules: Heatmaps-guided Keypoints Encoder (HKE) and Graph-based Refinement Module (GRM), which are used to generate keypoints embeddings according to the guidance of heatmaps and explicitly learn the relationship among keypoints based on GCN, respectively. Experiments show our framework is model-agnostic and our proposed modules are effective and lightweight. Our best model achieves state-of-the-art 76.4AP on COCO test-dev.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] A Graph Attention Spatio-temporal Convolutional Network for 3D Human Pose Estimation in Video
    Liu, Junfa
    Rojas, Juan
    Li, Yihui
    Liang, Zhijun
    Guan, Yisheng
    Xi, Ning
    Zhu, Haifei
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 3374 - 3380
  • [32] Compositional Graph Convolutional Networks for 3D Human Pose Estimation
    Zou, Zhiming
    Liu, Tianqi
    Wu, Dapeng
    Tang, Wei
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [33] MS-GCN : Multi-Stream Graph Convolution Network for Driver Head Pose Estimation
    Li, Yao-Kun
    Yu, Yue-Zhao
    Liu, Yu-Liang
    Gou, Chao
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 3819 - 3824
  • [34] DGCN: Dynamic Graph Convolutional Network for Efficient Multi-Person Pose Estimation
    Qiu, Zhongwei
    Qiu, Kai
    Fu, Jianlong
    Fu, Dongmei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11924 - 11931
  • [35] Graph Convolutional Network for 3D Object Pose Estimation in a Point Cloud
    Jung, Tae-Won
    Jeong, Chi-Seo
    Kim, In-Seon
    Yu, Min-Su
    Kwon, Soon-Chul
    Jung, Kye-Dong
    SENSORS, 2022, 22 (21)
  • [36] 3D HUMAN POSE REGRESSION USING GRAPH CONVOLUTIONAL NETWORK
    Banik, Soubarna
    García, Alejandro Mendoza
    Knoll, Alois
    arXiv, 2021,
  • [37] 3D HUMAN POSE REGRESSION USING GRAPH CONVOLUTIONAL NETWORK
    Banik, Soubarna
    GarcIa, Alejandro Mendoza
    Knoll, Alois
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 924 - 928
  • [38] Human Pose Prediction Using Interpretable Graph Convolutional Network for Smart Home
    Yang, Boyu
    Hu, Liyazhou
    Peng, Yuyang
    Wang, Tingting
    Fang, Xiaofen
    Wang, Lina
    Fang, Kai
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 876 - 888
  • [39] SPGformer: Serial-Parallel Hybrid GCN-Transformer With Graph-Oriented Encoder for 2-D-to-3-D Human Pose Estimation
    Fang, Qin
    Xu, Zihan
    Hu, Mengxian
    Zeng, Qinyang
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 15
  • [40] Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation
    Tompson, Jonathan
    Jain, Arjun
    LeCun, Yann
    Bregler, Christoph
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27