Joint graph convolution networks and transformer for human pose estimation in sports technique analysis

Cited by: 2
Authors
Cheng, Hongren [1 ,2 ]
Wang, Jing [3 ]
Zhao, Anran [4 ]
Zhong, Yaping [1 ,2 ]
Li, Jingli [5 ]
Dong, Liangshan [6 ]
Affiliations
[1] Wuhan Sports Univ, Sports Big Data Res Ctr, Wuhan 430079, Peoples R China
[2] Hubei Prov Sports & Hlth Innovat Dev Res Ctr, Wuhan 430079, Hubei, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Sch Automat, Chongqing 400065, Peoples R China
[4] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
[5] Huazhong Univ Sci & Technol, Sch Phys Educ, Wuhan 430074, Peoples R China
[6] China Univ Geosci, Sch Phys Educ, Wuhan 430074, Peoples R China
Keywords
Human pose estimation; Graph convolutional network; Transformer; The topological structure between; IMAGE STEGANOGRAPHY METHOD;
DOI
10.1016/j.jksuci.2023.101819
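Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract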
Human pose estimation has various applications in domains such as sports technology analysis, virtual reality, and education. However, most previous studies focused on the feature representation of each keypoint individually and disregarded the topological relationships among keypoints. To address this challenge, we propose GTPose, a network structure that integrates graph convolutional networks (GCN) and a Transformer. First, a set of multi-scale convolution operations is applied to extract local feature maps from images. Second, the positions of keypoints are roughly estimated by using the Transformer to process the sequential relations between feature maps. Finally, a GCN is adopted to model the topological structure between keypoints, in order to localize the keypoints accurately and learn their feature representations. The performance of GTPose is evaluated on two real datasets: MS COCO and MPII. Experimental results demonstrate that GTPose outperforms the compared methods in human pose estimation tasks. In addition, the results show that the spatial relationships between keypoints are effective for accurately characterizing keypoints.
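The final stage described in the abstract, a GCN that models the topology between keypoints, can be illustrated with a minimal sketch. This is not the GTPose implementation: it assumes the widely used normalized-adjacency propagation rule ReLU(Â H W), with Â = D^{-1/2}(A + I)D^{-1/2}, and the 5-keypoint skeleton, the coarse coordinates, and the identity weight matrix are hypothetical values chosen only for illustration.

```python
import math


def normalized_adjacency(num_nodes, edges):
    """Return D^{-1/2} (A + I) D^{-1/2} for an undirected skeleton graph."""
    a = [[0.0] * num_nodes for _ in range(num_nodes)]
    for i in range(num_nodes):
        a[i][i] = 1.0  # self-loop so each keypoint keeps part of its own feature
    for i, j in edges:
        a[i][j] = 1.0
        a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [
        [a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(num_nodes)]
        for i in range(num_nodes)
    ]


def matmul(x, y):
    """Plain matrix product of two lists of lists."""
    return [
        [sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
        for i in range(len(x))
    ]


def gcn_layer(a_hat, features, weight):
    """One graph convolution: ReLU(a_hat @ features @ weight)."""
    h = matmul(matmul(a_hat, features), weight)
    return [[max(0.0, v) for v in row] for row in h]


# Hypothetical skeleton: 0 head, 1 neck, 2 left shoulder, 3 right shoulder, 4 pelvis.
EDGES = [(0, 1), (1, 2), (1, 3), (1, 4)]
A_HAT = normalized_adjacency(5, EDGES)

# Hypothetical coarse (x, y) estimates, standing in for the Transformer stage output.
coarse = [[0.5, 0.1], [0.5, 0.3], [0.3, 0.35], [0.7, 0.35], [0.5, 0.7]]

# Identity weights (an assumption; a trained layer would learn this matrix).
identity = [[1.0, 0.0], [0.0, 1.0]]

refined = gcn_layer(A_HAT, coarse, identity)
```

With identity weights, each row of `refined` is a degree-normalized mix of a keypoint's own feature and those of its skeleton neighbors, which is the sense in which a graph convolution injects topological context into per-keypoint features.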
Pages: 8
Related papers
50 in total
  • [1] SCGFormer: Semantic Chebyshev Graph Convolution Transformer for 3D Human Pose Estimation
    Liang, Jiayao
    Yin, Mengxiao
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [2] HOGFormer: high-order graph convolution transformer for 3D human pose estimation
    Xie, Yuhong
    Hong, Chaoqun
    Zhuang, Weiwei
    Liu, Lijuan
    Li, Jie
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, : 599 - 610
  • [3] PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation
    Zhu, Yiran
    Xu, Xing
    Shen, Fumin
    Ji, Yanli
    Gao, Lianli
    Shen, Heng Tao
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1359 - 1365
  • [4] Three-dimensional human pose estimation based on improved semantic graph convolution neural networks
    Yang, Chengkun
    Guo, Min
    Ma, Miao
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [5] Conditional Directed Graph Convolution for 3D Human Pose Estimation
    Hu, Wenbo
    Zhang, Changgong
    Zhan, Fangneng
    Zhang, Lei
    Wong, Tien-Tsin
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 602 - 611
  • [6] DGFormer: Dynamic graph transformer for 3D human pose estimation
    Chen Z.
    Dai J.
    Bai J.
    Pan J.
    Pattern Recognition, 2024, 152
  • [7] Aggregation Transformer for Human Pose Estimation
    Dong, Hao
    Wang, Guodong
    Zhang, Xinyue
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3660 - 3667
  • [8] Global Relation Reasoning Graph Convolutional Networks for Human Pose Estimation
    Wang, Rui
    Huang, Chenyang
    Wang, Xiangyang
    IEEE ACCESS, 2020, 8 : 38472 - 38480
  • [9] Structure-aware human pose estimation with graph convolutional networks
    Bin, Yanrui
    Chen, Zhao-Min
    Wei, Xiu-Shen
    Chen, Xinya
    Gao, Changxin
    Sang, Nong
    PATTERN RECOGNITION, 2020, 106
  • [10] Pose Relation Transformer Refine Occlusions for Human Pose Estimation
    Chi, Hyung-gun
    Chi, Seunggeun
    Chan, Stanley
    Ramani, Karthik
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 6138 - 6145