Keypoint Detection Method for Single Person Gymnastics Actions Based on Multi-Scale Incremental Learning

被引:0
|
作者
Jiang J.-H. [1 ]
Xia N. [1 ]
Li C.-W. [1 ]
Zhou S.-Y. [1 ]
Yu X.-M. [1 ]
机构
[1] School of Information Science and Engineering, Dalian Polytecnic University, Liaoning, Dalian
来源
关键词
gymnastics actions; human keypoint detection; incremental learning; multi-resolution network; weight fusion;
D O I
10.12263/DZXB.20230729
中图分类号
学科分类号
摘要
Keypoint detection of human body is a hot research area in computer vision. At present there exist some problems for keypoint detection in gymnastics actions, such as insufficient detection accuracy and lack of capability to detect detailed body parts. In order to improve the detection accuracy, this paper proposes a multi-resolution network that has a larger receptive field in the shallow layers and can utilize high-resolution channel to enhance the extraction of detailed features. To achieve the detection of keypoints of hands and feet, an incremental learning network is designed. The network fuses the shallow features of the multi-resolution network and computes deep features using a gymnastics actions self-built dataset, so that the detection ability of keypoints on hands and feet is improved. Finally, the output results of the two sub-networks are concated. Computer simulations demonstrate that the multi-resolution network achieves an accuracy rate of 94.4% on the COCO2017 keypoint detection dataset, and the incremental learning network can accurately detect keypoints of detailed body parts with fewer training data. © 2024 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:1730 / 1742
页数:12
相关论文
共 40 条
  • [1] ZHANG S Q, WANG C F, DONG W L, Et al., A survey on depth ambiguity of 3D human pose estimation, Applied Sciences, 12, 20, (2022)
  • [2] LUO H L, TONG K, KONG F S., The progress of human action recognition in videos based on deep learning: A review, Acta Electronica Sinica, 47, 5, pp. 1162-1173, (2019)
  • [3] LIU S L., Research on Dance Action Recognition Based on Deep Learning, (2022)
  • [4] REN X Y, JIANG L B, ZHONG W J, Et al., A vision-based method for 3D pose estimation of non-cooperative space target, Journal of Electronics & Information Technology, 43, 12, pp. 3476-3485, (2021)
  • [5] CHEN Y L, WANG Z C, PENG Y X, Et al., Cascaded pyramid network for multi-person pose estimation, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7103-7112, (2018)
  • [6] FANG H S, XIE S Q, TAI Y W, Et al., RMPE: Regional multi-person pose estimation, 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2353-2362, (2017)
  • [7] NEWELL A, YANG K Y, DENG J., Stacked hourglass networks for human pose estimation, European Conference on Computer Vision, pp. 483-499, (2016)
  • [8] SUN K, XIAO B, LIU D, Et al., Deep high-resolution representation learning for human pose estimation, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5686-5696, (2019)
  • [9] CAO Z, HIDALGO G, SIMON T, Et al., OpenPose: Real-time multi-person 2D pose estimation using part affinity fields, IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 1, pp. 172-186, (2021)
  • [10] INSAFUTDINOV E, PISHCHULIN L, ANDRES B, Et al., DeeperCut: A deeper, stronger, and faster multi-person pose estimation model, European Conference on Computer Vision, pp. 34-50, (2016)