Image-Based Fitness Yoga Pose Recognition: Using Ensemble Learning and Multi-head Attention

被引:0
|
作者
Kou, Yue [1 ]
Li, Hai [2 ,3 ]
机构
[1] Civil Aviat Flight Univ China, Coll Aviat Secur, Deyang 618307, Peoples R China
[2] Civil Aviat Flight Univ China, Coll Civil Aviat Safety Engn, Deyang 618307, Peoples R China
[3] Civil Aviat Flight Univ China, Civil Aircraft Fire Sci & Safety Engn Key Lab Sich, Deyang 618307, Peoples R China
关键词
Fitness yoga movements; Residual network; VGG network; Multi-head attention; Ensemble learning;
D O I
10.1007/s44196-024-00662-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing awareness of fitness, more and more people are choosing to participate in fitness activities. Yoga, as a form of exercise that improves both physical and mental health, is becoming increasingly popular worldwide. In order to assist yoga practitioners in more effective training through automated or semi automated systems, improve training effectiveness, assist professional athletes in training through intelligent recognition systems, correct movements, and improve athletic performance. This paper proposes a method that addresses the low accuracy issue of current yoga pose recognition algorithms by integrating multi-head attention mechanism and ensemble learning. Firstly, the Mixup algorithm is used to enhance yoga movement images. Subsequently, convolutional features are extracted from the images using the ResNet101 and VGGNet19 transfer learning models. Finally, the extracted convolutional features are combined and stacked using a multi-head attention mechanism. Model training, validation, and testing are performed using the Soft target cross-entropy loss function. Experimental results demonstrate that the proposed method achieves a training accuracy of 100%, a validation accuracy of 89.94%, a testing accuracy of 93.79%, and a detection speed of 297 frames per second. Overall, this method demonstrates high stability and robustness, providing a technological foundation for intelligent recognition of yoga poses.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Federated learning based multi-head attention framework for medical image classification
    Firdaus, Naima
    Raza, Zahid
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (27):
  • [2] MULTI-HEAD ATTENTION FOR SPEECH EMOTION RECOGNITION WITH AUXILIARY LEARNING OF GENDER RECOGNITION
    Nediyanchath, Anish
    Paramasivam, Periyasamy
    Yenigalla, Promod
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7179 - 7183
  • [3] Deep Learning Based Mobilenet and Multi-Head Attention Model for Facial Expression Recognition
    Nouisser, Aicha
    Zouari, Ramzi
    Kherallah, Monji
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 485 - 491
  • [4] A fiber recognition framework based on multi-head attention mechanism
    Xu, Luoli
    Li, Fenying
    Chang, Shan
    TEXTILE RESEARCH JOURNAL, 2024, 94 (23-24) : 2629 - 2640
  • [5] Self Multi-Head Attention for Speaker Recognition
    India, Miquel
    Safari, Pooyan
    Hernando, Javier
    INTERSPEECH 2019, 2019, : 4305 - 4309
  • [6] Music Emotion Recognition Using Multi-head Self-attention-Based Models
    Xiao, Yao
    Ruan, Haoxin
    Zhao, Xujian
    Jin, Peiquan
    Cai, Xuebo
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 101 - 114
  • [7] Multi-stage Transfer Learning Based Yoga Pose Recognition Using CNN
    Pradeep, Chakka Sai
    Sinha, Neelam
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 151 - 159
  • [8] Personalized federated learning based on multi-head attention algorithm
    Jiang, Shanshan
    Lu, Meixia
    Hu, Kai
    Wu, Jiasheng
    Li, Yaogen
    Weng, Liguo
    Xia, Min
    Lin, Haifeng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (11) : 3783 - 3798
  • [9] Personalized federated learning based on multi-head attention algorithm
    Shanshan Jiang
    Meixia Lu
    Kai Hu
    Jiasheng Wu
    Yaogen Li
    Liguo Weng
    Min Xia
    Haifeng Lin
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 3783 - 3798
  • [10] Speech recognition based on the transformer's multi-head attention in Arabic
    Mahmoudi O.
    Filali-Bouami M.
    Benchat M.
    International Journal of Speech Technology, 2024, 27 (01) : 211 - 223