Image-Based Fitness Yoga Pose Recognition: Using Ensemble Learning and Multi-head Attention

被引:0
|
作者
Kou, Yue [1 ]
Li, Hai [2 ,3 ]
机构
[1] Civil Aviat Flight Univ China, Coll Aviat Secur, Deyang 618307, Peoples R China
[2] Civil Aviat Flight Univ China, Coll Civil Aviat Safety Engn, Deyang 618307, Peoples R China
[3] Civil Aviat Flight Univ China, Civil Aircraft Fire Sci & Safety Engn Key Lab Sich, Deyang 618307, Peoples R China
关键词
Fitness yoga movements; Residual network; VGG network; Multi-head attention; Ensemble learning;
D O I
10.1007/s44196-024-00662-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing awareness of fitness, more and more people are choosing to participate in fitness activities. Yoga, as a form of exercise that improves both physical and mental health, is becoming increasingly popular worldwide. In order to assist yoga practitioners in more effective training through automated or semi automated systems, improve training effectiveness, assist professional athletes in training through intelligent recognition systems, correct movements, and improve athletic performance. This paper proposes a method that addresses the low accuracy issue of current yoga pose recognition algorithms by integrating multi-head attention mechanism and ensemble learning. Firstly, the Mixup algorithm is used to enhance yoga movement images. Subsequently, convolutional features are extracted from the images using the ResNet101 and VGGNet19 transfer learning models. Finally, the extracted convolutional features are combined and stacked using a multi-head attention mechanism. Model training, validation, and testing are performed using the Soft target cross-entropy loss function. Experimental results demonstrate that the proposed method achieves a training accuracy of 100%, a validation accuracy of 89.94%, a testing accuracy of 93.79%, and a detection speed of 297 frames per second. Overall, this method demonstrates high stability and robustness, providing a technological foundation for intelligent recognition of yoga poses.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] DROPOUT MULTI-HEAD ATTENTION FOR SINGLE IMAGE SUPER-RESOLUTION
    Yang, Chao
    Fan, Yong
    Lu, Cheng
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2655 - 2659
  • [42] Image-based Pose Representation for Action Recognition and Hand Gesture Recognition
    Lin, Zeyi
    Zhang, Wei
    Deng, Xiaoming
    Ma, Cuixia
    Wang, Hongan
    2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 532 - 539
  • [43] Predicting the Urban Water Demand Based on Transfer Learning Method With Multi-head Attention
    Chen, Zhuo
    Deng, Chuhan
    Che, Fei
    Li, Yan
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3760 - 3765
  • [44] Multi-Attention Cascade Model Based on Multi-Head Structure for Image-Text Retrieval
    Zhang, Haotian
    Wu, Wei
    Zhang, Meng
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [45] A Dual Multi-Head Contextual Attention Network for Hyperspectral Image Classification
    Liang, Miaomiao
    He, Qinghua
    Yu, Xiangchun
    Wang, Huai
    Meng, Zhe
    Jiao, Licheng
    REMOTE SENSING, 2022, 14 (13)
  • [46] Federated Reinforcement Learning Based on Multi-head Attention Mechanism for Vehicle Edge Caching
    Li, XinRan
    Wei, ZhenChun
    Lyu, ZengWei
    Yuan, XiaoHui
    Xu, Juan
    Zhang, ZeYu
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PT III, 2022, 13473 : 648 - 656
  • [47] Building pattern recognition by using an edge-attention multi-head graph convolutional network
    Wang, Haitao
    Xu, Yongyang
    Hu, Anna
    Xie, Xuejing
    Chen, Siqiong
    Xie, Zhong
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2025, 39 (04) : 732 - 757
  • [48] Riding feeling recognition based on multi-head self-attention LSTM for driverless automobile
    Tang, Xianzhi
    Xie, Yongjia
    Li, Xinlong
    Wang, Bo
    PATTERN RECOGNITION, 2025, 159
  • [49] Multiscaled Multi-Head Attention-Based Video Transformer Network for Hand Gesture Recognition
    Garg, Mallika
    Ghosh, Debashis
    Pradhan, Pyari Mohan
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 80 - 84
  • [50] Multi-head attention fusion networks for multi-modal speech emotion recognition
    Zhang, Junfeng
    Xing, Lining
    Tan, Zhen
    Wang, Hongsen
    Wang, Kesheng
    COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 168