Image-Based Fitness Yoga Pose Recognition: Using Ensemble Learning and Multi-head Attention

被引：0

作者：

Kou, Yue ^{[1
]}

Li, Hai ^{[2
,3
]}

机构：

[1] Civil Aviat Flight Univ China, Coll Aviat Secur, Deyang 618307, Peoples R China

[2] Civil Aviat Flight Univ China, Coll Civil Aviat Safety Engn, Deyang 618307, Peoples R China

[3] Civil Aviat Flight Univ China, Civil Aircraft Fire Sci & Safety Engn Key Lab Sich, Deyang 618307, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS | 2024年 / 17卷 / 01期

关键词：

Fitness yoga movements; Residual network; VGG network; Multi-head attention; Ensemble learning;

D O I：

10.1007/s44196-024-00662-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the increasing awareness of fitness, more and more people are choosing to participate in fitness activities. Yoga, as a form of exercise that improves both physical and mental health, is becoming increasingly popular worldwide. In order to assist yoga practitioners in more effective training through automated or semi automated systems, improve training effectiveness, assist professional athletes in training through intelligent recognition systems, correct movements, and improve athletic performance. This paper proposes a method that addresses the low accuracy issue of current yoga pose recognition algorithms by integrating multi-head attention mechanism and ensemble learning. Firstly, the Mixup algorithm is used to enhance yoga movement images. Subsequently, convolutional features are extracted from the images using the ResNet101 and VGGNet19 transfer learning models. Finally, the extracted convolutional features are combined and stacked using a multi-head attention mechanism. Model training, validation, and testing are performed using the Soft target cross-entropy loss function. Experimental results demonstrate that the proposed method achieves a training accuracy of 100%, a validation accuracy of 89.94%, a testing accuracy of 93.79%, and a detection speed of 297 frames per second. Overall, this method demonstrates high stability and robustness, providing a technological foundation for intelligent recognition of yoga poses.

引用

页数：17

共 50 条

[41] DROPOUT MULTI-HEAD ATTENTION FOR SINGLE IMAGE SUPER-RESOLUTION
Yang, Chao
Fan, Yong
Lu, Cheng
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2655 - 2659
[42] Image-based Pose Representation for Action Recognition and Hand Gesture Recognition
Lin, Zeyi
Zhang, Wei
Deng, Xiaoming
Ma, Cuixia
Wang, Hongan
2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 532 - 539
[43] Predicting the Urban Water Demand Based on Transfer Learning Method With Multi-head Attention
Chen, Zhuo
Deng, Chuhan
Che, Fei
Li, Yan
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3760 - 3765
[44] Multi-Attention Cascade Model Based on Multi-Head Structure for Image-Text Retrieval
Zhang, Haotian
Wu, Wei
Zhang, Meng
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[45] A Dual Multi-Head Contextual Attention Network for Hyperspectral Image Classification
Liang, Miaomiao
He, Qinghua
Yu, Xiangchun
Wang, Huai
Meng, Zhe
Jiao, Licheng
REMOTE SENSING, 2022, 14 (13)
[46] Federated Reinforcement Learning Based on Multi-head Attention Mechanism for Vehicle Edge Caching
Li, XinRan
Wei, ZhenChun
Lyu, ZengWei
Yuan, XiaoHui
Xu, Juan
Zhang, ZeYu
WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PT III, 2022, 13473 : 648 - 656
[47] Building pattern recognition by using an edge-attention multi-head graph convolutional network
Wang, Haitao
Xu, Yongyang
Hu, Anna
Xie, Xuejing
Chen, Siqiong
Xie, Zhong
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2025, 39 (04) : 732 - 757
[48] Riding feeling recognition based on multi-head self-attention LSTM for driverless automobile
Tang, Xianzhi
Xie, Yongjia
Li, Xinlong
Wang, Bo
PATTERN RECOGNITION, 2025, 159
[49] Multiscaled Multi-Head Attention-Based Video Transformer Network for Hand Gesture Recognition
Garg, Mallika
Ghosh, Debashis
Pradhan, Pyari Mohan
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 80 - 84
[50] Multi-head attention fusion networks for multi-modal speech emotion recognition
Zhang, Junfeng
Xing, Lining
Tan, Zhen
Wang, Hongsen
Wang, Kesheng
COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 168

← 1 2 3 4 5 →