Image-Based Fitness Yoga Pose Recognition: Using Ensemble Learning and Multi-head Attention

被引：0

作者：

Kou, Yue ^{[1
]}

Li, Hai ^{[2
,3
]}

机构：

[1] Civil Aviat Flight Univ China, Coll Aviat Secur, Deyang 618307, Peoples R China

[2] Civil Aviat Flight Univ China, Coll Civil Aviat Safety Engn, Deyang 618307, Peoples R China

[3] Civil Aviat Flight Univ China, Civil Aircraft Fire Sci & Safety Engn Key Lab Sich, Deyang 618307, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS | 2024年 / 17卷 / 01期

关键词：

Fitness yoga movements; Residual network; VGG network; Multi-head attention; Ensemble learning;

D O I：

10.1007/s44196-024-00662-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the increasing awareness of fitness, more and more people are choosing to participate in fitness activities. Yoga, as a form of exercise that improves both physical and mental health, is becoming increasingly popular worldwide. In order to assist yoga practitioners in more effective training through automated or semi automated systems, improve training effectiveness, assist professional athletes in training through intelligent recognition systems, correct movements, and improve athletic performance. This paper proposes a method that addresses the low accuracy issue of current yoga pose recognition algorithms by integrating multi-head attention mechanism and ensemble learning. Firstly, the Mixup algorithm is used to enhance yoga movement images. Subsequently, convolutional features are extracted from the images using the ResNet101 and VGGNet19 transfer learning models. Finally, the extracted convolutional features are combined and stacked using a multi-head attention mechanism. Model training, validation, and testing are performed using the Soft target cross-entropy loss function. Experimental results demonstrate that the proposed method achieves a training accuracy of 100%, a validation accuracy of 89.94%, a testing accuracy of 93.79%, and a detection speed of 297 frames per second. Overall, this method demonstrates high stability and robustness, providing a technological foundation for intelligent recognition of yoga poses.

引用

页数：17

共 50 条

[1] Federated learning based multi-head attention framework for medical image classification
Firdaus, Naima
Raza, Zahid
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (27):
[2] MULTI-HEAD ATTENTION FOR SPEECH EMOTION RECOGNITION WITH AUXILIARY LEARNING OF GENDER RECOGNITION
Nediyanchath, Anish
Paramasivam, Periyasamy
Yenigalla, Promod
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7179 - 7183
[3] Deep Learning Based Mobilenet and Multi-Head Attention Model for Facial Expression Recognition
Nouisser, Aicha
Zouari, Ramzi
Kherallah, Monji
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 485 - 491
[4] A fiber recognition framework based on multi-head attention mechanism
Xu, Luoli
Li, Fenying
Chang, Shan
TEXTILE RESEARCH JOURNAL, 2024, 94 (23-24) : 2629 - 2640
[5] Self Multi-Head Attention for Speaker Recognition
India, Miquel
Safari, Pooyan
Hernando, Javier
INTERSPEECH 2019, 2019, : 4305 - 4309
[6] Music Emotion Recognition Using Multi-head Self-attention-Based Models
Xiao, Yao
Ruan, Haoxin
Zhao, Xujian
Jin, Peiquan
Cai, Xuebo
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 101 - 114
[7] Multi-stage Transfer Learning Based Yoga Pose Recognition Using CNN
Pradeep, Chakka Sai
Sinha, Neelam
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 151 - 159
[8] Personalized federated learning based on multi-head attention algorithm
Jiang, Shanshan
Lu, Meixia
Hu, Kai
Wu, Jiasheng
Li, Yaogen
Weng, Liguo
Xia, Min
Lin, Haifeng
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (11) : 3783 - 3798
[9] Personalized federated learning based on multi-head attention algorithm
Shanshan Jiang
Meixia Lu
Kai Hu
Jiasheng Wu
Yaogen Li
Liguo Weng
Min Xia
Haifeng Lin
International Journal of Machine Learning and Cybernetics, 2023, 14 : 3783 - 3798
[10] Speech recognition based on the transformer's multi-head attention in Arabic
Mahmoudi O.
Filali-Bouami M.
Benchat M.
International Journal of Speech Technology, 2024, 27 (01) : 211 - 223

← 1 2 3 4 5 →