Improving Badminton Action Recognition Using Spatio-Temporal Analysis and a Weighted Ensemble Learning Model

被引：0

作者：

Asriani, Farida ^{[1
,2
]}

Azhari, Azhari ^{[1
]}

Wahyono, Wahyono ^{[1
]}

机构：

[1] Univ Gadjah Mada, Dept Comp Sci & Elect, Yogyakarta 55281, Indonesia

[2] Univ Jenderal Soedirman, Elect Engn Dept, Purbalingga 53371, Indonesia

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 81卷 / 02期

关键词：

Weighted ensemble learning; badminton action; soft voting classifier; joint skeleton; fast dynamic time warping; spatiotemporal;

D O I：

10.32604/cmc.2024.058193

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Incredible progress has been made in human action recognition (HAR), significantly impacting computer vision applications in sports analytics. However, identifying dynamic and complex movements in sports like badminton remains challenging due to the need for precise recognition accuracy and better management of complex motion patterns. Deep learning techniques like convolutional neural networks (CNNs), long short-term memory (LSTM), and graph convolutional networks (GCNs) improve recognition in large datasets, while the traditional machine learning methods like SVM (support vector machines), RF (random forest), and LR (logistic regression), combined with handcrafted features and ensemble approaches, perform well but struggle with the complexity of fast-paced sports like badminton. We proposed an ensemble learning model combining support vector machines (SVM), logistic regression (LR), random forest (RF), and adaptive boosting (AdaBoost) for badminton action recognition. The data in this study consist of video recordings of badminton stroke techniques, which have been extracted into spatiotemporal data. The three-dimensional distance between each skeleton point and the right hip represents the spatial features. The temporal features are the results of Fast Dynamic Time Warping (FDTW) calculations applied to 15 frames of each video sequence. The weighted ensemble model employs soft voting classifiers from SVM, LR, RF, and AdaBoost to enhance the accuracy of badminton action recognition. The E2 ensemble model, which combines SVM, LR, and AdaBoost, achieves the highest accuracy of 95.38%.

引用

页码：3079 / 3096

页数：18

共 50 条

[1] LEARNING SPATIO-TEMPORAL DEPENDENCIES FOR ACTION RECOGNITION
Cai, Qiao
Yin, Yafeng
Man, Hong
2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3740 - 3744
[2] Action Recognition Using a Spatio-Temporal Model in Dynamic Scenes
Chathuramali, K. G. Manosha
Rodrigo, Ranga
2014 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS), 2014,
[3] Spatio-Temporal Information for Action Recognition in Thermal Video Using Deep Learning Model
Srihari, P.
Harikiran, J.
INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2022, 13 (08) : 669 - 680
[4] Spatio-Temporal Contrastive Learning for Compositional Action Recognition
Gong, Yezi
Pei, Mingtao
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 424 - 438
[5] ENSEMBLE SPATIO-TEMPORAL DISTANCE NET FOR SKELETON BASED ACTION RECOGNITION
Naveenkumar, M.
Domnic, S.
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2019, 20 (03): : 485 - 494
[6] ACTION RECOGNITION USING SPATIO-TEMPORAL DIFFERENTIAL MOTION
Yadav, Gaurav Kumar
Sethi, Amit
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3415 - 3419
[7] Human Action Recognition Using Spatio-temporal Classification
Fang, Chin-Hsien
Chen, Ju-Chin
Tseng, Chien-Chung
Lien, Jenn-Jier James
COMPUTER VISION - ACCV 2009, PT II, 2010, 5995 : 98 - 109
[8] Accelerated Learning of Discriminative Spatio-temporal Features for Action Recognition
Varshney, Munender
Rameshan, Renu
2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
[9] Supervised Spatio-Temporal Neighborhood Topology Learning for Action Recognition
Ma, Andy J.
Yuen, Pong C.
Zou, Wilman W. W.
Lai, Jian-Huang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (08) : 1447 - 1460
[10] Multimodal human action recognition based on spatio-temporal action representation recognition model
Wu, Qianhan
Huang, Qian
Li, Xing
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) : 16409 - 16430

← 1 2 3 4 5 →