Improving Badminton Action Recognition Using Spatio-Temporal Analysis and a Weighted Ensemble Learning Model

被引:0
|
作者
Asriani, Farida [1 ,2 ]
Azhari, Azhari [1 ]
Wahyono, Wahyono [1 ]
机构
[1] Univ Gadjah Mada, Dept Comp Sci & Elect, Yogyakarta 55281, Indonesia
[2] Univ Jenderal Soedirman, Elect Engn Dept, Purbalingga 53371, Indonesia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 81卷 / 02期
关键词
Weighted ensemble learning; badminton action; soft voting classifier; joint skeleton; fast dynamic time warping; spatiotemporal;
D O I
10.32604/cmc.2024.058193
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Incredible progress has been made in human action recognition (HAR), significantly impacting computer vision applications in sports analytics. However, identifying dynamic and complex movements in sports like badminton remains challenging due to the need for precise recognition accuracy and better management of complex motion patterns. Deep learning techniques like convolutional neural networks (CNNs), long short-term memory (LSTM), and graph convolutional networks (GCNs) improve recognition in large datasets, while the traditional machine learning methods like SVM (support vector machines), RF (random forest), and LR (logistic regression), combined with handcrafted features and ensemble approaches, perform well but struggle with the complexity of fast-paced sports like badminton. We proposed an ensemble learning model combining support vector machines (SVM), logistic regression (LR), random forest (RF), and adaptive boosting (AdaBoost) for badminton action recognition. The data in this study consist of video recordings of badminton stroke techniques, which have been extracted into spatiotemporal data. The three-dimensional distance between each skeleton point and the right hip represents the spatial features. The temporal features are the results of Fast Dynamic Time Warping (FDTW) calculations applied to 15 frames of each video sequence. The weighted ensemble model employs soft voting classifiers from SVM, LR, RF, and AdaBoost to enhance the accuracy of badminton action recognition. The E2 ensemble model, which combines SVM, LR, and AdaBoost, achieves the highest accuracy of 95.38%.
引用
收藏
页码:3079 / 3096
页数:18
相关论文
共 50 条
  • [1] LEARNING SPATIO-TEMPORAL DEPENDENCIES FOR ACTION RECOGNITION
    Cai, Qiao
    Yin, Yafeng
    Man, Hong
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3740 - 3744
  • [2] Action Recognition Using a Spatio-Temporal Model in Dynamic Scenes
    Chathuramali, K. G. Manosha
    Rodrigo, Ranga
    2014 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS), 2014,
  • [3] Spatio-Temporal Information for Action Recognition in Thermal Video Using Deep Learning Model
    Srihari, P.
    Harikiran, J.
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2022, 13 (08) : 669 - 680
  • [4] Spatio-Temporal Contrastive Learning for Compositional Action Recognition
    Gong, Yezi
    Pei, Mingtao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 424 - 438
  • [5] ENSEMBLE SPATIO-TEMPORAL DISTANCE NET FOR SKELETON BASED ACTION RECOGNITION
    Naveenkumar, M.
    Domnic, S.
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2019, 20 (03): : 485 - 494
  • [6] ACTION RECOGNITION USING SPATIO-TEMPORAL DIFFERENTIAL MOTION
    Yadav, Gaurav Kumar
    Sethi, Amit
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3415 - 3419
  • [7] Human Action Recognition Using Spatio-temporal Classification
    Fang, Chin-Hsien
    Chen, Ju-Chin
    Tseng, Chien-Chung
    Lien, Jenn-Jier James
    COMPUTER VISION - ACCV 2009, PT II, 2010, 5995 : 98 - 109
  • [8] Accelerated Learning of Discriminative Spatio-temporal Features for Action Recognition
    Varshney, Munender
    Rameshan, Renu
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [9] Supervised Spatio-Temporal Neighborhood Topology Learning for Action Recognition
    Ma, Andy J.
    Yuen, Pong C.
    Zou, Wilman W. W.
    Lai, Jian-Huang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (08) : 1447 - 1460
  • [10] Multimodal human action recognition based on spatio-temporal action representation recognition model
    Wu, Qianhan
    Huang, Qian
    Li, Xing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) : 16409 - 16430