Improving Badminton Action Recognition Using Spatio-Temporal Analysis and a Weighted Ensemble Learning Model

被引:0
|
作者
Asriani, Farida [1 ,2 ]
Azhari, Azhari [1 ]
Wahyono, Wahyono [1 ]
机构
[1] Univ Gadjah Mada, Dept Comp Sci & Elect, Yogyakarta 55281, Indonesia
[2] Univ Jenderal Soedirman, Elect Engn Dept, Purbalingga 53371, Indonesia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 81卷 / 02期
关键词
Weighted ensemble learning; badminton action; soft voting classifier; joint skeleton; fast dynamic time warping; spatiotemporal;
D O I
10.32604/cmc.2024.058193
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Incredible progress has been made in human action recognition (HAR), significantly impacting computer vision applications in sports analytics. However, identifying dynamic and complex movements in sports like badminton remains challenging due to the need for precise recognition accuracy and better management of complex motion patterns. Deep learning techniques like convolutional neural networks (CNNs), long short-term memory (LSTM), and graph convolutional networks (GCNs) improve recognition in large datasets, while the traditional machine learning methods like SVM (support vector machines), RF (random forest), and LR (logistic regression), combined with handcrafted features and ensemble approaches, perform well but struggle with the complexity of fast-paced sports like badminton. We proposed an ensemble learning model combining support vector machines (SVM), logistic regression (LR), random forest (RF), and adaptive boosting (AdaBoost) for badminton action recognition. The data in this study consist of video recordings of badminton stroke techniques, which have been extracted into spatiotemporal data. The three-dimensional distance between each skeleton point and the right hip represents the spatial features. The temporal features are the results of Fast Dynamic Time Warping (FDTW) calculations applied to 15 frames of each video sequence. The weighted ensemble model employs soft voting classifiers from SVM, LR, RF, and AdaBoost to enhance the accuracy of badminton action recognition. The E2 ensemble model, which combines SVM, LR, and AdaBoost, achieves the highest accuracy of 95.38%.
引用
收藏
页码:3079 / 3096
页数:18
相关论文
共 50 条
  • [41] Spatio-Temporal Feature Extraction and Distance Metric Learning for Unconstrained Action Recognition
    Yoon, Yongsang
    Yu, Jongmin
    Jeon, Moongu
    2019 16TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2019,
  • [42] Human Action Recognition by Learning Spatio-Temporal Features With Deep Neural Networks
    Wang, Lei
    Xu, Yangyang
    Cheng, Jun
    Xia, Haiying
    Yin, Jianqin
    Wu, Jiaji
    IEEE ACCESS, 2018, 6 : 17913 - 17922
  • [43] Semi-CNN Architecture for Effective Spatio-Temporal Learning in Action Recognition
    Leong, Mei Chee
    Prasad, Dilip K.
    Lee, Yong Tsui
    Lin, Feng
    APPLIED SCIENCES-BASEL, 2020, 10 (02):
  • [44] Action recognition method of spatio-temporal feature fusion deep learning network
    Pei, Xiaomin
    Fan, Huijie
    Tang, Yandong
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2018, 47 (02):
  • [45] Learning Action-guided Spatio-temporal Transformer for Group Activity Recognition
    Li, Wei
    Yang, Tianzhao
    Wu, Xiao
    Du, Xian-Jun
    Qiao, Jian-Jun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2051 - 2060
  • [46] Learning to track for spatio-temporal action localization
    Weinzaepfel, Philippe
    Harchaoui, Zaid
    Schmid, Cordelia
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3164 - 3172
  • [47] Spatio-temporal Video Autoencoder for Human Action Recognition
    Sousa e Santos, Anderson Carlos
    Pedrini, Helio
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 114 - 123
  • [48] Spatio-Temporal Steerable Pyramid for Human Action Recognition
    Zhen, Xiantong
    Shao, Ling
    2013 10TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), 2013,
  • [49] Projection transform on spatio-temporal context for action recognition
    Wanru Xu
    Zhenjiang Miao
    Qiang Zhang
    Multimedia Tools and Applications, 2015, 74 : 7711 - 7728
  • [50] Spatio-Temporal Covariance Descriptors for Action and Gesture Recognition
    Sanin, Andres
    Sanderson, Conrad
    Harandi, Mehrtash T.
    Lovell, Brian C.
    2013 IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION (WACV), 2013, : 103 - 110