Attention-enhanced gated recurrent unit for action recognition in tennis

被引:0
|
作者
Gao, Meng [1 ]
Ju, Bingchun [2 ]
机构
[1] Henan Finance Univ, Coll Sports & Hlth Management, Zhengzhou, Peoples R China
[2] Zhengzhou Univ Light Ind, Coll Sports, Zhengzhou, Peoples R China
关键词
Computer vision; Deep learning; Action recognition; Tennis; CONVOLUTIONAL NEURAL-NETWORK; LSTM;
D O I
10.7717/peerj-cs.1804
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human Action Recognition (HAR) is an essential topic in computer vision and artificial intelligence, focused on the automatic identification and categorization of human actions or activities from video sequences or sensor data. The goal of HAR is to teach machines to comprehend and interpret human movements, gestures, and behaviors, allowing fora wide range of applications in areas such as surveillance, healthcare, sports analysis, and human-computer interaction. HAR systems utilize a variety of techniques, including deep learning, motion analysis, and feature extraction, to capture and analyze the spatiotemporal characteristics of human actions. These systems have the capacity to distinguish between various actions, whether they are simple actions like walking and waving or more complex activities such as playing a musical instrument or performing sports maneuvers. HAR continues to be an active area of research and development, with the potential to enhance numerous real-world applications by providing machines with the ability to understand and respond to human actions effectively. In our study, we developed a HAR system to recognize actions in tennis using an attention-based gated recurrent unit (GRU), a prevalent recurrent neural network. The combination of GRU architecture and attention mechanism showed a significant improvement in prediction power compared to two other deep learning models. Our models were trained on the THETIS dataset, one of the standard medium-sized datasets for fine-grained tennis actions. The effectiveness of the proposed model was confirmed by three different types of image encoders: InceptionV3, DenseNet, and EfficientNetB5. The models developed with InceptionV3, DenseNet, and EfficientNetB5 achieved average ROC-AUC values of 0.97, 0.98, and 0.81, respectively. While, the models obtained average PR-AUC values of 0.84, 0.87, and 0.49 for InceptionV3, DenseNet, and EfficientNetB5 features, respectively. The experimental results confirmed the applicability of our proposed method in recognizing action in tennis and may be applied to other HAR problems.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Learning Attention-Enhanced Spatiotemporal Representation for Action Recognition
    Shi, Zhensheng
    Cao, Liangjie
    Guan, Cheng
    Zheng, Haiyong
    Gu, Zhaorui
    Yu, Zhibin
    Zheng, Bing
    [J]. IEEE ACCESS, 2020, 8 : 16785 - 16794
  • [2] ATTENTION-ENHANCED SENSORIMOTOR OBJECT RECOGNITION
    Thermos, Spyridon
    Papadopoulos, Georgios Th.
    Daras, Petros
    Potamianos, Gerasimos
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 336 - 340
  • [3] Attention-Based Gated Recurrent Unit for Gesture Recognition
    Khodabandelou, Ghazaleh
    Jung, Pyeong-Gook
    Amirat, Yacine
    Mohammed, Samer
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 18 (02) : 495 - 507
  • [4] Action recognition method based on multi-stream attention-enhanced recursive graph convolution
    Wang, Huaijun
    Bai, Bingqian
    Li, Junhuai
    Ke, Hui
    Xiang, Wei
    [J]. APPLIED INTELLIGENCE, 2024, 54 (20) : 10133 - 10147
  • [5] Gated Recurrent Unit Based On Feature Attention Mechanism For Physical Behavior Recognition Analysis
    Ying, Wen
    [J]. JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2023, 26 (03): : 357 - 365
  • [6] Adaptive Hierarchical Attention-Enhanced Gated Network Integrating Reviews for Item Recommendation
    Liu, Donghua
    Wu, Jia
    Li, Jing
    Du, Bo
    Chang, Jun
    Li, Xuefei
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2076 - 2090
  • [7] Attention-enhanced reservoir computing
    Koester, Felix
    Kanno, Kazutaka
    Ohkubo, Jun
    Uchida, Atsushi
    [J]. PHYSICAL REVIEW APPLIED, 2024, 22 (01):
  • [8] A NOVEL ATTENTION-BASED GATED RECURRENT UNIT AND ITS EFFICACY IN SPEECH EMOTION RECOGNITION
    Rajamani, Srividya Tirunellai
    Rajamani, Kumar T.
    Mallol-Ragolta, Adria
    Liu, Shuo
    Schuller, Bjoern
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6294 - 6298
  • [9] A model for electroencephalogram emotion recognition: Residual block-gated recurrent unit with attention mechanism
    Wang, Yujie
    Zhang, Xiu
    Zhang, Xin
    Sun, Baiwei
    Xu, Bingyue
    [J]. REVIEW OF SCIENTIFIC INSTRUMENTS, 2024, 95 (08):
  • [10] Residual attention unit for action recognition
    Liao, Zhongke
    Hu, Haifeng
    Zhang, Junxuan
    Yin, Chang
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 189