Workout Action Recognition in Video Streams Using an Attention Driven Residual DC-GRU Network

被引:5
|
作者
Dey, Arnab [1 ]
Biswas, Samit [1 ]
Le, Dac-Nhuong [2 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Sibpur 711103, Howrah, India
[2] Haiphong Univ, Fac Informat Technol, Haiphong 180000, Vietnam
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 79卷 / 02期
关键词
Workout action recognition; video stream; action recognition; residual network; GRU; attention; SPATIOTEMPORAL FEATURES; LSTM;
D O I
10.32604/cmc.2024.049512
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers the likelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions in video streams holds significant importance in computer vision research, as it aims to enhance exercise adherence, enable instant recognition, advance fitness tracking technologies, and optimize fitness routines. However, existing action datasets often lack diversity and specificity for workout actions, hindering the development of accurate recognition models. To address this gap, the Workout Action Video dataset (WAVd) has been introduced as a significant contribution. WAVd comprises a diverse collection of labeled workout action videos, meticulously curated to encompass various exercises performed by numerous individuals in different settings. This research proposes an innovative framework based on the Attention driven Residual Deep Convolutional-Gated Recurrent Unit (ResDCGRU) network for workout action recognition in video streams. Unlike image-based action recognition, videos contain spatio-temporal information, making the task more complex and challenging. While substantial progress has been made in this area, challenges persist in detecting subtle and complex actions, handling occlusions, and managing the computational demands of deep learning approaches. The proposed ResDC-GRU Attention model demonstrated exceptional classification performance with 95.81% accuracy in classifying workout action videos and also outperformed various state-of-the-art models. The method also yielded 81.6%, 97.2%, 95.6%, and 93.2% accuracy on established benchmark datasets, namely HMDB51, Youtube Actions, UCF50, and UCF101, respectively, showcasing its superiority and robustness in action recognition. The findings suggest practical implications in real-world scenarios where precise video action recognition is paramount, addressing the persisting challenges in the field. The WAVd dataset serves as a catalyst for the development of more robust and effective fitness tracking systems and ultimately promotes healthier lifestyles through improved exercise monitoring and analysis.
引用
收藏
页码:3067 / 3087
页数:21
相关论文
共 50 条
  • [1] Umpire's Signal Recognition in Cricket Using an Attention based DC-GRU Network
    Dey, A.
    Biswas, S.
    Abualigah, L.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2024, 37 (04): : 662 - 674
  • [2] Residual attention fusion network for video action recognition
    Li, Ao
    Yi, Yang
    Liang, Daan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [3] Video action recognition method based on attention residual network and LSTM
    Zhang, Yu
    Dong, Pengyue
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3611 - 3616
  • [4] TWIN-GRU: Twin stream GRU network for action recognition from RGB video
    Essefi, Hajer
    Ahmed, Olfa Ben
    Bidet-Ildei, Christel
    Blandin, Yannick
    Fernandez-Maloigne, Christine
    ICAART 2021 - Proceedings of the 13th International Conference on Agents and Artificial Intelligence, 2021, 2 : 351 - 359
  • [5] TWIN-GRU: Twin Stream GRU Network for Action Recognition from RGB Video
    Essefi, Hajer
    Ben Ahmed, Olfa
    Bidet-Ildei, Christel
    Blandin, Yannick
    Fernandez-Maloigne, Christine
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 351 - 359
  • [6] A hybrid attention-guided ConvNeXt-GRU network for action recognition
    An, Yiyuan
    Yi, Yingmin
    Han, Xiaoyong
    Wu, Li
    Su, Chunyi
    Liu, Bojun
    Xue, Xianghong
    Li, Yankai
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [7] Accurate recognition of human abnormal behaviours using adaptive 3D residual attention network with gated recurrent units (GRU) in the video sequences
    Balakrishnan, T. Suresh
    Jayalakshmi, D.
    Geetha, P.
    Raj, T. Saju
    Hemavathi, R.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2024, 12 (01):
  • [8] Recurrent Region Attention and Video Frame Attention Based Video Action Recognition Network Design
    Sang H.-F.
    Zhao Z.-Y.
    He D.-K.
    Zhao, Zi-Yu (Maikuraky1022@outlook.com), 1600, Chinese Institute of Electronics (48): : 1052 - 1061
  • [9] Multipath Attention and Adaptive Gating Network for Video Action Recognition
    Haiping Zhang
    Zepeng Hu
    Dongjin Yu
    Liming Guan
    Xu Liu
    Conghao Ma
    Neural Processing Letters, 56
  • [10] SDAN: Stacked Diverse Attention Network for Video Action Recognition
    Zhu, Xiaoguang
    Huang, Siran
    Fan, Wenjing
    Cheng, Yuhao
    Shao, Huaqing
    Liu, Peilin
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,