Workout Action Recognition in Video Streams Using an Attention Driven Residual DC-GRU Network

Cited by: 5
Authors
Dey, Arnab [1]
Biswas, Samit [1]
Le, Dac-Nhuong [2]
Affiliations
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Sibpur 711103, Howrah, India
[2] Haiphong Univ, Fac Informat Technol, Haiphong 180000, Vietnam
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2024 / Vol. 79 / Issue 02
Keywords
Workout action recognition; video stream; action recognition; residual network; GRU; attention; spatiotemporal features; LSTM
DOI
10.32604/cmc.2024.049512
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Regular exercise is a crucial aspect of daily life: it keeps individuals physically active, lowers the likelihood of developing illnesses, and enhances life expectancy. Recognizing workout actions in video streams is an important problem in computer vision research, with applications in improving exercise adherence, enabling instant recognition, advancing fitness tracking technologies, and optimizing fitness routines. However, existing action datasets often lack diversity and specificity for workout actions, which hinders the development of accurate recognition models. To address this gap, the Workout Action Video dataset (WAVd) is introduced. WAVd comprises a diverse collection of labeled workout action videos, curated to cover various exercises performed by numerous individuals in different settings. This research proposes a framework based on the Attention driven Residual Deep Convolutional-Gated Recurrent Unit (ResDC-GRU) network for workout action recognition in video streams. Unlike images, videos contain spatio-temporal information, which makes video action recognition more complex and challenging than image-based action recognition. While substantial progress has been made in this area, challenges persist in detecting subtle and complex actions, handling occlusions, and managing the computational demands of deep learning approaches. The proposed ResDC-GRU Attention model achieved 95.81% accuracy in classifying workout action videos and outperformed various state-of-the-art models. The method also yielded 81.6%, 97.2%, 95.6%, and 93.2% accuracy on the established benchmark datasets HMDB51, YouTube Actions, UCF50, and UCF101, respectively, indicating its robustness in action recognition. The findings suggest practical implications for real-world scenarios where precise video action recognition is paramount. The WAVd dataset is intended to support the development of more robust and effective fitness tracking systems and, ultimately, to promote healthier lifestyles through improved exercise monitoring and analysis.
Pages: 3067-3087
Page count: 21