Workout Action Recognition in Video Streams Using an Attention Driven Residual DC-GRU Network

被引:5
|
作者
Dey, Arnab [1 ]
Biswas, Samit [1 ]
Le, Dac-Nhuong [2 ]
机构
[1] Indian Inst Engn Sci & Technol, Dept Comp Sci & Technol, Sibpur 711103, Howrah, India
[2] Haiphong Univ, Fac Informat Technol, Haiphong 180000, Vietnam
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 79卷 / 02期
关键词
Workout action recognition; video stream; action recognition; residual network; GRU; attention; SPATIOTEMPORAL FEATURES; LSTM;
D O I
10.32604/cmc.2024.049512
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers the likelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions in video streams holds significant importance in computer vision research, as it aims to enhance exercise adherence, enable instant recognition, advance fitness tracking technologies, and optimize fitness routines. However, existing action datasets often lack diversity and specificity for workout actions, hindering the development of accurate recognition models. To address this gap, the Workout Action Video dataset (WAVd) has been introduced as a significant contribution. WAVd comprises a diverse collection of labeled workout action videos, meticulously curated to encompass various exercises performed by numerous individuals in different settings. This research proposes an innovative framework based on the Attention driven Residual Deep Convolutional-Gated Recurrent Unit (ResDCGRU) network for workout action recognition in video streams. Unlike image-based action recognition, videos contain spatio-temporal information, making the task more complex and challenging. While substantial progress has been made in this area, challenges persist in detecting subtle and complex actions, handling occlusions, and managing the computational demands of deep learning approaches. The proposed ResDC-GRU Attention model demonstrated exceptional classification performance with 95.81% accuracy in classifying workout action videos and also outperformed various state-of-the-art models. The method also yielded 81.6%, 97.2%, 95.6%, and 93.2% accuracy on established benchmark datasets, namely HMDB51, Youtube Actions, UCF50, and UCF101, respectively, showcasing its superiority and robustness in action recognition. The findings suggest practical implications in real-world scenarios where precise video action recognition is paramount, addressing the persisting challenges in the field. The WAVd dataset serves as a catalyst for the development of more robust and effective fitness tracking systems and ultimately promotes healthier lifestyles through improved exercise monitoring and analysis.
引用
收藏
页码:3067 / 3087
页数:21
相关论文
共 50 条
  • [31] DC3D: A Video Action Recognition Network Based on Dense Connection
    Mu, Xiaofang
    Liu, Zhenyu
    Liu, Jiaji
    Li, Hao
    Li, Yue
    Li, Yikun
    2022 TENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, CBD, 2022, : 133 - 138
  • [32] Dynamic Representation Learning for Video Action Recognition Using Temporal Residual Networks
    Kong, Yongqiang
    Huang, Jianhui
    Huang, Shanshan
    Wei, Zhengang
    Wang, Shengke
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 331 - 337
  • [33] Action Recognition in Still Images using Residual Neural Network Features
    Sreela, S. R.
    Idicula, Sumam Mary
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 563 - 569
  • [34] Basketball Action Recognition Method of Deep Neural Network Based on Dynamic Residual Attention Mechanism
    Xiao, Jiongen
    Tian, Wenchun
    Ding, Liping
    INFORMATION, 2023, 14 (01)
  • [35] Action recognition for sports video analysis using part-attention spatio-temporal graph convolutional network
    Liu, Jiatong
    Che, Yanli
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
  • [36] Attention-based network for effective action recognition from multi-view video
    Hoang-Thuyen Nguyen
    Thi-Oanh Nguyen
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 971 - 980
  • [37] Attention-Based Video Disentangling and Matching Network for Zero-Shot Action Recognition
    Su, Yong
    Zhu, Shuang
    Xing, Meng
    Xu, Hengpeng
    Li, Zhengtao
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, VOL. 1, 2022, 878 : 368 - 375
  • [38] 3D-STARNET: Spatial-Temporal Attention Residual Network for Robust Action Recognition
    Yang, Jun
    Sun, Shulong
    Chen, Jiayue
    Xie, Haizhen
    Wang, Yan
    Yang, Zenglong
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [39] AR3D: Attention Residual 3D Network for Human Action Recognition
    Dong, Min
    Fang, Zhenglin
    Li, Yongfa
    Bi, Sheng
    Chen, Jiangcheng
    SENSORS, 2021, 21 (05) : 1 - 15
  • [40] Recurrent attention network using spatial-temporal relations for action recognition
    Zhang, Mingxing
    Yang, Yang
    Ji, Yanli
    Xie, Ning
    Shen, Fumin
    SIGNAL PROCESSING, 2018, 145 : 137 - 145