Recognizing human activities with the use of Convolutional Block Attention Module

被引:0
|
作者
Zakariah, Mohammed [1 ]
Alnuaim, Abeer [1 ]
机构
[1] King Saud Univ, Coll Appl Studies & Community Serv, Dept Comp Sci & Engn, POB 22459, Riyadh 11495, Saudi Arabia
关键词
Human activity recognition; Human behaviour recognition; Deep-learning; Convolutional Block Attention Module (CBAM); Convolution Neural Network; Spatial Attention Module; HUMAN ACTION RECOGNITION;
D O I
10.1016/j.eij.2024.100536
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human Activity Recognition (HAR) is crucial for the advancement of applications in smart environments, communication, IoT, security, and healthcare monitoring. Convolutional neural networks (CNNs) have made substantial contributions to human activity recognition (HAR). However, they frequently encounter difficulties in accurately discerning intricate human actions in real-time situations. This study aims to fill a significant research gap by incorporating the Convolutional Block Attention Module (CBAM) into CNN architectures. The goal is to improve the extraction of features from video sequences. The CBAM boosts the performance of the network by selectively prioritizing significant spatial and channel-wise data, resulting in improved detection of subtle activity patterns and increased stability in categorization. CBAM's attention mechanism directly focuses and amplifies essential characteristics, which sets it apart from typical CNNs that lack a refined focus mechanism. This unique approach results in improved performance in behavior identification tests. The proposed CBAMenhanced model has been extensively tested on benchmark datasets, yielding an accuracy of 94.23% on the HMDB51 dataset. It also achieved competitive results of 83.4% and 88.9% on the UCF-101 and UCF-50 datasets, respectively. However, there is still a lack of study in comprehending how CBAM adjusts to different CNN architectures and its suitability in varied HAR situations beyond controlled datasets. In future studies, it is imperative for researchers to investigate the integration of CBAM with other CNN frameworks, assess its efficacy in practical scenarios, and explore multi-modal sensor fusion techniques to enhance its reliability and utility. This study showcases the ability of CBAM to enhance HAR capabilities and also paves the way for future research to improve activity identification systems for wider and more practical uses.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] High-precision Gesture Recognition Based on DenseNet and Convolutional Block Attention Module
    Zhao Y.
    Song Y.
    Wu H.
    He S.
    Liu P.
    Wu L.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (03): : 967 - 976
  • [32] An Attention Module for Convolutional Neural Networks
    Zhu, Baozhou
    Hofstee, Peter
    Lee, Jinho
    Al-Ars, Zaid
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 167 - 178
  • [33] WCAM: Wavelet Convolutional Attention Module
    Alaba, Simegnew Yihunie
    Ball, John E.
    SOUTHEASTCON 2024, 2024, : 854 - 859
  • [34] ADCM: attention dropout convolutional module
    Liu, Zhigang
    Du, Juan
    Wang, Mei
    Ge, Shuzhi Sam
    NEUROCOMPUTING, 2020, 394 : 95 - 104
  • [35] Recognition of sports and daily activities through deep learning and convolutional block attention
    Mekruksavanich S.
    Phaphan W.
    Hnoohom N.
    Jitpattanakul A.
    PeerJ Computer Science, 2024, 10
  • [36] Recognition of sports and daily activities through deep learning and convolutional block attention
    Mekruksavanich, Sakorn
    Phaphan, Wikanda
    Hnoohom, Narit
    Jitpattanakul, Anuchit
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [37] Human Violence Detection in Videos Using Key Frame Identification and 3D CNN with Convolutional Block Attention Module
    Akula, Venkatesh
    Kavati, Ilaiah
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (12) : 7924 - 7950
  • [38] Recognizing human activities
    Masoud, O
    Papanikolopoulos, N
    IEEE CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, 2003, : 157 - 162
  • [39] MONAURAL SPEECH ENHANCEMENT WITH COMPLEX CONVOLUTIONAL BLOCK ATTENTION MODULE AND JOINT TIME FREQUENCY LOSSES
    Zhao, Shengkui
    Nguyen, Trung Hieu
    Ma, Bin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6648 - 6652
  • [40] Identification and Suppression of Multicomponent Noise in Audio Magnetotelluric Data Based on Convolutional Block Attention Module
    Zhang, Liang
    Li, Guang
    Chen, Huang
    Tang, Jingtian
    Yang, Guanci
    Yu, Mingbiao
    Hu, Yong
    Xu, Jun
    Sun, Jing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15