ESTS-GCN: An Ensemble Spatial-Temporal Skeleton-Based Graph Convolutional Networks for Violence Detection

被引:0
|
作者
Janbi, Nourah Fahad [1 ]
Ghaseb, Musrea Abdo [2 ]
Almazroi, Abdulwahab Ali [1 ]
机构
[1] Univ Jeddah, Coll Comp & Informat Technol Khulais, Dept Informat Technol, Jeddah, Saudi Arabia
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
关键词
graph attention networks; graph convolutional networks; safe community; self-attention; skeleton; smart city; smart surveillance; violence detection;
D O I
10.1155/2024/2323337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Surveillance systems are essential for social and personal security. However, monitoring multiple video feeds with multiple targets is challenging for human operators. Therefore, automatic and smart surveillance systems have been introduced to support or replace traditional surveillance systems and build safer communities. Advancements in artificial intelligence techniques, particularly in the field of computer vision, have boosted this area of research. Most existing works have focused on image-based (RGB-based) machine learning and deep learning algorithms for detecting anomalous and violent events. In this study, we propose a unique Ensemble Spatial-Temporal Skeleton-Based Graph Convolutional Networks (ESTS-GCNs) model for violence detection that automatically uses spatial and temporal data to detect violence in surveillance videos. Skeleton-based algorithms are less sensitive to pixel-based noise and background interference, making them excellent candidates for activity and anomaly detection. Our proposed ensemble-based architecture utilizes Graph Convolutional Networks (GCNs) and comprises multiple spatial and temporal modules. Three different spatial pipelines are exploited: channel-wise topologies, self-attention mechanism, and graph attention networks. The models were trained and evaluated using two skeleton-based datasets introduced by us: Skeleton-based Real-Life Violence Situations (RLVS) and NTU-Violence (NTU-V). Our model achieved a maximum accuracy of around 93% and outperformed existing models by more than 10%.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] TEST-GCN: Topologically Enhanced Spatial-Temporal Graph Convolutional Networks for Traffic Forecasting
    Ali, Muhammad Afif
    Venkatesan, Suriya
    Liang, Victor
    Kruppa, Hannes
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 982 - 987
  • [42] Skeleton-Based Detection of Abnormalities in Human Actions Using Graph Convolutional Networks
    Yu, Bruce X. B.
    Liu, Yan
    Chan, Keith C. C.
    2020 SECOND INTERNATIONAL CONFERENCE ON TRANSDISCIPLINARY AI (TRANSAI 2020), 2020, : 131 - 137
  • [43] Focus on temporal graph convolutional networks with unified attention for skeleton-based action recognition
    Gao, Bing-Kun
    Dong, Le
    Bi, Hong-Bo
    Bi, Yun-Ze
    APPLIED INTELLIGENCE, 2022, 52 (05) : 5608 - 5616
  • [44] Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition
    Huang, Zhen
    Shen, Xu
    Tian, Xinmei
    Li, Houqiang
    Huang, Jianqiang
    Hua, Xian-Sheng
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2122 - 2130
  • [45] Focus on temporal graph convolutional networks with unified attention for skeleton-based action recognition
    Bing-Kun Gao
    Le Dong
    Hong-Bo Bi
    Yun-Ze Bi
    Applied Intelligence, 2022, 52 : 5608 - 5616
  • [46] Hierarchical Spatial-Temporal Network for Skeleton-Based Temporal Action Segmentation
    Tan, Chenwei
    Sun, Tao
    Fu, Talas
    Wang, Yuhan
    Xu, Minjie
    Liu, Shenglan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 28 - 39
  • [47] CONFIDENCE-AWARE SPATIAL-TEMPORAL ATTENTION GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED EXPERT-NOVICE LEVEL CLASSIFICATION
    Seino, Tatsuki
    Saito, Naoki
    Ogawa, Takahiro
    Asamizu, Satoshi
    Haseyama, Miki
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1796 - 1800
  • [48] Multi-scale spatial-temporal convolutional neural network for skeleton-based action recognition
    Cheng, Qin
    Cheng, Jun
    Ren, Ziliang
    Zhang, Qieshi
    Liu, Jianming
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 1303 - 1315
  • [49] Skeleton-Based Emotion Recognition Based on Two-Stream Self-Attention Enhanced Spatial-Temporal Graph Convolutional Network
    Shi, Jiaqi
    Liu, Chaoran
    Ishi, Carlos Toshinori
    Ishiguro, Hiroshi
    SENSORS, 2021, 21 (01) : 1 - 16
  • [50] Skeleton-based emotion recognition based on two-stream self-attention enhanced spatial-temporal graph convolutional network
    Shi, Jiaqi
    Liu, Chaoran
    Ishi, Carlos Toshinori
    Ishiguro, Hiroshi
    Sensors (Switzerland), 2021, 21 (01): : 1 - 16