Multimodal cooperative self-attention network for action recognition

被引:2
|
作者
Zhong, Zhuokun [1 ]
Hou, Zhenjie [1 ]
Liang, Jiuzhen [1 ]
Lin, En [2 ]
Shi, Haiyong [1 ]
机构
[1] Changzhou Univ, Sch Comp & Artificial Intelligence, Changzhou 213000, Peoples R China
[2] Goldcard Smart Grp Co Ltd, Hangzhou, Peoples R China
关键词
computer vision; image fusion;
D O I
10.1049/ipr2.12754
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal human behaviour recognition is a research hotspot in computer vision. To fully use both skeleton and depth data, this paper constructs a new multimodal network identification scheme combined with the self-attention mechanism. The system comprises a transformer-based skeleton self-attention subnetwork and a depth self-attention subnetwork based on CNN. In the skeleton self-attention subnetwork, this paper proposes a motion synergy space feature that can integrate the information of each joint point according to the entirety and synergy of human motion and puts forward a quantitative standard for the contribution degree of each joint motion. In this paper, the results from the skeleton self-attention subnetwork and the depth self-attention subnetwork are integrated and they are verified on the NTU RGB+D and UTD-MHAD datasets. The authors have achieved 90% recognition rate on UTD-MHAD dataset, and the CS recognition rate of the authors' method on the NTU RGB+D dataset reaches 90.5% and the recognition rate of CV is 94.7%. Experimental results show that the network structure proposed in this paper achieves a high recognition rate, and its performance is better than most current methods.
引用
收藏
页码:1775 / 1783
页数:9
相关论文
共 50 条
  • [31] The function of the self-attention network
    Cunningham, Sheila J.
    COGNITIVE NEUROSCIENCE, 2016, 7 (1-4) : 21 - 22
  • [32] Multimodal Depression Detection Based on Self-Attention Network With Facial Expression and Pupil
    Liu, Xiang
    Shen, Hao
    Li, Huiru
    Tao, Yongfeng
    Yang, Minqiang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025, 12 (01): : 64 - 76
  • [33] A framework for facial expression recognition using deep self-attention network
    Indolia S.
    Nigam S.
    Singh R.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (07) : 9543 - 9562
  • [34] Dual-branch self-attention network for pedestrian attribute recognition
    Liu, Zhenyu
    Zhang, Zhang
    Li, Da
    Zhang, Peng
    Shan, Caifeng
    PATTERN RECOGNITION LETTERS, 2022, 163 : 112 - 120
  • [35] Ghost imaging object recognition based on self-attention mechanism network
    He, Yunting
    Yuan, Sheng
    Song, Jiali
    AIP ADVANCES, 2023, 13 (12)
  • [36] Self-attention for Speech Emotion Recognition
    Tarantino, Lorenzo
    Garner, Philip N.
    Lazaridis, Alexandros
    INTERSPEECH 2019, 2019, : 2578 - 2582
  • [37] Regional Self-Attention Convolutional Neural Network for Facial Expression Recognition
    Zhou, Lifang
    Wang, Yi
    Lei, Bangjun
    Yang, Weibin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (08)
  • [38] Advancing classroom fatigue recognition: A multimodal fusion approach using self-attention mechanism
    Cao, Lei
    Wang, Wenrong
    Dong, Yilin
    Fan, Chunjiang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [39] 3D-ShuffleViT: An Efficient Video Action Recognition Network with Deep Integration of Self-Attention and Convolution
    Wang, Yinghui
    Zhu, Anlei
    Ma, Haomiao
    Ai, Lingyu
    Song, Wei
    Zhang, Shaojie
    MATHEMATICS, 2023, 11 (18)
  • [40] SPAR: An efficient self-attention network using Switching Partition Strategy for skeleton-based action recognition
    Zhu, ZiJie
    Ying, RenDong
    Wen, Fei
    Liu, PeiLin
    NEUROCOMPUTING, 2023, 562