Multimodal cooperative self-attention network for action recognition

被引：2

作者：

Zhong, Zhuokun ^{[1
]}

Hou, Zhenjie ^{[1
]}

Liang, Jiuzhen ^{[1
]}

Lin, En ^{[2
]}

Shi, Haiyong ^{[1
]}

机构：

[1] Changzhou Univ, Sch Comp & Artificial Intelligence, Changzhou 213000, Peoples R China

[2] Goldcard Smart Grp Co Ltd, Hangzhou, Peoples R China

来源：

IET IMAGE PROCESSING | 2023年 / 17卷 / 06期

关键词：

computer vision; image fusion;

D O I：

10.1049/ipr2.12754

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multimodal human behaviour recognition is a research hotspot in computer vision. To fully use both skeleton and depth data, this paper constructs a new multimodal network identification scheme combined with the self-attention mechanism. The system comprises a transformer-based skeleton self-attention subnetwork and a depth self-attention subnetwork based on CNN. In the skeleton self-attention subnetwork, this paper proposes a motion synergy space feature that can integrate the information of each joint point according to the entirety and synergy of human motion and puts forward a quantitative standard for the contribution degree of each joint motion. In this paper, the results from the skeleton self-attention subnetwork and the depth self-attention subnetwork are integrated and they are verified on the NTU RGB+D and UTD-MHAD datasets. The authors have achieved 90% recognition rate on UTD-MHAD dataset, and the CS recognition rate of the authors' method on the NTU RGB+D dataset reaches 90.5% and the recognition rate of CV is 94.7%. Experimental results show that the network structure proposed in this paper achieves a high recognition rate, and its performance is better than most current methods.

引用

页码：1775 / 1783

页数：9

共 50 条

[31] The function of the self-attention network
Cunningham, Sheila J.
COGNITIVE NEUROSCIENCE, 2016, 7 (1-4) : 21 - 22
[32] Multimodal Depression Detection Based on Self-Attention Network With Facial Expression and Pupil
Liu, Xiang
Shen, Hao
Li, Huiru
Tao, Yongfeng
Yang, Minqiang
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2025, 12 (01): : 64 - 76
[33] A framework for facial expression recognition using deep self-attention network
Indolia S.
Nigam S.
Singh R.
Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (07) : 9543 - 9562
[34] Dual-branch self-attention network for pedestrian attribute recognition
Liu, Zhenyu
Zhang, Zhang
Li, Da
Zhang, Peng
Shan, Caifeng
PATTERN RECOGNITION LETTERS, 2022, 163 : 112 - 120
[35] Ghost imaging object recognition based on self-attention mechanism network
He, Yunting
Yuan, Sheng
Song, Jiali
AIP ADVANCES, 2023, 13 (12)
[36] Self-attention for Speech Emotion Recognition
Tarantino, Lorenzo
Garner, Philip N.
Lazaridis, Alexandros
INTERSPEECH 2019, 2019, : 2578 - 2582
[37] Regional Self-Attention Convolutional Neural Network for Facial Expression Recognition
Zhou, Lifang
Wang, Yi
Lei, Bangjun
Yang, Weibin
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (08)
[38] Advancing classroom fatigue recognition: A multimodal fusion approach using self-attention mechanism
Cao, Lei
Wang, Wenrong
Dong, Yilin
Fan, Chunjiang
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
[39] 3D-ShuffleViT: An Efficient Video Action Recognition Network with Deep Integration of Self-Attention and Convolution
Wang, Yinghui
Zhu, Anlei
Ma, Haomiao
Ai, Lingyu
Song, Wei
Zhang, Shaojie
MATHEMATICS, 2023, 11 (18)
[40] SPAR: An efficient self-attention network using Switching Partition Strategy for skeleton-based action recognition
Zhu, ZiJie
Ying, RenDong
Wen, Fei
Liu, PeiLin
NEUROCOMPUTING, 2023, 562

← 1 2 3 4 5 →