An attention mechanism based convolutional LSTM network for video action recognition

被引：0

作者：

Hongwei Ge

Zehang Yan

Wenhao Yu

Liang Sun

机构：

[1] Dalian University of Technology,College of Computer Science and Technology

来源：

Multimedia Tools and Applications | 2019年 / 78卷

关键词：

Attention mechanism; Convolutional LSTM; Spatial transformer; Video action recognition;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

As an important issue in video classification, human action recognition is becoming a hot topic in computer vision. The ways of effectively representing the spatial static and temporal dynamic information of videos are important problems in video action recognition. This paper proposes an attention mechanism based convolutional LSTM action recognition algorithm to improve the accuracy of recognition by extracting the salient regions of actions in videos effectively. First, GoogleNet is used to extract the features of video frames. Then, those feature maps are processed by the spatial transformer network for the attention. Finally the sequential information of the features is modeled via the convolutional LSTM to classify the action in the original video. To accelerate the training speed, we adopt the analysis of temporal coherence to reduce the redundant features extracted by GoogleNet with trivial accuracy loss. In comparison with the state-of-the-art algorithms for video action recognition, competitive results are achieved on three widely-used datasets, UCF-11, HMDB-51 and UCF-101. Moreover, by using the analysis of temporal coherence, desirable results are obtained while the training time is reduced.

引用

页码：20533 / 20556

页数：23

共 50 条

[41] Recognition of Teachers' Facial Expression Intensity Based on Convolutional Neural Network and Attention Mechanism
Zheng, Kun
Yang, Dong
Liu, Junhua
Cui, Jinling
IEEE ACCESS, 2020, 8 : 226437 - 226444
[42] Seatbelt Recognition Method Based on Convolutional Attention Mechanism
Chen, Guangxi
Lv, Fangfang
Zhan, Yijun
Huang, Yong
2019 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY (CCET), 2019, : 187 - 192
[43] Convolutional neural network based on attention mechanism and Bi-LSTM for bearing remaining life prediction
Jiahang Luo
Xu Zhang
Applied Intelligence, 2022, 52 : 1076 - 1091
[44] Radar HRRP sequence target recognition method of attention mechanism based stacked LSTM network
Zhang Y.
Zhang S.
Liu Y.
Jing F.
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (10): : 2775 - 2781
[45] Exposing DeepFake Videos Using Attention Based Convolutional LSTM Network
Yishan Su
Huawei Xia
Qi Liang
Weizhi Nie
Neural Processing Letters, 2021, 53 : 4159 - 4175
[46] Skeleton-based human action recognition using LSTM and depthwise separable convolutional neural network
Le, Hoangcong
Lu, Cheng-Kai
Hsu, Chen-Chien
Huang, Shao-Kang
APPLIED INTELLIGENCE, 2025, 55 (04)
[47] Temporal Group Deep Network Action Recognition Algorithm Based on Attention Mechanism
Hu Z.
Diao P.
Zhang R.
Li S.
Zhao M.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (10): : 892 - 900
[48] GRAPH CONVOLUTIONAL LSTM MODEL FOR SKELETON-BASED ACTION RECOGNITION
Zhang, Han
Song, Yonghong
Zhang, Yuanlin
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 412 - 417
[49] Context-Aware Memory Attention Network for Video-Based Action Recognition
Koh, Thean Chun
Yeo, Chai Kiat
Vaitesswar, U. S.
Jing, Xuan
2022 IEEE 14TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2022,
[50] Exposing DeepFake Videos Using Attention Based Convolutional LSTM Network
Su, Yishan
Xia, Huawei
Liang, Qi
Nie, Weizhi
NEURAL PROCESSING LETTERS, 2021, 53 (06) : 4159 - 4175

← 1 2 3 4 5 →