An attention mechanism based convolutional LSTM network for video action recognition

被引：0

作者：

Hongwei Ge

Zehang Yan

Wenhao Yu

Liang Sun

机构：

[1] Dalian University of Technology,College of Computer Science and Technology

来源：

Multimedia Tools and Applications | 2019年 / 78卷

关键词：

Attention mechanism; Convolutional LSTM; Spatial transformer; Video action recognition;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

As an important issue in video classification, human action recognition is becoming a hot topic in computer vision. The ways of effectively representing the spatial static and temporal dynamic information of videos are important problems in video action recognition. This paper proposes an attention mechanism based convolutional LSTM action recognition algorithm to improve the accuracy of recognition by extracting the salient regions of actions in videos effectively. First, GoogleNet is used to extract the features of video frames. Then, those feature maps are processed by the spatial transformer network for the attention. Finally the sequential information of the features is modeled via the convolutional LSTM to classify the action in the original video. To accelerate the training speed, we adopt the analysis of temporal coherence to reduce the redundant features extracted by GoogleNet with trivial accuracy loss. In comparison with the state-of-the-art algorithms for video action recognition, competitive results are achieved on three widely-used datasets, UCF-11, HMDB-51 and UCF-101. Moreover, by using the analysis of temporal coherence, desirable results are obtained while the training time is reduced.

引用

页码：20533 / 20556

页数：23

共 50 条

[1] An attention mechanism based convolutional LSTM network for video action recognition
Ge, Hongwei
Yan, Zehang
Yu, Wenhao
Sun, Liang
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (14) : 20533 - 20556
[2] Video action recognition method based on attention residual network and LSTM
Zhang, Yu
Dong, Pengyue
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3611 - 3616
[3] An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition
Si, Chenyang
Chen, Wentao
Wang, Wei
Wang, Liang
Tan, Tieniu
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1227 - 1236
[4] Attention-Based Convolutional LSTM for Describing Video
Liu, Zhongyu
Chen, Tian
Ding, Enjie
Liu, Yafeng
Yu, Wanli
IEEE Access, 2020, 8 : 133713 - 133724
[5] Attention-Based Convolutional LSTM for Describing Video
Liu, Zhongyu
Chen, Tian
Ding, Enjie
Liu, Yafeng
Yu, Wanli
IEEE ACCESS, 2020, 8 : 133713 - 133724
[6] Dual attention convolutional network for action recognition
Li, Xiaoqiang
Xie, Miao
Zhang, Yin
Ding, Guangtai
Tong, Weiqin
IET IMAGE PROCESSING, 2020, 14 (06) : 1059 - 1065
[7] An Attention Enhanced Spatial-Temporal Graph Convolutional LSTM Network for Action Recognition in Karate
Guo, Jianping
Liu, Hong
Li, Xi
Xu, Dahong
Zhang, Yihan
APPLIED SCIENCES-BASEL, 2021, 11 (18):
[8] Recurrent Region Attention and Video Frame Attention Based Video Action Recognition Network Design
Sang H.-F.
Zhao Z.-Y.
He D.-K.
Zhao, Zi-Yu (Maikuraky1022@outlook.com), 1600, Chinese Institute of Electronics (48): : 1052 - 1061
[9] Attention in Convolutional LSTM for Gesture Recognition
Zhang, Liang
Zhu, Guangming
Mei, Lin
Shen, Peiyi
Shah, Syed Afaq Ali
Bennamoun, Mohammed
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[10] Target Recognition of Robot Based on Attention Mechanism and Convolutional Neural Network
Li, Hexi
Li, Jihua
PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 2578 - 2584

← 1 2 3 4 5 →