An attention mechanism based convolutional LSTM network for video action recognition

Cited by: 0

Authors:

Hongwei Ge

Zehang Yan

Wenhao Yu

Liang Sun

Affiliation:

[1] Dalian University of Technology, College of Computer Science and Technology

Source:

Multimedia Tools and Applications | 2019 / Vol. 78

Keywords:

Attention mechanism; Convolutional LSTM; Spatial transformer; Video action recognition

DOI:

Not available

Abstract:

Human action recognition, an important problem in video classification, is an active topic in computer vision. A central challenge is to represent both the static spatial information and the dynamic temporal information of a video effectively. This paper proposes an attention-based convolutional LSTM algorithm for action recognition that improves accuracy by effectively extracting the salient regions of actions in videos. First, GoogLeNet is used to extract features from the video frames. Then, the resulting feature maps are processed by a spatial transformer network, which serves as the attention mechanism. Finally, the sequential information in the features is modeled by a convolutional LSTM to classify the action in the original video. To speed up training, an analysis of temporal coherence is used to reduce the redundant features extracted by GoogLeNet with only a trivial loss in accuracy. Compared with state-of-the-art algorithms for video action recognition, the method achieves competitive results on three widely used datasets: UCF-11, HMDB-51, and UCF-101. Moreover, with the temporal coherence analysis, desirable results are obtained while training time is reduced.
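The abstract does not say how the temporal coherence analysis decides which frame features are redundant. As a rough illustration of the general idea only, the sketch below keeps a frame when its feature vector differs enough from the most recently kept frame. The cosine-similarity criterion, the `threshold` value, and the names `cosine_similarity` and `select_keyframes` are all assumptions made for this example, not details taken from the paper.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def select_keyframes(
    features: Sequence[Sequence[float]], threshold: float = 0.95
) -> List[int]:
    """Return the indices of frames to keep.

    Hypothetical rule (not from the paper): a frame is kept only if its
    similarity to the most recently kept frame is below `threshold`.
    """
    kept: List[int] = []
    for i, feat in enumerate(features):
        if not kept or cosine_similarity(features[kept[-1]], feat) < threshold:
            kept.append(i)
    return kept


# Toy per-frame features standing in for CNN outputs (made-up numbers).
frames = [
    [1.0, 0.0, 0.0],
    [0.99, 0.01, 0.0],   # nearly identical to frame 0
    [0.0, 1.0, 0.0],
    [0.0, 0.98, 0.05],   # nearly identical to frame 2
    [0.0, 0.0, 1.0],
]
print(select_keyframes(frames))  # → [0, 2, 4]
```

In this toy run, frames 1 and 3 are dropped because they nearly duplicate the frame kept just before them, so only three of the five feature vectors would be passed on to the later stages. A higher `threshold` keeps more frames; a lower one discards more aggressively.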

Pages: 20533 - 20556

Page count: 23

50 items in total

[21] Human Action Recognition Network Based on Improved Channel Attention Mechanism
Chen Ying
Gong Suming
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (12) : 3538 - 3545
[22] Two-Level Attention Model Based Video Action Recognition Network
Sang, Haifeng
Zhao, Ziyu
He, Dakuo
IEEE ACCESS, 2019, 7 : 118388 - 118401
[23] Spatial-Temporal Convolutional Attention Network for Action Recognition
Luo, Huilan
Chen, Han
Computer Engineering and Applications, 2023, 59 (09) : 150 - 158
[24] SCN: Dilated silhouette convolutional network for video action recognition
Hua, Michelle
Gao, Mingqi
Zhong, Zichun
COMPUTER AIDED GEOMETRIC DESIGN, 2021, 85
[25] Generative Adversarial Network Based on LSTM and Convolutional Block Attention Module for Industrial Smoke Image Recognition
Li, Dahai
Yang, Rui
Chen, Su
COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2023, 20 (04) : 1707 - 1728
[26] Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional Network
Xing, Hao
Burschka, Darius
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022 : 3333 - 3340
[27] Independent Dual Graph Attention Convolutional Network for Skeleton-Based Action Recognition
Huo, Jinze
Cai, Haibin
Meng, Qinggang
NEUROCOMPUTING, 2024, 583
[28] Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional Network
Xing, Hao
Burschka, Darius
arXiv, 2022
[29] Convolutional Neural Network-Based Video Super-Resolution for Action Recognition
Zhang, Haochen
Liu, Dong
Xiong, Zhiwei
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018 : 746 - 750
[30] TCN-attention-HAR: human activity recognition based on attention mechanism time convolutional network
Wei, Xiong
Wang, Zifan
SCIENTIFIC REPORTS, 2024, 14 (01)