An attention mechanism based convolutional LSTM network for video action recognition

被引:0
|
作者
Hongwei Ge
Zehang Yan
Wenhao Yu
Liang Sun
机构
[1] Dalian University of Technology,College of Computer Science and Technology
来源
关键词
Attention mechanism; Convolutional LSTM; Spatial transformer; Video action recognition;
D O I
暂无
中图分类号
学科分类号
摘要
As an important issue in video classification, human action recognition is becoming a hot topic in computer vision. The ways of effectively representing the spatial static and temporal dynamic information of videos are important problems in video action recognition. This paper proposes an attention mechanism based convolutional LSTM action recognition algorithm to improve the accuracy of recognition by extracting the salient regions of actions in videos effectively. First, GoogleNet is used to extract the features of video frames. Then, those feature maps are processed by the spatial transformer network for the attention. Finally the sequential information of the features is modeled via the convolutional LSTM to classify the action in the original video. To accelerate the training speed, we adopt the analysis of temporal coherence to reduce the redundant features extracted by GoogleNet with trivial accuracy loss. In comparison with the state-of-the-art algorithms for video action recognition, competitive results are achieved on three widely-used datasets, UCF-11, HMDB-51 and UCF-101. Moreover, by using the analysis of temporal coherence, desirable results are obtained while the training time is reduced.
引用
收藏
页码:20533 / 20556
页数:23
相关论文
共 50 条
  • [21] Human Action Recognition Network Based on Improved Channel Attention Mechanism
    Chen Ying
    Gong Suming
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (12) : 3538 - 3545
  • [22] Two-Level Attention Model Based Video Action Recognition Network
    Sang, Haifeng
    Zhao, Ziyu
    He, Dakuo
    IEEE ACCESS, 2019, 7 : 118388 - 118401
  • [23] Spatial-Temporal Convolutional Attention Network for Action Recognition
    Luo, Huilan
    Chen, Han
    Computer Engineering and Applications, 2023, 59 (09): : 150 - 158
  • [24] SCN: Dilated silhouette convolutional network for video action recognition
    Hua, Michelle
    Gao, Mingqi
    Zhong, Zichun
    COMPUTER AIDED GEOMETRIC DESIGN, 2021, 85
  • [25] Generative Adversarial Network Based on LSTM and Convolutional Block Attention Module for Industrial Smoke Image Recognition
    Li, Dahai
    Yang, Rui
    Chen, Su
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2023, 20 (04) : 1707 - 1728
  • [26] Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional Network
    Xing, Hao
    Burschka, Darius
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3333 - 3340
  • [27] Independent Dual Graph Attention Convolutional Network for Skeleton-Based Action Recognition
    Huo, Jinze
    Cai, Haibin
    Meng, Qinggang
    NEUROCOMPUTING, 2024, 583
  • [28] Skeletal Human Action Recognition using Hybrid Attention based Graph Convolutional Network
    Xing, Hao
    Burschka, Darius
    arXiv, 2022,
  • [29] Convolutional Neural Network-Based Video Super-Resolution for Action Recognition
    Zhang, Haochen
    Liu, Dong
    Xiong, Zhiwei
    PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 746 - 750
  • [30] TCN-attention-HAR: human activity recognition based on attention mechanism time convolutional network
    Wei, Xiong
    Wang, Zifan
    SCIENTIFIC REPORTS, 2024, 14 (01)