An attention mechanism based convolutional LSTM network for video action recognition

Cited by: 0
Authors
Hongwei Ge
Zehang Yan
Wenhao Yu
Liang Sun
Affiliation
[1] Dalian University of Technology, College of Computer Science and Technology
Keywords
Attention mechanism; Convolutional LSTM; Spatial transformer; Video action recognition
DOI
Not available
Abstract
As an important problem in video classification, human action recognition has become an active topic in computer vision. A central challenge is how to represent both the static spatial information and the dynamic temporal information of a video effectively. This paper proposes an attention-mechanism-based convolutional LSTM algorithm for action recognition, which improves accuracy by effectively extracting the salient regions of actions in videos. First, GoogLeNet is used to extract features from the video frames. Then, the resulting feature maps are processed by a spatial transformer network, which serves as the attention mechanism. Finally, a convolutional LSTM models the sequential information in the features to classify the action in the original video. To speed up training, temporal coherence analysis is used to reduce the redundant features extracted by GoogLeNet, with only trivial loss of accuracy. Compared with state-of-the-art algorithms for video action recognition, the method achieves competitive results on three widely used datasets: UCF-11, HMDB-51, and UCF-101. Moreover, with temporal coherence analysis, desirable results are obtained while training time is reduced.
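The abstract does not say how temporal coherence is measured, so the following is only a minimal sketch of the general idea, under stated assumptions: per-frame feature vectors are compared by cosine similarity, and a frame is dropped when it is nearly identical to the last frame that was kept. The function names `cosine_similarity` and `select_keyframes`, the `threshold` value, and the toy feature vectors are all hypothetical and are not taken from the paper.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def select_keyframes(features, threshold=0.95):
    """Return indices of frames to keep.

    A frame is kept only if its similarity to the most recently kept
    frame falls below `threshold` (an assumed, illustrative value).
    """
    if not features:
        return []
    kept = [0]  # always keep the first frame
    for i in range(1, len(features)):
        if cosine_similarity(features[kept[-1]], features[i]) < threshold:
            kept.append(i)
    return kept


# Toy per-frame features (hypothetical stand-ins for CNN feature vectors).
frames = [
    [1.0, 0.0, 0.0],
    [0.99, 0.01, 0.0],   # nearly identical to frame 0, so it is dropped
    [0.0, 1.0, 0.0],
    [0.0, 0.98, 0.02],   # nearly identical to frame 2, so it is dropped
    [0.0, 0.0, 1.0],
]
kept = select_keyframes(frames)
print(kept)
```

Only the kept frames would then be passed to the later stages, which is one plausible way redundant features could be reduced before sequence modeling; the paper's actual criterion may differ.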
Pages: 20533-20556
Page count: 23