Select and Focus: Action Recognition with Spatial-Temporal Attention

Citations: 0
Authors
Chan, Wensong [1 ]
Tian, Zhiqiang [1 ]
Liu, Shuai [1 ]
Ren, Jing [2 ]
Lan, Xuguang [3 ]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
[2] Xian Aeronaut Univ, Xian, Peoples R China
[3] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China;
Keywords
Human action recognition; Deep learning; Attention;
DOI
10.1007/978-3-030-27535-8_41
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the rapid development of neural networks, human action recognition has improved greatly through the use of convolutional neural networks (CNN) and recurrent neural networks (RNN). In this paper, we propose a model based on weighted spatial-temporal attention for action recognition. The model selects the key parts of each video frame and the important frames of each video sequence, and then focuses its analysis on these key parts and frames. The most important task of our model is therefore to find the key parts spatially and the important frames temporally for recognizing the action. Our model is trained and tested on three datasets: UCF-11, UCF-101, and HMDB51. The experiments demonstrate that our model can achieve a satisfactory result for human action recognition.
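The select-then-focus idea in the abstract can be illustrated with a minimal sketch. This is not the authors' architecture: the abstract does not say how attention scores are computed, so the linear scoring vectors `w_spatial` and `w_temporal`, the function names, and the toy input layout are all assumptions made for illustration. In a real model the region features would come from a CNN and the scores from learned layers.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def weighted_sum(weights, vectors):
    """Combine vectors using the given weights, one weight per vector."""
    dim = len(vectors[0])
    return [sum(w * vec[d] for w, vec in zip(weights, vectors)) for d in range(dim)]

def spatial_temporal_attention(video, w_spatial, w_temporal):
    """Hypothetical two-stage attention pooling.

    video: list of T frames; each frame is a list of K region
           feature vectors of dimension D.
    w_spatial, w_temporal: assumed linear scoring vectors of dimension D.
    Returns (video_feature, spatial_weights, temporal_weights).
    """
    frame_features = []
    spatial_weights = []
    for regions in video:
        # Spatial step: weight the regions (key parts) within one frame.
        alpha = softmax([dot(w_spatial, r) for r in regions])
        spatial_weights.append(alpha)
        frame_features.append(weighted_sum(alpha, regions))
    # Temporal step: weight the frames within the sequence.
    beta = softmax([dot(w_temporal, f) for f in frame_features])
    video_feature = weighted_sum(beta, frame_features)
    return video_feature, spatial_weights, beta

# Toy input: 3 frames, 2 regions per frame, 2-dimensional features.
toy_video = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[2.0, 0.0], [0.0, 0.5]],
    [[0.0, 0.0], [1.0, 1.0]],
]
feature, alphas, betas = spatial_temporal_attention(
    toy_video, w_spatial=[1.0, 0.0], w_temporal=[0.5, 0.5]
)
```

Each row of `alphas` sums to 1 across the regions of a frame, and `betas` sums to 1 across frames, so the pooled `feature` is a convex combination of region features dominated by the highest-scoring parts and frames.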
Pages: 461-471
Number of pages: 11
Related papers
50 in total
  • [21] Spatial-Temporal gated graph attention network for skeleton-based action recognition
    Rahevar, Mrugendrasinh
    Ganatra, Amit
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 929 - 939
  • [22] Attention-based spatial-temporal hierarchical ConvLSTM network for action recognition in videos
    Xue, Fei
    Ji, Hongbing
    Zhang, Wenbo
    Cao, Yi
    IET COMPUTER VISION, 2019, 13 (08) : 708 - 718
  • [24] An Attention Enhanced Spatial-Temporal Graph Convolutional LSTM Network for Action Recognition in Karate
    Guo, Jianping
    Liu, Hong
    Li, Xi
    Xu, Dahong
    Zhang, Yihan
    APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [25] Extreme Low-Resolution Action Recognition with Confident Spatial-Temporal Attention Transfer
    Bai, Yucai
    Zou, Qin
    Chen, Xieyuanli
    Li, Lingxi
    Ding, Zhengming
    Chen, Long
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (06) : 1550 - 1565
  • [26] Spatial-Temporal Dynamic Graph Attention Network for Skeleton-Based Action Recognition
    Rahevar, Mrugendrasinh
    Ganatra, Amit
    Saba, Tanzila
    Rehman, Amjad
    Bahaj, Saeed Ali
    IEEE ACCESS, 2023, 11 : 21546 - 21553
  • [27] Streamer action recognition in live video with spatial-temporal attention and deep dictionary learning
    Li, Chenhao
    Zhang, Jing
    Yao, Jiacheng
    NEUROCOMPUTING, 2021, 453 : 383 - 392
  • [28] Activity Recognition Based on Spatial-Temporal Attention LSTM
    Xie, Zhao
    Zhou, Yi
    Wu, Ke-Wei
    Zhang, Shun-Ran
    Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (02) : 261 - 274
  • [29] Action Recognition by Joint Spatial-Temporal Motion Feature
    Zhang, Weihua
    Zhang, Yi
    Gao, Chaobang
    Zhou, Jiliu
    JOURNAL OF APPLIED MATHEMATICS, 2013,
  • [30] Spatial-Temporal Pyramid Graph Reasoning for Action Recognition
    Geng, Tiantian
    Zheng, Feng
    Hou, Xiaorong
    Lu, Ke
    Qi, Guo-Jun
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5484 - 5497