Select and Focus: Action Recognition with Spatial-Temporal Attention

Cited by: 0

Authors:

Chan, Wensong ^{[1]}

Tian, Zhiqiang ^{[1]}

Liu, Shuai ^{[1]}

Ren, Jing ^{[2]}

Lan, Xuguang ^{[3]}

Affiliations:

[1] Xi'an Jiaotong University, School of Software Engineering, Xi'an, People's Republic of China

[2] Xi'an Aeronautical University, Xi'an, People's Republic of China

[3] Xi'an Jiaotong University, Institute of Artificial Intelligence and Robotics, Xi'an, People's Republic of China

Source:

INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT III | 2019 / Vol. 11742

Funding:

China Postdoctoral Science Foundation; National Natural Science Foundation of China;

Keywords:

Human action recognition; Deep learning; Attention;

DOI:

10.1007/978-3-030-27535-8_41

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

With the rapid development of neural networks, human action recognition has improved greatly through the use of convolutional neural networks (CNN) and recurrent neural networks (RNN). In this paper, we propose a model based on weighted spatial-temporal attention for action recognition. The model selects the key parts of each video frame and the important frames of each video sequence, and then focuses its analysis on these key parts and frames. The most important task of the model is therefore to find the key parts spatially and the important frames temporally for recognizing the action. The model is trained and tested on three datasets: UCF-11, UCF-101, and HMDB51. The experiments demonstrate that the model achieves satisfactory results for human action recognition.
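The select-then-focus idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not describe the architecture, so the softmax weighting, the dot-product scoring, and the names `spatial_temporal_pool`, `spatial_query`, and `temporal_query` are all assumptions made for illustration. Each frame is assumed to be given as a list of region feature vectors.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def weighted_sum(vectors, weights):
    """Combine equal-length vectors using the given weights."""
    dim = len(vectors[0])
    return [sum(w * v[d] for v, w in zip(vectors, weights)) for d in range(dim)]


def spatial_temporal_pool(video, spatial_query, temporal_query):
    """Hypothetical attention pooling; the scoring scheme is an assumption.

    video: list of frames, each frame a list of region feature vectors.
    Returns (video_feature, frame_weights, region_weights_per_frame).
    """
    frame_features = []
    region_weights_per_frame = []
    for regions in video:
        # Spatial step: weight the regions ("key parts") inside one frame.
        region_weights = softmax([dot(r, spatial_query) for r in regions])
        region_weights_per_frame.append(region_weights)
        frame_features.append(weighted_sum(regions, region_weights))
    # Temporal step: weight the frames ("important frames") in the sequence.
    frame_weights = softmax([dot(f, temporal_query) for f in frame_features])
    video_feature = weighted_sum(frame_features, frame_weights)
    return video_feature, frame_weights, region_weights_per_frame
```

In a trained model the two query vectors would be learned and the pooled video feature would feed a classifier; here they are fixed inputs so that the weighting itself is easy to inspect.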

Citation

Pages: 461-471

Number of pages: 11
