Action Recognition Using Visual Attention with Reinforcement Learning

Cited by: 11

Authors:

Li, Hongyang [1,3]

Chen, Jun [1,2]

Hu, Ruimin [1,2]

Yu, Mei [3]

Chen, Huafeng [4]

Xu, Zengmin [1]

Affiliations:

[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan, Peoples R China

[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China

[3] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang, Peoples R China

[4] Jingchu Univ Technol, Jingmen, Peoples R China

Source:

MULTIMEDIA MODELING, MMM 2019, PT II | 2019 / Vol. 11296

Keywords:

Human action recognition; Reinforcement learning; Visual attention;

DOI:

10.1007/978-3-030-05716-9_30

Chinese Library Classification number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Human action recognition in videos is a challenging and significant task with a broad range of applications. The advantage of the visual attention mechanism is that it can effectively reduce noise interference by focusing on the relevant parts of the image and ignoring the irrelevant parts. We propose a deep visual attention model with reinforcement learning for this task. We use a Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM) units as the learning agent. The agent interacts with the video and decides both which frame to look at next and where the most relevant region of the selected frame is located. The REINFORCE method is used to learn the agent's decision policy, and back-propagation is used to train the action classifier. The experimental results demonstrate that the glimpse window can focus on important clues. Our model achieves significant performance improvement on two action recognition datasets: UCF101 and HMDB51.
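The abstract names REINFORCE as the method for learning where the agent looks. As a rough illustration of that update rule only, the sketch below trains a softmax policy over a handful of candidate glimpse locations in a toy setting where exactly one location yields a reward. It is not the authors' model: there is no LSTM, no video input, and no classifier, and the function name `train_glimpse_policy`, the reward definition, the moving-average baseline, and all hyperparameters are assumptions made for this example.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train_glimpse_policy(num_locations=5, informative=2, steps=2000,
                         lr=0.1, seed=0):
    """Toy REINFORCE loop (hypothetical setup, not the paper's agent).

    The policy picks one of `num_locations` glimpse positions and gets
    reward 1 only when it picks the `informative` one.
    """
    rng = random.Random(seed)
    logits = [0.0] * num_locations   # policy parameters
    baseline = 0.0                   # running average reward (variance reduction)
    for _ in range(steps):
        probs = softmax(logits)
        # Sample a glimpse location from the current policy.
        action = rng.choices(range(num_locations), weights=probs)[0]
        reward = 1.0 if action == informative else 0.0
        advantage = reward - baseline
        # For a softmax policy, d log pi(action) / d logit_k = 1[k == action] - probs[k].
        for k in range(num_locations):
            grad_log_prob = (1.0 if k == action else 0.0) - probs[k]
            logits[k] += lr * advantage * grad_log_prob
        # Update the baseline as an exponential moving average of rewards.
        baseline += 0.05 * (reward - baseline)
    return softmax(logits)


if __name__ == "__main__":
    print(train_glimpse_policy())
```

In a full model of the kind the abstract describes, the reward would presumably come from the classifier's output on the attended region, and the policy parameters would be those of the recurrent network rather than a free vector of logits.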


Pages: 365-376

Page count: 12

