Action Recognition Using Visual Attention with Reinforcement Learning

被引:11
|
作者
Li, Hongyang [1 ,3 ]
Chen, Jun [1 ,2 ]
Hu, Ruimin [1 ,2 ]
Yu, Mei [3 ]
Chen, Huafeng [4 ]
Xu, Zengmin [1 ]
机构
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[3] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang, Peoples R China
[4] Jingchu Univ Technol, Jingmen, Peoples R China
来源
关键词
Human action recognition; Reinforcement learning; Visual attention;
D O I
10.1007/978-3-030-05716-9_30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition in videos is a challenging and significant task with a broad range of applications. The advantage of the visual attention mechanism is that it can effectively reduce noise interference by focusing on the relevant parts of the image and ignoring the irrelevant part. We propose a deep visual attention model with reinforcement learning for this task. We use Recurrent Neural Network (RNN) with Long Short-Term Memory (LSTM) units as a learning agent. The agent interact with video and decides both where to look next frame and where to locate the most relevant region of the selected video frame. REINFORCE method is used to learn the agent's decision policy and back-propagation method is used to train the action classifier. The experimental results demonstrate that this glimpse window can focus on important clues. Our model achieves significant performance improvement on the action recognition datasets: UCF101 and HMDB51.
引用
收藏
页码:365 / 376
页数:12
相关论文
共 50 条
  • [1] Better Deep Visual Attention with Reinforcement Learning in Action Recognition
    Wang, Gang
    Wang, Wenmin
    Wang, Jingzhuo
    Bu, Yaohua
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017,
  • [2] Learning of joint visual attention by reinforcement learning
    Matsuda, G
    Omori, T
    [J]. ICCM - 2001: PROCEEDINGS OF THE 2001 FOURTH INTERNATIONAL CONFERENCE ON COGNITIVE MODELING, 2001, : 157 - 162
  • [3] Attention-Aware Sampling via Deep Reinforcement Learning for Action Recognition
    Dong, Wenkai
    Zhang, Zhaoxiang
    Tan, Tieniu
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8247 - 8254
  • [4] Deep Learning for Human Visual Attention Recognition Using Transfer Learning
    Nam Vu Hoai
    Huong Nguyen Mai
    Cuong Pham
    [J]. 2018 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2018, : 42 - 46
  • [5] Spatial attention based visual semantic learning for action recognition in still images
    Zheng, Yunpeng
    Zheng, Xiangtao
    Lu, Xiaoqiang
    Wu, Siyuan
    [J]. NEUROCOMPUTING, 2020, 413 : 383 - 396
  • [6] Unsupervised Visual Attention and Invariance for Reinforcement Learning
    Wang, Xudong
    Lian, Long
    Yu, Stella X.
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6673 - 6683
  • [7] Action Recognition with Visual Attention on Skeleton Images
    Yang, Zhengyuan
    Li, Yuncheng
    Yang, Jianchao
    Luo, Jiebo
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3309 - 3314
  • [8] Visual Reinforcement Learning for Object Recognition in Robotics
    Malowany, Dan
    Guterman, Hugo
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING (ICSEE), 2016,
  • [9] Reinforcement learning for decision making in sequential visual attention
    Paletta, Lucas
    Fritz, Gerald
    [J]. ATTENTION IN COGNITIVE SYSTEMS: THEORIES AND SYSTEMS FROM AN INTERDISCIPLINARY VIEWPOINT, 2007, 4840 : 293 - 306
  • [10] Deep Reinforcement Learning With Visual Attention for Vehicle Classification
    Zhao, Dongbin
    Chen, Yaran
    Lv, Le
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2017, 9 (04) : 356 - 367