LSTA: Long Short-Term Attention for Egocentric Action Recognition

被引:94
|
作者
Sudhakaran, Swathikiran [1 ,2 ]
Escalera, Sergio [3 ,4 ]
Lanz, Oswald [1 ]
机构
[1] Fdn Bruno Kessler, Trento, Italy
[2] Univ Trento, Trento, Italy
[3] Comp Vis Ctr, Barcelona, Spain
[4] Univ Barcelona, Barcelona, Spain
关键词
D O I
10.1109/CVPR.2019.01019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Egocentric activity recognition is one of the most challenging tasks in video analysis. It requires a fine-grained discrimination of small objects and their manipulation. While some methods base on strong supervision and attention mechanisms, they are either annotation consuming or do not take spatio-temporal patterns into account. In this paper we propose LSTA as a mechanism to focus on features from relevant spatial parts while attention is being tracked smoothly across the video sequence. We demonstrate the effectiveness of LSTA on egocentric activity recognition with an end-to-end trainable two-stream architecture, achieving state-of-the-art performance on four standard benchmarks.
引用
收藏
页码:9946 / 9955
页数:10
相关论文
共 50 条
  • [1] Long Short-Term Attention
    Zhong, Guoqiang
    Lin, Xin
    Chen, Kang
    Li, Qingyang
    Huang, Kaizhu
    [J]. ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, 2020, 11691 : 45 - 54
  • [2] Lattice Long Short-Term Memory for Human Action Recognition
    Sun, Lin
    Jia, Kui
    Chen, Kevin
    Yeung, Dit Yan
    Shi, Bertram E.
    Savarese, Silvio
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2166 - 2175
  • [3] Robust Human Action Recognition via Long Short-Term Memory
    Grushin, Alexander
    Monner, Derek D.
    Reggia, James A.
    Mishra, Ajay
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [4] Deep historical long short-term memory network for action recognition
    Cai, Jiaxin
    Hu, Junlin
    Tang, Xin
    Hung, Tzu-Yi
    Tan, Yap-Peng
    [J]. NEUROCOMPUTING, 2020, 407 : 428 - 438
  • [5] Deep Attention Network for Egocentric Action Recognition
    Lu, Minlong
    Li, Ze-Nian
    Wang, Yueming
    Pan, Gang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) : 3703 - 3713
  • [6] Learning Spatiotemporal Attention for Egocentric Action Recognition
    Lu, Minlong
    Liao, Danping
    Li, Ze-Nian
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4425 - 4434
  • [7] Short-Term Action Learning for Video Action Recognition
    Ting-Long, Liu
    [J]. IEEE Access, 2024, 12 : 30867 - 30875
  • [8] Short-Term Action Learning for Video Action Recognition
    Ting-Long, Liu
    [J]. IEEE ACCESS, 2024, 12 : 30867 - 30875
  • [9] The Study of Human Action Recognition in Videos with Long Short-Term Memory Model
    Khan, Hussan
    Habib, Sammra
    Qasim, Amna
    Hussain, Nisar
    Usman, Muhammad
    Mahmood, Ahmad
    Shaukat, Zainab
    Afzal, Asghar
    Zain, Muhammad
    [J]. INFORMATION MANAGEMENT AND BIG DATA, SIMBIG 2023, 2024, 2142 : 217 - 230
  • [10] Semi-supervised long short-term memory for human action recognition
    Liu, Hong
    Liu, Chang
    Ding, Runwei
    [J]. JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13): : 373 - 378