Top-Down Deep Appearance Attention for Action Recognition

Cited by: 0
Authors
Anwer, Rao Muhammad [1]
Khan, Fahad Shahbaz [2]
van de Weijer, Joost [3]
Laaksonen, Jorma [1]
Affiliations
[1] Aalto Univ, Sch Sci, Dept Comp Sci, Espoo, Finland
[2] Linkoping Univ, Comp Vis Lab, Linkoping, Sweden
[3] Univ Autonoma Barcelona, Comp Vis Ctr, CS Dept, Barcelona, Spain
Source
IMAGE ANALYSIS, SCIA 2017, PT I | 2017 / Vol. 10269
Funding
Academy of Finland
Keywords
Action recognition; CNNs; Feature fusion; FEATURES;
DOI
10.1007/978-3-319-59126-1_25
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recognizing human actions in videos is a challenging problem in computer vision. Recently, deep features based on convolutional neural networks have shown promising results for action recognition. In this paper, we investigate the problem of fusing deep appearance and motion cues for action recognition. We propose a video representation that combines deep appearance-based and motion-based local convolutional features within the bag-of-deep-features framework. First, dense local convolutional features for appearance and motion are extracted from the spatial (RGB) and temporal (flow) networks, respectively. The two visual cues are processed in parallel by constructing separate visual vocabularies for appearance and motion. A category-specific appearance map is then learned to modulate the weights of the deep motion features. The proposed representation is discriminative and binds the deep local convolutional features to their spatial locations. Experiments are performed on two challenging datasets: JHMDB, with 21 action classes, and ACT, with 43 categories. The results demonstrate that our approach outperforms the two standard approaches of early and late feature fusion. Furthermore, although our approach uses only action labels and does not exploit body part information, it achieves performance competitive with state-of-the-art approaches based on deep features.
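The pipeline outlined in the abstract (separate vocabularies, then a category-specific appearance map that reweights motion features at each location) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes that the appearance map is the posterior p(class | appearance word) estimated by smoothed counting, that features are hard-assigned to the nearest visual word, and that random vectors stand in for the CNN features; the abstract confirms none of these details. All function and variable names are hypothetical.

```python
import numpy as np

def assign_words(features, vocabulary):
    """Hard-assign each local feature (N, D) to its nearest visual word (K, D)."""
    d2 = ((features[:, None, :] - vocabulary[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def class_given_word(word_ids, labels, n_words, n_classes, eps=1.0):
    """Assumed appearance map: p(class | appearance word) from smoothed counts."""
    counts = np.full((n_classes, n_words), eps)
    np.add.at(counts, (labels, word_ids), 1.0)  # accumulate repeated index pairs
    return counts / counts.sum(axis=0, keepdims=True)

def attention_histograms(app_ids, mot_ids, p_class_given_app, n_motion_words):
    """One motion-word histogram per class, weighted by appearance attention."""
    hists = []
    for class_attention in p_class_given_app:
        # Weight of each spatial location under this class's appearance map.
        weights = class_attention[app_ids]
        h = np.bincount(mot_ids, weights=weights, minlength=n_motion_words)
        hists.append(h / (h.sum() + 1e-12))
    return np.concatenate(hists)

# Toy data: random vectors in place of spatial (RGB) and temporal (flow) features.
rng = np.random.default_rng(0)
n_locations, dim, n_app_words, n_mot_words, n_classes = 30, 8, 4, 5, 3
app_vocab = rng.normal(size=(n_app_words, dim))
mot_vocab = rng.normal(size=(n_mot_words, dim))
app_feats = rng.normal(size=(n_locations, dim))
mot_feats = rng.normal(size=(n_locations, dim))
labels = rng.integers(0, n_classes, size=n_locations)  # per-feature class labels

# Both features at a location share an index, which binds them spatially.
app_ids = assign_words(app_feats, app_vocab)
mot_ids = assign_words(mot_feats, mot_vocab)
p = class_given_word(app_ids, labels, n_app_words, n_classes)
video_repr = attention_histograms(app_ids, mot_ids, p, n_mot_words)
```

In this sketch the final descriptor concatenates one attention-weighted motion histogram per class, so its length is the number of classes times the motion vocabulary size. Early fusion would instead quantize concatenated appearance and motion features, and late fusion would concatenate two independent histograms.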
Pages: 297-309
Number of pages: 13