Deep multiple aggregation networks for action recognition

被引:0
|
作者
Ahmed Mazari
Hichem Sahbi
机构
[1] Sorbonne University,CNRS, LIP6
关键词
Multiple aggregation design; 2-Stream networks; Action recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Most of the current action recognition algorithms are based on deep networks which stack multiple convolutional, pooling and fully connected layers. While convolutional and fully connected operations have been widely studied in the literature, the design of pooling operations that handle action recognition, with different sources of temporal granularity in action categories, has comparatively received less attention, and existing solutions rely mainly on max or averaging operations. The latter are clearly powerless to fully exhibit the actual temporal granularity of action categories and thereby constitute a bottleneck in classification performances. In this paper, we introduce a novel hierarchical pooling design that captures different levels of temporal granularity in action recognition. Our design principle is coarse-to-fine and achieved using a tree-structured network; as we traverse this network top-down, pooling operations are getting less invariant but timely more resolute and well localized. Learning the combination of operations in this network—which best fits a given ground-truth—is obtained by solving a constrained minimization problem whose solution corresponds to the distribution of weights that capture the contribution of each level (and thereby temporal granularity) in the global hierarchical pooling process. Besides being principled and well grounded, the proposed hierarchical pooling is also video-length and resolution agnostic. Extensive experiments conducted on the challenging UCF-101, HMDB-51 and JHMDB-21 databases corroborate all these statements.
引用
收藏
相关论文
共 50 条
  • [31] Stratified pooling based deep convolutional neural networks for human action recognition
    Yu, Sheng
    Cheng, Yun
    Su, Songzhi
    Cai, Guorong
    Li, Shaozi
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (11) : 13367 - 13382
  • [32] EMG Signals based Human Action Recognition via Deep Belief Networks
    Zhang, Jianhua
    Ling, Chen
    Li, Sunan
    [J]. IFAC PAPERSONLINE, 2019, 52 (19): : 271 - 276
  • [33] Exploiting deep residual networks for human action recognition from skeletal data
    Huy-Hieu Pham
    Khoudour, Louandi
    Crouzil, Alain
    Zegers, Pablo
    Velastin, Sergio A.
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2018, 170 : 51 - 66
  • [34] Data Driven Sensing for Action Recognition Using Deep Convolutional Neural Networks
    Gupta, Ronak
    Anand, Prashant
    Kaushik, Vinay
    Chaudhury, Santanu
    Lall, Brejesh
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT I, 2019, 11941 : 250 - 259
  • [35] Modelling Human Body Pose for Action Recognition Using Deep Neural Networks
    Chengyang Li
    Ruofeng Tong
    Min Tang
    [J]. Arabian Journal for Science and Engineering, 2018, 43 : 7777 - 7788
  • [36] Stratified pooling based deep convolutional neural networks for human action recognition
    Sheng Yu
    Yun Cheng
    Songzhi Su
    Guorong Cai
    Shaozi Li
    [J]. Multimedia Tools and Applications, 2017, 76 : 13367 - 13382
  • [37] Action Recognition From Depth Maps Using Deep Convolutional Neural Networks
    Wang, Pichao
    Li, Wanqing
    Gao, Zhimin
    Zhang, Jing
    Tang, Chang
    Ogunbona, Philip O.
    [J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2016, 46 (04) : 498 - 509
  • [38] Skeleton-based Action Recognition with Lie Group and Deep Neural Networks
    Li, Yanshan
    Guo, Tianyu
    Liu, Xing
    Xia, Rongjie
    [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 26 - 30
  • [39] Deep Residual Split Directed Graph Convolutional Neural Networks for Action Recognition
    Fu, Bo
    Fu, Shilin
    Wang, Liyan
    Dong, Yuhan
    Ren, Yonggong
    [J]. IEEE MULTIMEDIA, 2020, 27 (04) : 9 - 17
  • [40] Emotion and Gesture Guided Action Recognition in Videos Using Supervised Deep Networks
    Nigam, Nitika
    Dutta, Tanima
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (05) : 2546 - 2556