Deep multiple aggregation networks for action recognition

被引:1
|
作者
Mazari, Ahmed [1 ]
Sahbi, Hichem [1 ]
机构
[1] Sorbonne Univ, LIP6, CNRS, F-75005 Paris, France
关键词
Multiple aggregation design; 2-Stream networks; Action recognition; BEHAVIOR;
D O I
10.1007/s13735-023-00317-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the current action recognition algorithms are based on deep networks which stack multiple convolutional, pooling and fully connected layers. While convolutional and fully connected operations have been widely studied in the literature, the design of pooling operations that handle action recognition, with different sources of temporal granularity in action categories, has comparatively received less attention, and existing solutions rely mainly on max or averaging operations. The latter are clearly powerless to fully exhibit the actual temporal granularity of action categories and thereby constitute a bottleneck in classification performances. In this paper, we introduce a novel hierarchical pooling design that captures different levels of temporal granularity in action recognition. Our design principle is coarse-to-fine and achieved using a tree-structured network; as we traverse this network top-down, pooling operations are getting less invariant but timely more resolute and well localized. Learning the combination of operations in this network-which best fits a given ground-truth-is obtained by solving a constrained minimization problem whose solution corresponds to the distribution of weights that capture the contribution of each level (and thereby temporal granularity) in the global hierarchical pooling process. Besides being principled and well grounded, the proposed hierarchical pooling is also video-length and resolution agnostic. Extensive experiments conducted on the challenging UCF-101, HMDB-51 and JHMDB-21 databases corroborate all these statements.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] A Survey on Deep Neural Networks for Human Action Recognition based on Skeleton Information
    Wang, Hongyu
    [J]. RECENT DEVELOPMENTS IN INTELLIGENT SYSTEMS AND INTERACTIVE APPLICATIONS (IISA2016), 2017, 541 : 329 - 336
  • [42] Action Recognition Using Deep 3D CNNs with Sequential Feature Aggregation and Attention
    Anvarov, Fazliddin
    Kim, Dae Ha
    Song, Byung Cheol
    [J]. ELECTRONICS, 2020, 9 (01)
  • [43] Human Action Recognition Based on Multiple Features and Modified Deep Learning Model
    Zhu, Shaoping
    Xiao, Yongliang
    Ma, Weimin
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (10)
  • [44] Learning Action Images Using Deep Convolutional Neural Networks For 3D Action Recognition
    Thien Huynh-The
    Hua, Cam-Hao
    Kim, Dong-Seong
    [J]. 2019 IEEE SENSORS APPLICATIONS SYMPOSIUM (SAS), 2019,
  • [45] TEA: Temporal Excitation and Aggregation for Action Recognition
    Li, Yan
    Ji, Bin
    Shi, Xintian
    Zhang, Jianguo
    Kang, Bin
    Wang, Limin
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 906 - 915
  • [46] Multiple temporal scale aggregation graph convolutional network for skeleton-based action recognition
    Li, Xuanfeng
    Lu, Jian
    Zhou, Jian
    Liu, Wei
    Zhang, Kaibing
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [47] Multimodal Deep Feature Aggregation for Facial Action Unit Recognition using Visible Images and Physiological Signals
    Lakshminarayana, Nagashri N.
    Sankaran, Nishant
    Setlur, Srirangaraj
    Govindaraju, Venu
    [J]. 2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, : 458 - 461
  • [48] Multiview-Based 3-D Action Recognition Using Deep Networks
    Li, Chuankun
    Hou, Yonghong
    Wang, Pichao
    Li, Wanqing
    [J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2019, 49 (01) : 95 - 104
  • [49] Human Action Recognition Using Fusion of Modern Deep Convolutional and Recurrent Neural Networks
    Tkachenko, Dmytro
    [J]. 2018 IEEE FIRST INTERNATIONAL CONFERENCE ON SYSTEM ANALYSIS & INTELLIGENT COMPUTING (SAIC), 2018, : 181 - 185
  • [50] A survey on deep neural networks for human action recognition in RGB image and depth image
    Wang, Hongyu
    [J]. ENERGY SCIENCE AND APPLIED TECHNOLOGY (ESAT 2016), 2016, : 697 - 703