COARSE-TO-FINE AGGREGATION FOR CROSS-GRANULARITY ACTION RECOGNITION

被引:0
|
作者
Mazari, Ahmed [1 ]
Sahbi, Hichem [1 ]
机构
[1] Sorbonne Univ, LIP6, CNRS, UPMC, F-75005 Paris, France
关键词
Hierarchical pooling; deep multiple representation learning; action recognition;
D O I
暂无
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
In this paper, we introduce a novel hierarchical aggregation design that captures different levels of temporal granularity in action recognition. Our design principle is coarse- to-fine and achieved using a tree-structured network; as we traverse this network top-down, pooling operations are getting less invariant but timely more resolute and well localized. Learning the combination of operations in this network - which best fits a given ground-truth - is obtained by solving a constrained minimization problem whose solution corresponds to the distribution of weights that capture the contribution of each level (and thereby temporal granularity) in the global hierarchical pooling process. Besides being principled and well grounded, the proposed hierarchical pooling is also video-length agnostic and resilient to misalignments in actions. Extensive experiments conducted on the challenging UCF-101 database corroborate these statements.
引用
收藏
页码:1541 / 1545
页数:5
相关论文
共 50 条
  • [21] Coarse-to-Fine Based Matching for Audio Commercial Recognition
    Liu, Nan
    Zhao, Yao
    Zhu, Zhenfeng
    2008 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 87 - 90
  • [22] A Coarse-to-Fine Framework for Resource Efficient Video Recognition
    Wu, Zuxuan
    Li, Hengduo
    Zheng, Yingbin
    Xiong, Caiming
    Jiang, Yu-Gang
    Davis, Larry S.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (11) : 2965 - 2977
  • [23] Robust Coarse-to-Fine Sparse Representation for Face Recognition
    Sun, Yunlian
    Tistarelli, Massimo
    IMAGE ANALYSIS AND PROCESSING (ICIAP 2013), PT II, 2013, 8157 : 171 - 180
  • [24] A coarse-to-fine network for aphid recognition and detection in the field
    Li, Rui
    Wang, Rujing
    Xie, Chengjun
    Liu, Liu
    Zhang, Jie
    Wang, Fangyuan
    Liu, Wancai
    BIOSYSTEMS ENGINEERING, 2019, 187 : 39 - 52
  • [25] Coarse-to-fine object recognition using shock graphs
    Bataille, A
    Dickinson, S
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, PROCEEDINGS, 2005, 3434 : 203 - 212
  • [26] A coarse-to-fine classification scheme for facial expression recognition
    Feng, XY
    Hadid, A
    Pietikäinen, M
    IMAGE ANALYSIS AND RECOGNITION, PT 2, PROCEEDINGS, 2004, 3212 : 668 - 675
  • [27] EFFICIENT HUMAN ACTION DETECTION: A COARSE-TO-FINE STRATEGY
    Wu, Xian
    Lai, Jianhuang
    Chen, Xilin
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 701 - 704
  • [28] PCG-TAL: Progressive Cross-Granularity Cooperation for Temporal Action Localization
    Su, Rui
    Xu, Dong
    Sheng, Lu
    Ouyang, Wanli
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2103 - 2113
  • [29] Cross-Granularity Attention Network for Semantic Segmentation
    Zhu, Lingyu
    Wang, Tinghuai
    Aksu, Emre
    Kamarainen, Joni-Kristian
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1920 - 1930
  • [30] LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition
    Wu, Zuxuan
    Xiong, Caiming
    Jiang, Yu-Gang
    Davis, Larry S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32