COARSE-TO-FINE AGGREGATION FOR CROSS-GRANULARITY ACTION RECOGNITION

Cited by: 0
Authors
Mazari, Ahmed [1]
Sahbi, Hichem [1]
Affiliations
[1] Sorbonne Univ, LIP6, CNRS, UPMC, F-75005 Paris, France
Keywords
Hierarchical pooling; deep multiple representation learning; action recognition
DOI
Not available
Chinese Library Classification (CLC) number
TB8 [Photographic technology]
Discipline classification code
0804
Abstract
In this paper, we introduce a novel hierarchical aggregation design that captures different levels of temporal granularity in action recognition. Our design principle is coarse-to-fine and is achieved using a tree-structured network; as we traverse this network top-down, pooling operations become less invariant but temporally more resolute and better localized. The combination of operations in this network that best fits a given ground truth is learned by solving a constrained minimization problem whose solution corresponds to the distribution of weights that capture the contribution of each level (and thereby each temporal granularity) to the global hierarchical pooling process. Besides being principled and well grounded, the proposed hierarchical pooling is also video-length agnostic and resilient to misalignments in actions. Extensive experiments conducted on the challenging UCF-101 database corroborate these statements.
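The coarse-to-fine pooling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the binary split per level, the use of average pooling, the softmax used to keep the level weights non-negative and summing to one, and all function names (`mean_pool`, `level_segments`, `hierarchical_pool`) are assumptions made for illustration only. The paper obtains its weights by solving a constrained minimization problem, which this sketch does not reproduce.

```python
# Illustrative sketch only: a binary temporal tree with average pooling and
# softmax-normalized level weights. None of these names come from the paper.
import math


def mean_pool(frames):
    """Average a non-empty list of equal-length feature vectors."""
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / len(frames) for d in range(dim)]


def level_segments(num_frames, level):
    """Split range(num_frames) into 2**level contiguous, non-empty segments."""
    parts = 2 ** level
    if num_frames < parts:
        raise ValueError("video too short for this tree depth")
    bounds = [i * num_frames // parts for i in range(parts + 1)]
    return [(bounds[i], bounds[i + 1]) for i in range(parts)]


def softmax(scores):
    """Map unconstrained scores to non-negative weights that sum to 1."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def hierarchical_pool(frames, level_scores):
    """Concatenate pooled features of every tree node, scaled by level weight.

    Level 0 (root) pools the whole video; deeper levels pool shorter,
    better-localized segments.
    """
    weights = softmax(level_scores)
    output = []
    for level, weight in enumerate(weights):
        for start, end in level_segments(len(frames), level):
            pooled = mean_pool(frames[start:end])
            output.extend(weight * value for value in pooled)
    return output


if __name__ == "__main__":
    # Four frames with one feature each, two levels with equal scores.
    video = [[1.0], [2.0], [3.0], [4.0]]
    print(hierarchical_pool(video, [0.0, 0.0]))
```

In this sketch the output length is the feature dimension times the number of tree nodes (2^L - 1 for L levels), so it does not depend on the number of frames, which mirrors the video-length-agnostic property claimed in the abstract.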
Pages: 1541-1545
Page count: 5