COARSE-TO-FINE AGGREGATION FOR CROSS-GRANULARITY ACTION RECOGNITION

被引:0
|
作者
Mazari, Ahmed [1 ]
Sahbi, Hichem [1 ]
机构
[1] Sorbonne Univ, LIP6, CNRS, UPMC, F-75005 Paris, France
关键词
Hierarchical pooling; deep multiple representation learning; action recognition;
D O I
暂无
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
In this paper, we introduce a novel hierarchical aggregation design that captures different levels of temporal granularity in action recognition. Our design principle is coarse- to-fine and achieved using a tree-structured network; as we traverse this network top-down, pooling operations are getting less invariant but timely more resolute and well localized. Learning the combination of operations in this network - which best fits a given ground-truth - is obtained by solving a constrained minimization problem whose solution corresponds to the distribution of weights that capture the contribution of each level (and thereby temporal granularity) in the global hierarchical pooling process. Besides being principled and well grounded, the proposed hierarchical pooling is also video-length agnostic and resilient to misalignments in actions. Extensive experiments conducted on the challenging UCF-101 database corroborate these statements.
引用
收藏
页码:1541 / 1545
页数:5
相关论文
共 50 条
  • [1] Multiple Granularity Modeling: A Coarse-to-Fine Framework for Fine-grained Action Analysis
    Ni, Bingbing
    Paramathayalan, Vignesh R.
    Li, Teng
    Moulin, Pierre
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 120 (01) : 28 - 43
  • [2] Multiple Granularity Modeling: A Coarse-to-Fine Framework for Fine-grained Action Analysis
    Bingbing Ni
    Vignesh R. Paramathayalan
    Teng Li
    Pierre Moulin
    International Journal of Computer Vision, 2016, 120 : 28 - 43
  • [3] Towards Cross-Granularity Few-Shot Learning: Coarse-to-Fine Pseudo-Labeling with Visual-Semantic Meta-Embedding
    Yang, Jinhai
    Yang, Hua
    Chen, Lin
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3005 - 3014
  • [4] A Context Knowledge Map Guided Coarse-to-Fine Action Recognition
    Ji, Yanli
    Zhan, Yue
    Yang, Yang
    Xu, Xing
    Shen, Fumin
    Shen, Heng Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2742 - 2752
  • [5] Coarse-to-Fine Context Aggregation Network for Vehicle Make and Model Recognition
    Tan, Shi Hao
    Chuah, Joon Huang
    Chow, Chee-Onn
    Kanesan, Jeevan
    IEEE ACCESS, 2023, 11 : 126733 - 126747
  • [6] A coarse-to-fine method for shape recognition
    Tang H.-X.
    Wei H.
    Journal of Computer Science and Technology, 2007, 22 (02) : 330 - 334
  • [7] Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion
    Lin, Weiyao
    Mi, Yang
    Wu, Jianxin
    Lu, Ke
    Xiong, Hongkai
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7130 - 7137
  • [8] A Coarse-to-Fine Boundary Localization method for Naturalistic Driving Action Recognition
    Ding, Guanchen
    Han, Wenwei
    Wang, Chenglong
    Cui, Mingpeng
    Zhou, Lin
    Pan, Dianbo
    Wang, Jiayi
    Zhang, Junxi
    Chen, Zhenzhong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 3233 - 3240
  • [9] Exploring Coarse-to-Fine Action Token Localization and Interaction for Fine-grained Video Action Recognition
    Sun, Baoli
    Ye, Xinchen
    Wang, Zhihui
    Li, Haojie
    Wang, Zhiyong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5070 - 5078
  • [10] A fusion approach for coarse-to-fine target recognition
    Folkesson, Martin
    Gronwall, Christina
    Jungert, Erland
    MULTISENSOR, MULTISOURCE INFORMATIN FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS 2006, 2006, 6242