AGPN: Action Granularity Pyramid Network for Video Action Recognition

被引:21
|
作者
Chen, Yatong [1 ]
Ge, Hongwei [1 ]
Liu, Yuxuan [1 ]
Cai, Xinye [1 ]
Sun, Liang [1 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
Video action recognition; pyramid network; multi-scale; multi-granularity; REPRESENTATIONS;
D O I
10.1109/TCSVT.2023.3235522
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video action recognition is a fundamental task for video understanding. Action recognition in complex spatio-temporal contexts generally requires fusing of different multi-granularity action information. However, existing works do not consider spatio-temporal information modeling and fusion from the perspective of action granularity. To address this problem, this paper proposes an Action Granularity Pyramid Network (AGPN) for action recognition, which can be flexibly integrated into 2D backbone networks. The core module is the Action Granularity Pyramid Module (AGPM), a hierarchical pyramid structure with residual connections, which is established to fuse multi-granularity action spatio-temporal information. From top to bottom level in the designed pyramid structure, the receptive field decreases and action granularity becomes more refined. To enrich temporal information of the inputs, a Multiple Frame Rate Module (MFM) is proposed to mix different frame rates at a fine-grained pixel-wise level. Moreover, a Spatio-temporal Anchor Module (SAM) is employed to fix spatio-temporal feature anchors to promote the effectiveness of feature extraction. We conduct extensive experiments on three large-scale action recognition datasets, Something-Something V1 & V2 and Kinetics-400. The results demonstrate that our proposed AGPN outperforms the state-of-the-art methods for the tasks of video action recognition.
引用
收藏
页码:3912 / 3923
页数:12
相关论文
共 50 条
  • [21] SCN: Dilated silhouette convolutional network for video action recognition
    Hua, Michelle
    Gao, Mingqi
    Zhong, Zichun
    COMPUTER AIDED GEOMETRIC DESIGN, 2021, 85
  • [22] Multipath Attention and Adaptive Gating Network for Video Action Recognition
    Haiping Zhang
    Zepeng Hu
    Dongjin Yu
    Liming Guan
    Xu Liu
    Conghao Ma
    Neural Processing Letters, 56
  • [23] ATTENTIONAL FUSED TEMPORAL TRANSFORMATION NETWORK FOR VIDEO ACTION RECOGNITION
    Yang, Ke
    Wang, Zhiyuan
    Dai, Huadong
    Shen, Tianlong
    Qiao, Peng
    Niu, Xin
    Jiang, Jie
    Li, Dongsheng
    Dou, Yong
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4377 - 4381
  • [24] FREQUENCY ENHANCEMENT NETWORK FOR EFFICIENT COMPRESSED VIDEO ACTION RECOGNITION
    Ming, Yue
    Xiong, Lu
    Jia, Xia
    Zheng, Qingfang
    Zhou, Jiangwan
    Feng, Fan
    Hu, Nannan
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 825 - 829
  • [25] SDAN: Stacked Diverse Attention Network for Video Action Recognition
    Zhu, Xiaoguang
    Huang, Siran
    Fan, Wenjing
    Cheng, Yuhao
    Shao, Huaqing
    Liu, Peilin
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [26] Multipath Attention and Adaptive Gating Network for Video Action Recognition
    Zhang, Haiping
    Hu, Zepeng
    Yu, Dongjin
    Guan, Liming
    Liu, Xu
    Ma, Conghao
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [27] DEEP TEMPORAL PYRAMID DESIGN FOR ACTION RECOGNITION
    Mazari, Ahmed
    Sahbi, Hichem
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2077 - 2081
  • [28] Action recognition on continuous video
    Chang, Y. L.
    Chan, C. S.
    Remagnino, P.
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04): : 1233 - 1243
  • [29] Compressed Video Action Recognition
    Wu, Chao-Yuan
    Zaheer, Manzil
    Hu, Hexiang
    Manmatha, R.
    Smola, Alexander J.
    Krahenbuhl, Philipp
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6026 - 6035
  • [30] Human Action Recognition in Video
    Singh, Dushyant Kumar
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 54 - 66