Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition

Cited by: 219
Authors
Chen, Tailin [1, 3, 4]
Zhou, Desen [2 ]
Wang, Jian [2 ]
Wang, Shidong [1 ]
Guan, Yu [1 ]
He, Xuming [3 ]
Ding, Errui [2 ]
Affiliations
[1] Newcastle Univ, Open Lab, Newcastle Upon Tyne, Tyne & Wear, England
[2] Baidu Inc, Dept Comp Vis Technol VIS, Beijing, Peoples R China
[3] ShanghaiTech Univ, Shanghai, Peoples R China
[4] Baidu VIS, Beijing, Peoples R China
Source
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Action Recognition; Skeleton-based; Multi-granular; Spatial temporal; attention; DualHead-Net;
DOI
10.1145/3474085.3475574
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The task of skeleton-based action recognition remains a core challenge in human-centred scene understanding due to the multiple granularities and large variation in human motion. Existing approaches typically employ a single neural representation for different motion patterns, which makes it difficult to capture fine-grained action classes given limited training data. To address these problems, we propose a novel multi-granular spatio-temporal graph network for skeleton-based action classification that jointly models coarse- and fine-grained skeleton motion patterns. To this end, we develop a dual-head graph network consisting of two interleaved branches, which enables us to extract features at two spatio-temporal resolutions effectively and efficiently. Moreover, our network utilises a cross-head communication strategy to mutually enhance the representations of both heads. We conduct extensive experiments on three large-scale datasets, namely NTU RGB+D 60, NTU RGB+D 120, and Kinetics-Skeleton, and achieve state-of-the-art performance on all benchmarks, which validates the effectiveness of our method.
Pages: 4334-4342
Number of pages: 9