Global spatio-temporal synergistic topology learning for skeleton-based action recognition

被引:15
|
作者
Dai, Meng [1 ,2 ,3 ,4 ]
Sun, Zhonghua [1 ,2 ,3 ,5 ]
Wang, Tianyi [1 ,2 ,3 ,6 ]
Feng, Jinchao [1 ,2 ,3 ,5 ]
Jia, Kebin [1 ,2 ,3 ,5 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Lab Adv Informat Networks, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
[4] Beijing Univ Technol BJUT, Informat & Commun Engn Dept, Beijing, Peoples R China
[5] BJUT, Fac Informat Technol, Beijing, Peoples R China
[6] BJUT, Informat & Commun Engn Dept, Beijing, Peoples R China
关键词
Action recognition; Spatio-temporal synergistic; Skeleton; Topology learning;
D O I
10.1016/j.patcog.2023.109540
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Compared to RGB video-based action recognition, skeleton-based action recognition algorithm has attracted much more attention due to being more lightweight, better generalization and robustness. The extraction of temporal and spatial features is a crucial factor for skeleton-based action recognition. However, existing feature extraction methods suffer from two limitations: (1) the isolated extraction of temporal and spatial feature cannot capture temporal feature connections among non-adjacent joints and (2) convolution-limited perceptual fields cannot capture global temporal features of joints effectively. In this work, we propose a global spatio-temporal synergistic feature learning module (GSTL), which generates global spatio-temporal synergistic topology of joints by spatio-temporal feature fusion. By further combining the GSTL with a temporal modeling unit, we develop a powerful global spatio-temporal synergistic topology learning network (GSTLN), and it achieves competitive performance with fewer parameters on three challenge datasets: NTU RGB + D, NTU RGB + D 120, and NW-UCLA.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Learning Representations by Contrastive Spatio-Temporal Clustering for Skeleton-Based Action Recognition
    Wang, Mingdao
    Li, Xueming
    Chen, Siqi
    Zhang, Xianlin
    Ma, Lei
    Zhang, Yue
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3207 - 3220
  • [2] Joint Learning in the Spatio-Temporal and Frequency Domains for Skeleton-Based Action Recognition
    Hu, Guyue
    Cui, Bo
    Yu, Shan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (09) : 2207 - 2220
  • [3] Spatio-temporal stacking model for skeleton-based action recognition
    Yufeng Zhong
    Qiuyan Yan
    [J]. Applied Intelligence, 2022, 52 : 12116 - 12130
  • [4] Leveraging Spatio-Temporal Dependency for Skeleton-Based Action Recognition
    Lee, Jungho
    Lee, Minhyeok
    Cho, Suhwan
    Woo, Sungmin
    Jang, Sungjun
    Lee, Sangyoun
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10221 - 10230
  • [5] Spatio-temporal segments attention for skeleton-based action recognition
    Qiu, Helei
    Hou, Biao
    Ren, Bo
    Zhang, Xiaohua
    [J]. NEUROCOMPUTING, 2023, 518 : 30 - 38
  • [6] Spatio-Temporal Difference Descriptor for Skeleton-Based Action Recognition
    Ding, Chongyang
    Liu, Kai
    Korhonen, Jari
    Belyaev, Evgeny
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1227 - 1235
  • [7] Spatio-Temporal Graph Routing for Skeleton-Based Action Recognition
    Li, Bin
    Li, Xi
    Zhang, Zhongfei
    Wu, Fei
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8561 - 8568
  • [8] Leveraging Spatio-Temporal Dependency for Skeleton-Based Action Recognition
    Lee, Jungho
    Lee, Minhyeok
    Cho, Suhwan
    Woo, Sungmin
    Jang, Sungjun
    Lee, Sangyoun
    [J]. arXiv, 2022,
  • [9] Spatio-temporal stacking model for skeleton-based action recognition
    Zhong, Yufeng
    Yan, Qiuyan
    [J]. APPLIED INTELLIGENCE, 2022, 52 (11) : 12116 - 12130
  • [10] Global Spatio-Temporal Deformable Network for Skeleton-Based Gesture Recognition
    Shi D.
    Lin H.
    Liu Y.
    Zhang X.
    [J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2024, 53 (01): : 60 - 66