Multi-stream Global-Local Motion Fusion Network for skeleton-based action recognition

Cited by: 0
Authors
Qi, Yanpeng [1]
Pang, Chen [1]
Liu, Yiliang [1, 3]
Lyu, Lei [1, 2]
Affiliations
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
[2] Shandong Prov Key Lab Distributed Comp Software No, Jinan, Peoples R China
[3] Shandong Prov Acad Educ Recruitment & Examinat, Jinan, Peoples R China
Keywords
Action recognition; Grouping graph convolution; Spatial-temporal self-attention; Multi-stream fusion strategy; LSTM;
DOI
10.1016/j.asoc.2023.110536
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Skeleton-based action recognition is widely used in areas such as human-machine interaction and virtual reality. Because graph convolutional networks (GCNs) are well suited to representing structural data, they have been developed to address this task by modeling human body skeletons as spatial-temporal graphs. However, most existing GCN-based methods ignore the diversity of motion information across the channels of the input feature. Enhancing the ability to capture long-term global correlations in the spatial and temporal dimensions is also a fundamental challenge. In this work, we propose a novel multi-stream framework, the Global-Local Motion Fusion Network (GLMFN), which integrates the global and local motion information of the spatial-temporal dimensions. Specifically, we design a grouping graph convolution module to strengthen the ability to aggregate local spatial motion information. In addition, to learn richer semantic features, we propose two modules based on the self-attention operator: a spatial self-attention module and a temporal self-attention module. The former extracts spatial long-term motion relationships, while the latter captures temporal long-term motion relationships. Moreover, we present a multi-stream fusion strategy with a series of treatments for body joints to improve recognition performance. To validate the efficacy and efficiency of the proposed model, we perform extensive experiments on the NTU-RGBD and NTU-RGBD-120 datasets, and our method achieves state-of-the-art performance on both. (c) 2023 Published by Elsevier B.V.
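The abstract names a spatial self-attention module but does not give its formulation. As a rough illustration only, the sketch below applies generic scaled dot-product self-attention across the joints of a single frame. The function names, the toy joint features, and the identity projection matrices are all assumptions made for this example; none of it is taken from the GLMFN paper.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def spatial_self_attention(x, w_q, w_k, w_v):
    """Generic scaled dot-product self-attention over joints (hypothetical helper).

    x: V x C joint features for one frame.
    w_q, w_k: C x D projections; w_v: C x C_out projection.
    Returns (output of shape V x C_out, attention map of shape V x V).
    """
    q = matmul(x, w_q)
    k = matmul(x, w_k)
    v = matmul(x, w_v)
    scale = math.sqrt(len(q[0]))
    k_t = [list(col) for col in zip(*k)]          # transpose K to D x V
    scores = matmul(q, k_t)                       # joint-to-joint similarities
    attn = [softmax([s / scale for s in row]) for row in scores]
    return matmul(attn, v), attn

# Toy frame: 3 joints with 2-dimensional features (made-up numbers).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]               # assumed projections, for illustration
out, attn = spatial_self_attention(x, identity, identity, identity)
```

Each row of `attn` is a distribution over all joints, so a joint can draw on any other joint regardless of skeletal adjacency. A temporal variant would presumably apply the same operator along the frame axis for each joint, though the abstract does not confirm the details.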
Pages: 13
Related papers
50 in total
  • [31] Extended multi-stream temporal-attention module for skeleton-based human action recognition (HAR)
    Mehmood, Faisal
    Guo, Xin
    Chen, Enqing
    Akbar, Muhammad Azeem
    Khan, Arif Ali
    Ullah, Sami
    Computers in Human Behavior, 2025, 163
  • [32] Partially Occluded Skeleton Action Recognition Based on Multi-stream Fusion Graph Convolutional Networks
    Li, Dan
    Shi, Wuzhen
    ADVANCES IN COMPUTER GRAPHICS, CGI 2021, 2021, 13002 : 178 - 189
  • [33] Symmetrical Enhanced Fusion Network for Skeleton-Based Action Recognition
    Kong, Jun
    Deng, Haoyang
    Jiang, Min
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (11) : 4394 - 4408
  • [34] MSST-RT: Multi-Stream Spatial-Temporal Relative Transformer for Skeleton-Based Action Recognition
    Sun, Yan
    Shen, Yixin
    Ma, Liyan
    SENSORS, 2021, 21 (16)
  • [35] Hybrid features for skeleton-based action recognition based on network fusion
    Chen, Zhangmeng
    Pan, Junjun
    Yang, Xiaosong
    Qin, Hong
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
  • [36] Skeleton Action Recognition Based on Multi-Stream Spatial Attention Graph Convolutional SRU Network
    Zhao J.-N.
    She Q.-S.
    Meng M.
    Chen Y.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (07) : 1579 - 1585
  • [37] Two Stream Multi-Attention Graph Convolutional Network for Skeleton-Based Action Recognition
    Zhou, Huijian
    Tian, Zhiqiang
    Du, Shaoyi
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 112 - 120
  • [38] Gaitdlf: global and local fusion for skeleton-based gait recognition in the wild
    Wei, Siwei
    Liu, Weijie
    Wei, Feifei
    Wang, Chunzhi
    Xiong, Neal N.
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (12) : 17606 - 17632
  • [39] EARLY FUSION GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED ACTION RECOGNITION
    Zhao, Xiaoxue
    Liu, Cuiwei
    Shi, Xiangbin
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021
  • [40] A Multi-Stream Graph Convolutional Networks-Hidden Conditional Random Field Model for Skeleton-Based Action Recognition
    Liu, Kai
    Gao, Lei
    Khan, Naimul Mefraz
    Qi, Lin
    Guan, Ling
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 64 - 76