Semantic-guided multi-scale human skeleton action recognition

被引:6
|
作者
Qi, Yongfeng [1 ]
Hu, Jinlin [1 ]
Zhuang, Liqiang [1 ]
Pei, Xiaoxu [1 ]
机构
[1] Northwest Normal Univ, Coll Comp Sci & Engn, Lanzhou 730070, Gansu, Peoples R China
关键词
Human skeleton; Action recognition; Semantic information; Multi-scale neural network; Multi-scale receptive field; GRAPH CONVOLUTIONAL NETWORKS; LSTM; FUSION; GCN;
D O I
10.1007/s10489-022-03968-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of depth sensors and pose estimation algorithms, action recognition technology based on the human skeleton has attracted wide attention from researchers. The human skeleton action recognition methods embedded with semantic information have excellent performance in terms of computational cost and recognition results by extracting spatio-temporal features of all joints, nevertheless, they will cause information redundancy and are of limitations in extracting long-term context spatio-temporal features. In this work, we propose a semantic-guided multi-scale neural network (SGMSN) method for skeleton action recognition. For spatial modeling, the key insight of our approach is to achieve multi-scale graph convolution by manipulating the data level (without adding additional computational cost). For temporal modeling, we build the multi-scale temporal convolutional network with a multi-scale receptive field across the temporal dimensions. Several experiments have been carried out on two publicly available large-scale skeleton datasets, NTU RGB+D and NTU RGB+D 120. On the NTU RGB+D datasets, the accuracy is 90.1% (cross-subject) and 95.8% (cross-view) respectively. The experimental results show that the performance of the proposed network architecture is superior to most current state-of-the-art action recognition models.
引用
收藏
页码:9763 / 9778
页数:16
相关论文
共 50 条
  • [1] Semantic-guided multi-scale human skeleton action recognition
    Yongfeng Qi
    Jinlin Hu
    Liqiang Zhuang
    Xiaoxu Pei
    Applied Intelligence, 2023, 53 : 9763 - 9778
  • [2] Improved semantic-guided network for skeleton-based action recognition
    Mansouri, Amine
    Bakir, Toufik
    Elzaar, Abdellah
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [3] CONSISTENT AND MULTI-SCALE SCENE GRAPH TRANSFORMER FOR SEMANTIC-GUIDED IMAGE OUTPAINTING
    Yang, Chiao-An
    Wu, Meng-Lin
    Yeh, Raymond A.
    Wang, Yu-Chiang Frank
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 176 - 180
  • [4] ASMGCN: Attention-Based Semantic-Guided Multistream Graph Convolution Network for Skeleton Action Recognition
    Zhang, Moyan
    Quan, Zhenzhen
    Wang, Wei
    Chen, Zhe
    Guo, Xiaoshan
    Li, Yujun
    IEEE SENSORS JOURNAL, 2024, 24 (12) : 20064 - 20075
  • [5] Multi-scale skeleton adaptive weighted GCN for skeleton-based human action recognition in IoT
    Xu Weiyao
    Wu Muqing
    Zhu Jie
    Zhao Min
    APPLIED SOFT COMPUTING, 2021, 104
  • [6] Multi-scale skeleton simplification graph convolutional network for skeleton-based action recognition
    Fan, Zhang
    Ding, Chongyang
    Kai, Liu
    Liu, Hongjin
    IET COMPUTER VISION, 2024, 18 (07) : 992 - 1003
  • [7] MTT: Multi-Scale Temporal Transformer for Skeleton-Based Action Recognition
    Kong, Jun
    Bian, Yuhang
    Jiang, Min
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 528 - 532
  • [8] Multi-Scale Adaptive Skeleton Transformer for action
    Wang, Xiaotian
    Chen, Kai
    Zhao, Zhifu
    Shi, Guangming
    Xie, Xuemei
    Jiang, Xiang
    Yang, Yifan
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 250
  • [9] Semantic-Guided Relation Propagation Network for Few-shot Action Recognition
    Wang, Xiao
    Ye, Weirong
    Qi, Zhongang
    Zhao, Xun
    Wang, Guangge
    Shan, Ying
    Wang, Hanzi
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 816 - 825
  • [10] AeS-GCN: Attention-enhanced semantic-guided graph convolutional networks for skeleton-based action recognition
    Xu, Qing
    LiU, Feng
    Fu, Ziwang
    Zhou, Aimin
    Qi, Jiayin
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (3-4)