Continuous Sign Language Recognition Based on Spatial-Temporal Graph Attention Network

Cited by: 4
Authors
Guo, Qi [1]
Zhang, Shujun [1]
Li, Hui [1]
Affiliations
[1] Qingdao Univ Sci & Technol, Coll Informat Sci & Technol, Qingdao 266061, Peoples R China
Source
Keywords
Continuous sign language recognition; graph attention network; bidirectional long short-term memory; connectionist temporal classification
DOI
10.32604/cmes.2022.021784
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
Continuous sign language recognition (CSLR) is challenging due to complex video backgrounds, hand gesture variability, and the difficulty of temporal modeling. This work proposes a CSLR method based on a spatial-temporal graph attention network that focuses on the essential features of video sequences. The method captures local details of sign language movements by taking joint and bone information as input and constructing a spatial-temporal graph that reflects inter-frame relevance and the physical connections between nodes. A graph-based multi-head attention mechanism is combined with adjacency matrix calculation for better local-feature exploration, and short-term motion correlations are modeled with a temporal convolutional network. A bidirectional long short-term memory (BLSTM) network learns long-term dependencies, and connectionist temporal classification aligns the word-level sequences. The proposed method achieves competitive results: a word error rate of 1.59% on the Chinese Sign Language dataset and a mean Jaccard Index of 65.78% on the ChaLearn LAP Continuous Gesture Dataset.
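The abstract reports a word error rate obtained after connectionist temporal classification (CTC) alignment. As a minimal sketch of how such a figure is typically computed, and not the authors' implementation, the Python below collapses per-frame label predictions with greedy CTC decoding and scores the result against a reference gloss sequence using word-level edit distance. The function names `ctc_greedy_collapse` and `word_error_rate`, the blank index 0, and the toy label values are all assumptions made for illustration.

```python
from itertools import groupby


def ctc_greedy_collapse(frame_labels, blank=0):
    """Greedy CTC decoding: merge consecutive repeats, then drop blanks.

    `frame_labels` is a hypothetical list of per-frame argmax label ids;
    `blank` is the assumed id of the CTC blank symbol.
    """
    merged = [label for label, _ in groupby(frame_labels)]
    return [label for label in merged if label != blank]


def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    n, m = len(reference), len(hypothesis)
    if n == 0:
        raise ValueError("reference must not be empty")
    # prev[j] holds the distance between reference[:i-1] and hypothesis[:j].
    prev = list(range(m + 1))
    for i in range(1, n + 1):
        curr = [i] + [0] * m
        for j in range(1, m + 1):
            cost = 0 if reference[i - 1] == hypothesis[j - 1] else 1
            curr[j] = min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            )
        prev = curr
    return prev[m] / n


if __name__ == "__main__":
    frames = [0, 3, 3, 0, 0, 5, 5, 5, 0, 3]  # toy per-frame predictions
    hypothesis = ctc_greedy_collapse(frames)
    reference = [3, 5, 7]                     # toy ground-truth gloss ids
    print(hypothesis, word_error_rate(reference, hypothesis))
```

In this toy case the decoded sequence differs from the reference by one substitution out of three reference words. A published system may use beam search rather than greedy decoding, so this sketch only illustrates the metric itself.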
Pages: 1653-1670
Number of pages: 18
Related papers
50 in total
  • [21] Gait Recognition Algorithm based on Spatial-temporal Graph Neural Network
    Zhou, Jian
    Yan, Shi
    Zhang, Jie
    2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 63 - 67
  • [22] Gait Recognition Algorithm based on Spatial-temporal Graph Neural Network
    Lan, TianYi
    Shi, ZongBin
    Wang, KeJun
    Yin, ChaoQun
    2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 55 - 58
  • [23] A Dual Attention Spatial-Temporal Graph Convolutional Network for Emotion Recognition from Gait
    Liu, Jiaqing
    Kisita, Shoji
    Chai, Shurong
    Tateyama, Tomoko
    Iwamoto, Yutaro
    Chen, Yen-Wei
    Journal of the Institute of Image Electronics Engineers of Japan, 2022, 51 (04): : 309 - 317
  • [24] An Attention Enhanced Spatial-Temporal Graph Convolutional LSTM Network for Action Recognition in Karate
    Guo, Jianping
    Liu, Hong
    Li, Xi
    Xu, Dahong
    Zhang, Yihan
    APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [25] Spatial-Temporal Graph Neural Network based Hand Gesture Recognition
    Yuan G.
    Bing R.
    Liu X.
    Dai W.
    Zhang Y.-M.
    Cai Z.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (04): : 921 - 931
  • [26] Gait Recognition Algorithm based on Spatial-temporal Graph Neural Network
    Shi, Huan
    Hui, Bo
    Hu, Biao
    Gu, RongJie
    2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 59 - 62
  • [27] A New Partitioned Spatial-Temporal Graph Attention Convolution Network for Human Motion Recognition
    Guo, Keyou
    Wang, Pengshuo
    Shi, Peipeng
    He, Chengbo
    Wei, Caili
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [28] StepNet: Spatial-temporal Part-aware Network for Isolated Sign Language Recognition
    Shen, Xiaolong
    Zheng, Zhedong
    Yang, Yi
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (07)
  • [29] Spatial-Temporal Convolutional Attention Network for Action Recognition
    Luo, Huilan
    Chen, Han
    Computer Engineering and Applications, 2023, 59 (09): : 150 - 158
  • [30] Advancing Continuous Sign Language Recognition Through Denoising Diffusion Transformer-Based Spatial-Temporal Enhancement
    Kamal, Suhail Muhammad
    Chen, Yidong
    Li, Shaozi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2025, 37 (4-5):