Continuous Sign Language Recognition Based on Spatial-Temporal Graph Attention Network

Cited by: 4
Authors
Guo, Qi [1]
Zhang, Shujun [1]
Li, Hui [1]
Affiliation
[1] Qingdao Univ Sci & Technol, Coll Informat Sci & Technol, Qingdao 266061, Peoples R China
Source
Keywords
Continuous sign language recognition; graph attention network; bidirectional long short-term memory; connectionist temporal classification
DOI
10.32604/cmes.2022.021784
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
Continuous sign language recognition (CSLR) is challenging because of complex video backgrounds, variability in hand gestures, and the difficulty of temporal modeling. This work proposes a CSLR method based on a spatial-temporal graph attention network that focuses on the essential features of video sequences. The method captures local details of sign language movements by taking joint and bone information as input and constructing a spatial-temporal graph that reflects inter-frame relevance and the physical connections between nodes. A graph-based multi-head attention mechanism, computed with the adjacency matrix, is used for better local-feature exploration, and short-term motion correlations are modeled with a temporal convolutional network. A bidirectional long short-term memory (BLSTM) network learns long-term dependencies, and connectionist temporal classification aligns the word-level sequences. The proposed method achieves competitive results: a word error rate of 1.59% on the Chinese Sign Language dataset and a mean Jaccard Index of 65.78% on the ChaLearn LAP Continuous Gesture Dataset.
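The attention step described in the abstract can be illustrated with a minimal sketch. This is a generic single-head, GAT-style layer restricted by an adjacency matrix, written in plain Python; it is not the authors' implementation. The function name `graph_attention` and the parameters `weight` and `attn_vec` are hypothetical, and the adjacency matrix is assumed to include self-loops so that every joint has at least one neighbour. A multi-head variant would typically run several such heads and concatenate or average their outputs.

```python
import math


def leaky_relu(x, slope=0.2):
    """LeakyReLU, as commonly used for GAT-style attention scores."""
    return x if x > 0 else slope * x


def graph_attention(features, adjacency, weight, attn_vec):
    """Single-head graph attention over one frame's skeleton graph.

    features:  N x F list of node (joint) feature vectors
    adjacency: N x N 0/1 matrix, 1 where joints are connected
               (assumed to include self-loops)
    weight:    F x F' projection matrix (hypothetical learned parameter)
    attn_vec:  length 2*F' attention vector (hypothetical learned parameter)
    """
    n = len(features)
    out_dim = len(weight[0])

    # Project every node feature vector: h_i -> h_i W
    proj = [
        [sum(f[k] * weight[k][d] for k in range(len(f))) for d in range(out_dim)]
        for f in features
    ]

    output = []
    for i in range(n):
        # Only nodes connected to i in the adjacency matrix are attended to.
        neighbours = [j for j in range(n) if adjacency[i][j]]

        # Unnormalised score for each neighbour: LeakyReLU(a . [h_i || h_j])
        scores = [
            leaky_relu(sum(a * v for a, v in zip(attn_vec, proj[i] + proj[j])))
            for j in neighbours
        ]

        # Numerically stable softmax over the neighbourhood
        peak = max(scores)
        exps = [math.exp(s - peak) for s in scores]
        total = sum(exps)
        alphas = [e / total for e in exps]

        # Attention-weighted sum of the neighbours' projected features
        output.append(
            [sum(a * proj[j][d] for a, j in zip(alphas, neighbours))
             for d in range(out_dim)]
        )
    return output


# Toy example: three joints in a chain (0-1-2), with self-loops.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
adjacency = [[1, 1, 0], [1, 1, 1], [0, 1, 1]]
identity = [[1.0, 0.0], [0.0, 1.0]]
zero_attn = [0.0, 0.0, 0.0, 0.0]  # all-zero scores give uniform attention

result = graph_attention(features, adjacency, identity, zero_attn)
```

With an all-zero attention vector, every neighbour receives the same weight, so each output row is simply the mean of the connected joints' features. Non-zero learned parameters would shift weight toward the more informative neighbours.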
Pages: 1653-1670
Page count: 18