Skeleton-Aware Neural Sign Language Translation

被引:8
|
作者
Gan, Shiwei [1 ]
Yin, Yafeng [1 ]
Jiang, Zhiwei [1 ]
Xie, Lei [1 ]
Lu, Sanglu [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Sign Language Translation; Skeleton; Neural Network; RECOGNITION;
D O I
10.1145/3474085.3475577
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an essential communication way for deaf-mutes, sign languages are expressed by human actions. To distinguish human actions for sign language understanding, the skeleton which contains position information of human pose can provide an important cue, since different actions usually correspond to different poses/skeletons. However, skeleton has not been fully studied for Sign Language Translation (SLT), especially for end-to-end SLT. Therefore, in this paper, we propose a novel end-to-end Skeleton-Aware neural Network (SANet) for video-based SLT. Specifically, to achieve end-toend SLT, we design a self-contained branch for skeleton extraction. To efficiently guide the feature extraction from video with skeletons, we concatenate the skeleton channel and RGB channels of each frame for feature extraction. To distinguish the importance of clips, we construct a skeleton-based Graph Convolutional Network (GCN) for feature scaling, i.e., giving importance weight for each clip. The scaled features of each clip are then sent to a decoder module to generate spoken language. In our SANet, a joint training strategy is designed to optimize skeleton extraction and sign language translation jointly. Experimental results on two large scale SLT datasets demonstrate the effectiveness of our approach, which outperforms the state-of-the-art methods. Our code is available at https://github.com/SignLanguageCode/SANet.
引用
收藏
页码:4353 / 4361
页数:9
相关论文
共 50 条
  • [1] Pose and Skeleton-aware Neural IK for Pose and Motion Editing
    Agrawal, Dhruv
    Guay, Martin
    Buhmann, Jakob
    Borer, Dominik
    Sumner, Robert W.
    [J]. PROCEEDINGS OF THE SIGGRAPH ASIA 2023 CONFERENCE PAPERS, 2023,
  • [2] Skeleton-Aware Networks for Deep Motion Retargeting
    Aberman, Kfir
    Li, Peizhuo
    Lischinski, Dani
    Sorkine-Hornung, Olga
    Cohen-Or, Daniel
    Chen, Baoquan
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2020, 39 (04):
  • [3] Skeleton-aware size variations in digital mannequins
    Kasap, Mustafa
    Magnenat-Thalmann, Nadia
    [J]. VISUAL COMPUTER, 2011, 27 (04): : 263 - 274
  • [4] Skeleton-aware size variations in digital mannequins
    Mustafa Kasap
    Nadia Magnenat-Thalmann
    [J]. The Visual Computer, 2011, 27 : 263 - 274
  • [5] Neural Sign Language Translation
    Camgoz, Necati Cihan
    Hadfield, Simon
    Koller, Oscar
    Ney, Hermann
    Bowden, Richard
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7784 - 7793
  • [6] Skeleton aware multi-modal sign language recognition
    Jiang, Songyao
    Sun, Bin
    Wang, Lichen
    Bai, Yue
    Li, Kunpeng
    Fu, Yun
    [J]. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2021, : 3408 - 3418
  • [7] Skeleton Aware Multi-modal Sign Language Recognition
    Jiang, Songyao
    Sun, Bin
    Wang, Lichen
    Bai, Yue
    Li, Kunpeng
    Fu, Yun
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3408 - 3418
  • [8] Neural Sign Language Translation by Learning Tokenization
    Orbay, Alptekin
    Akarun, Lale
    [J]. 2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 222 - 228
  • [9] Learnable Skeleton-Aware 3D Point Cloud Sampling
    Wen, Cheng
    Yu, Baosheng
    Tao, Dacheng
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17671 - 17681
  • [10] Using Neural Machine Translation Methods for Sign Language Translation
    Angelova, Galina
    Avramidis, Eleftherios
    Moeller, Sebastian
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022, : 273 - 284