Neural Sign Language Translation

Cited by: 246
Authors
Camgoz, Necati Cihan [1]
Hadfield, Simon [1]
Koller, Oscar [2]
Ney, Hermann [2]
Bowden, Richard [1]
Affiliations
[1] Univ Surrey, Guildford, Surrey, England
[2] Rhein Westfal TH Aachen, Aachen, Germany
DOI
10.1109/CVPR.2018.00812
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sign Language Recognition (SLR) has been an active research field for the last two decades. However, most research to date has considered SLR as a naive gesture recognition problem. SLR seeks to recognize a sequence of continuous signs but neglects the underlying rich grammatical and linguistic structures of sign language that differ from spoken language. In contrast, we introduce the Sign Language Translation (SLT) problem. Here, the objective is to generate spoken language translations from sign language videos, taking into account the different word orders and grammar. We formalize SLT in the framework of Neural Machine Translation (NMT) for both end-to-end and pretrained settings (using expert knowledge). This allows us to jointly learn the spatial representations, the underlying language model, and the mapping between sign and spoken language. To evaluate the performance of Neural SLT, we collected the first publicly available Continuous SLT dataset, RWTH-PHOENIX-Weather 2014T. It provides spoken language translations and gloss-level annotations for German Sign Language videos of weather broadcasts. Our dataset contains over 0.95M frames with >67K signs from a sign vocabulary of >1K and >99K words from a German vocabulary of >2.8K. We report quantitative and qualitative results for various SLT setups to underpin future research in this newly established field. The upper bound for translation performance is calculated at 19.26 BLEU-4, while our end-to-end frame-level and gloss-level tokenization networks achieved 9.58 and 18.13, respectively.
Pages: 7784-7793
Page count: 10