A Data-Driven Representation for Sign Language Production

被引:0
|
作者
Walsh, Harry [1 ]
Ravanshad, Abolfazl [2 ]
Rahmani, Mariam [2 ]
Bowden, Richard [1 ]
机构
[1] Univ Surrey, CVSSP, Guildford, Surrey, England
[2] OmniBridge Ai, Washington, DC USA
基金
瑞士国家科学基金会;
关键词
RECOGNITION;
D O I
10.1109/FG59268.2024.10581995
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Phonetic representations are used when recording spoken languages, but no equivalent exists for recording signed languages. As a result, linguists have proposed several annotation systems that operate on the gloss or sub-unit level; however, these resources are notably irregular and scarce. Sign Language Production (SLP) aims to automatically translate spoken language sentences into continuous sequences of sign language. However, current state-of-the-art approaches rely on scarce linguistic resources to work. This has limited progress in the field. This paper introduces an innovative solution by transforming the continuous pose generation problem into a discrete sequence generation problem. Thus, overcoming the need for costly annotation. Although, if available, we leverage the additional information to enhance our approach. By applying Vector Quantisation (VQ) to sign language data, we first learn a codebook of short motions that can be combined to create a natural sequence of sign. Where each token in the codebook can be thought of as the lexicon of our representation. Then using a transformer we perform a translation from spoken language text to a sequence of codebook tokens. Each token can be directly mapped to a sequence of poses allowing the translation to be performed by a single network. Furthermore, we present a sign stitching method to effectively join tokens together. We evaluate on the RWTH-PHOENIX-Weather-2014T (PHOENIX14T) and the more challenging meineDGST (mDGS) datasets. An extensive evaluation shows our approach outperforms previous methods, increasing the BLEU-1 back translation score by up to 72%.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] (Data-driven) knowledge representation in Industry 4.0 scheduling problems
    Rossit, Daniel A.
    Tohme, Fernando
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2022, 35 (10-11) : 1172 - 1187
  • [42] Dual System Representation And Prediction Method for Data-Driven Estimation
    Adachi, Ryosuke
    Wakasa, Yuji
    2021 60TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2021, : 1245 - 1250
  • [43] Efficient and Flexible Deformation Representation for Data-Driven Surface Modeling
    Gao, Lin
    Lai, Yu-Kun
    Liang, Dun
    Chen, Shu-Yu
    Xia, Shihong
    ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (05):
  • [44] Sign Determination Methods for the Respiratory Signal in Data-Driven PET Gating
    Bertolli, Ottavia
    Arridge, Simon
    Stearns, Charles W.
    Wollenweber, Scott D.
    Hutton, Brian F.
    Thielemans, Kris
    2015 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2015,
  • [45] Sign determination methods for the respiratory signal in data-driven PET gating
    Bertolli, Ottavia
    Arridge, Simon
    Wollenweber, Scott D.
    Stearns, Charles W.
    Hutton, Brian F.
    Thielemans, Kris
    PHYSICS IN MEDICINE AND BIOLOGY, 2017, 62 (08): : 3204 - 3220
  • [46] Data-Driven Rule Mining and Representation of Temporal Patterns in Physiological Sensor Data
    Banaee, Hadi
    Loutfi, Amy
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2015, 19 (05) : 1557 - 1566
  • [47] A model-based data-driven dictionary learning for seismic data representation
    Yarman, Can Evren
    Kumar, Rajiv
    Rickett, James
    GEOPHYSICAL PROSPECTING, 2018, 66 (01) : 98 - 123
  • [48] A full data-driven system for multiple language question answering
    Montes-y-Gomez, Manuel
    Villasenor-Pineda, Luis
    Perez-Coutino, Manuel
    Gomez-Soriano, Jose Manuel
    Sanchis-Arnal, Emilio
    Rosso, Paolo
    ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 420 - 428
  • [49] Multiple Affordances of Language Corpora in Data-driven Learning.
    Smart, Jon
    Corpora, 2016, 11 (03) : 465 - 468
  • [50] Narrative Bytes: Data-Driven Content Production in Esports
    Block, Florian
    Hodge, Victoria
    Hobson, Stephen
    Sephton, Nick
    Devlin, Sam
    Ursu, Marian F.
    Drachen, Anders
    Cowling, Peter I.
    TVX 2018: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON INTERACTIVE EXPERIENCES FOR TV AND ONLINE VIDEO, 2018, : 29 - 41