A Data-Driven Representation for Sign Language Production

被引:0
|
作者
Walsh, Harry [1 ]
Ravanshad, Abolfazl [2 ]
Rahmani, Mariam [2 ]
Bowden, Richard [1 ]
机构
[1] Univ Surrey, CVSSP, Guildford, Surrey, England
[2] OmniBridge Ai, Washington, DC USA
基金
瑞士国家科学基金会;
关键词
RECOGNITION;
D O I
10.1109/FG59268.2024.10581995
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Phonetic representations are used when recording spoken languages, but no equivalent exists for recording signed languages. As a result, linguists have proposed several annotation systems that operate on the gloss or sub-unit level; however, these resources are notably irregular and scarce. Sign Language Production (SLP) aims to automatically translate spoken language sentences into continuous sequences of sign language. However, current state-of-the-art approaches rely on scarce linguistic resources to work. This has limited progress in the field. This paper introduces an innovative solution by transforming the continuous pose generation problem into a discrete sequence generation problem. Thus, overcoming the need for costly annotation. Although, if available, we leverage the additional information to enhance our approach. By applying Vector Quantisation (VQ) to sign language data, we first learn a codebook of short motions that can be combined to create a natural sequence of sign. Where each token in the codebook can be thought of as the lexicon of our representation. Then using a transformer we perform a translation from spoken language text to a sequence of codebook tokens. Each token can be directly mapped to a sequence of poses allowing the translation to be performed by a single network. Furthermore, we present a sign stitching method to effectively join tokens together. We evaluate on the RWTH-PHOENIX-Weather-2014T (PHOENIX14T) and the more challenging meineDGST (mDGS) datasets. An extensive evaluation shows our approach outperforms previous methods, increasing the BLEU-1 back translation score by up to 72%.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Data-driven background representation method to video surveillance
    Li, Zhihui
    Xia, Yingji
    Qu, Zhaowei
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2017, 34 (02) : 193 - 202
  • [22] Ontological representation, classification and data-driven computing of phenotypes
    Uciteli, Alexandr
    Beger, Christoph
    Kirsten, Toralf
    Meineke, Frank A.
    Herre, Heinrich
    JOURNAL OF BIOMEDICAL SEMANTICS, 2020, 11 (01)
  • [23] A Scalable Framework for Data-Driven Subspace Representation and Clustering
    Kim, Eunwoo
    Lee, Minsik
    Oh, Songhwai
    PATTERN RECOGNITION LETTERS, 2019, 125 : 742 - 749
  • [24] A Gloss-free Sign Language Production with Discrete Representation
    Hwang, Eui Jun
    Lee, Huije
    Park, Jong C.
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [25] New Data-Driven Constraints on the Sign of Gluon Polarization in the Proton
    Hunt-Smith, N. T.
    Cocuzza, C.
    Melnitchouk, W.
    Sato, N.
    Thomas, A. W.
    White, M. J.
    PHYSICAL REVIEW LETTERS, 2024, 133 (16)
  • [26] Multi-sensor heterogeneous data representation for data-driven ITS
    Xia, Yingjie
    Li, Xiumei
    2013 16TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS - (ITSC), 2013, : 1750 - 1755
  • [27] Data-Driven Sub-Units and Modeling Structure for Continuous Sign Language Recognition with Multiple-Cues
    Pitsikalis, Vassilis
    Theodorakis, Stavros
    Maragos, Petros
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : A196 - A203
  • [28] Data-driven detection of figurative language use in electronic language resources
    Peters, W
    Wilks, Y
    METAPHOR AND SYMBOL, 2003, 18 (03) : 161 - 173
  • [29] Multiple Affordances of Language Corpora for Data-driven Learning
    Yoon, Hyung-Jo
    Lim, Jungmin
    LANGUAGE LEARNING & TECHNOLOGY, 2016, 20 (01): : 42 - 45
  • [30] Data-Driven Production because of Digital Platforms
    Giese T.
    Hock F.
    Meldt L.
    Herrmann J.
    Wünschel W.
    Metternich J.
    Anderl R.
    Schleich B.
    ZWF Zeitschrift fuer Wirtschaftlichen Fabrikbetrieb, 119 (05): : 366 - 371