Automatic generation of the complete vocal tract shape from the sequence of phonemes to be articulated

被引:6
|
作者
Ribeiro, Vinicius [1 ]
Isaieva, Karyna [2 ]
Leclere, Justine [2 ,3 ]
Vuissoz, Pierre-Andre [2 ]
Laprie, Yves [1 ]
机构
[1] Univ Lorraine, CNRS, Inria, LORIA, F-54000 Nancy, France
[2] Univ Lorraine, INSERM, U1254, IADI, F-54000 Nancy, France
[3] Hop Maison Blanche, Serv Medecine Bucco dentaire, F-51100 Reims, France
关键词
Phonetic-to-articulatory; Speech production; Vocal tract shape; MRI; SEGMENTATION;
D O I
10.1016/j.specom.2022.04.004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Articulatory speech synthesis requires generating realistic vocal tract shapes from the sequence of phonemes to be articulated. This work proposes the first model trained from rt-MRI films to automatically predict all of the vocal tract articulators' contours. The data are the contours tracked in the rt-MRI database recorded for one speaker. Those contours were exploited to train an encoder-decoder network to map the sequence of phonemes and their durations to the exact gestures performed by the speaker. Different from other works, all the individual articulator contours are predicted separately, allowing the investigation of their interactions. We measure four tract variables closely coupled with critical articulators and observe their variations over time. The test demonstrates that our model can produce high-quality shapes of the complete vocal tract with a good correlation between the predicted and the target variables observed in rt-MRI films, even though the tract variables are not included in the optimization procedure.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [41] Automatic Test Path Generation from Sequence Diagram Using Genetic Algorithm
    Hoseini, Bahare
    Jalili, Saeed
    2014 7TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2014, : 106 - 111
  • [42] Relationship matrix based automatic assembly sequence generation from a CAD model
    Ou, Li-Ming
    Xu, Xun
    COMPUTER-AIDED DESIGN, 2013, 45 (07) : 1053 - 1067
  • [43] Complete genome sequence of Yokenella regensburgei isolated from a patient with urinary tract infection in India
    Sahni, Rani Diana
    Aravind, V
    Suji, Thangamani
    Sheeba, Annie, V
    Jayanth, Selvin Theodore
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2024, 13 (06):
  • [44] Generation of a Complete Set of Additive Shape-Invariant Potentials from an Euler Equation
    Bougie, Jonathan
    Gangopadhyaya, Asim
    Mallow, Jeffry V.
    PHYSICAL REVIEW LETTERS, 2010, 105 (21)
  • [45] Automatic Hanging Point Learning from Random Shape Generation and Physical Function Validation
    Takeuchi, Kosuke
    Yanokura, Iori
    Kakiuchi, Yohei
    Okada, Kei
    Inaba, Masayuki
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4237 - 4243
  • [46] Complete Genome of a Novel Endornavirus Assembled from Next-Generation Sequence Data
    Espach, Yolandi
    Maree, Hans J.
    Burger, Johan T.
    JOURNAL OF VIROLOGY, 2012, 86 (23) : 13142 - 13142
  • [47] AUTOMATIC-GENERATION OF PRIMARY SEQUENCE PATTERNS FROM SETS OF RELATED PROTEIN SEQUENCES
    SMITH, RF
    SMITH, TF
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1990, 87 (01) : 118 - 122
  • [48] Automatic generation of TestNG tests cases from UML sequence diagrams in Scrum process
    Elallaoui, Meryem
    Nafil, Khalid
    Touahni, Raja
    2016 4TH IEEE INTERNATIONAL COLLOQUIUM ON INFORMATION SCIENCE AND TECHNOLOGY (CIST), 2016, : 65 - 70
  • [49] An open-source toolbox for measuring vocal tract shape from real-time magnetic resonance images
    Michel Belyk
    Christopher Carignan
    Carolyn McGettigan
    Behavior Research Methods, 2024, 56 : 2623 - 2635
  • [50] An open-source toolbox for measuring vocal tract shape from real-time magnetic resonance images
    Belyk, Michel
    Carignan, Christopher
    McGettigan, Carolyn
    BEHAVIOR RESEARCH METHODS, 2024, 56 (03) : 2623 - 2635