MRI Vocal Tract Sagittal Slices Estimation During Speech Production of CV

被引:0
|
作者
Douros, Ioannis K. [1 ]
Kulkarni, Ajinkya [1 ]
Xie, Yu [2 ]
Dourou, Chrysanthi [3 ]
Felblinger, Jacques [4 ]
Isaieva, Karyna [5 ]
Vuissoz, Pierre-Andre [5 ]
Laprie, Yves [1 ]
机构
[1] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
[2] Wuhan Univ, Zhongnan Hosp, Dept Neurol, Wuhan 430071, Peoples R China
[3] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens 15773, Greece
[4] Univ Lorraine, INSERM 1433, CIC IT, CHRU Nancy, F-54000 Nancy, France
[5] Univ Lorraine, INSERM, IADI, U1254, F-54000 Nancy, France
关键词
image transformation; rtMRI data; speech resources enrichment; vocal tract; REAL-TIME MRI; RESOLUTION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose an algorithm for estimating vocal tract para sagittal slices in order to have a better overview of the behaviour of the articulators during speech production. The first step is to align the consonant-vowel (CV) data of the sagittal plains between them for the train speaker. Sets of transformations that connect the midsagittal frames with the neighbouring ones is acquired for the train speaker. Another set of transformations is calculated which transforms the midsagittal frames of the train speaker to the corresponding midsagittal frames of the test speaker and is used to adapt to the test speaker domain the previously computed sets of transformations. The newly adapted transformations are applied to the midsagittal frames of the test speaker in order to estimate the neighbouring sagittal frames. Several mono speaker models are combined to produce the final frame estimation. To evaluate the results, image cross-correlation between the original and the estimated frames was used. Results show good agreement between the original and the estimated frames.
引用
收藏
页码:1115 / 1119
页数:5
相关论文
共 50 条
  • [41] Vocal Tract Images Reveal Neural Representations of Sensorimotor Transformation During Speech Imitation
    Carey, Daniel
    Miquel, Marc E.
    Evans, Bronwen G.
    Adank, Patti
    McGettigan, Carolyn
    CEREBRAL CORTEX, 2017, 27 (05) : 3064 - 3079
  • [42] Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI
    Saha, Pramit
    Srungarapu, Praneeth
    Fels, Sidney
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1249 - 1253
  • [43] Silent Speech and Emotion Recognition from Vocal Tract Shape Dynamics in Real-Time MRI
    Pandey, Laxmi
    Arif, Ahmed Sabbir
    SIGGRAPH '21: ACM SIGGRAPH 2021 POSTERS, 2021,
  • [44] Estimation of the air-tissue boundaries of the vocal tract in the mid-sagittal plane from electromagnetic articulograph data
    Parida, Satyabrata
    Kumar, Pattern Ashok
    Ghosh, Prasanta Kumar
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2147 - 2151
  • [45] Magnetic resonance imaging of the brain and vocal tract: Applications to the study of speech production and language learning
    Carey, Daniel
    McGettigan, Carolyn
    NEUROPSYCHOLOGIA, 2017, 98 : 201 - 211
  • [46] Changes in the human vocal tract due to aging and the acoustic correlates of speech production: A pilot study
    Xue, SA
    Hao, GJP
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2003, 46 (03): : 689 - 701
  • [47] Human Vocal Tract Analysis by in Vivo 3D MRI during Phonation: A Complete System for Imaging, Quantitative Modeling, and Speech Synthesis
    Wismueller, Axel
    Behrends, Johannes
    Hoole, Phil
    Leinsinger, Gerda L.
    Reiser, Maximilian F.
    Westesson, Per-Lennart
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2008, PT II, PROCEEDINGS, 2008, 5242 : 306 - 312
  • [48] Acoustic and Aerodynamic Coupling during Phonation in MRI-Based Vocal Tract Replicas
    Probst, Judith
    Lodermeyer, Alexander
    Fattoum, Sahar
    Becker, Stefan
    Echternach, Matthias
    Richter, Bernhard
    Doellinger, Michael
    Kniesburges, Stefan
    APPLIED SCIENCES-BASEL, 2019, 9 (17):
  • [49] A Maximum Likelihood Estimation of Vocal-Tract-Related Filter Characteristics for Single Channel Speech Separation
    Mohammad H. Radfar
    Richard M. Dansereau
    Abolghasem Sayadiyan
    EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [50] A Maximum Likelihood Estimation of Vocal-Tract-Related Filter Characteristics for Single Channel Speech Separation
    Radfar, Mohammad H.
    Dansereau, RichardM.
    Sayadiyan, Abolghasem
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)