MRI Vocal Tract Sagittal Slices Estimation During Speech Production of CV

被引:0
|
作者
Douros, Ioannis K. [1 ]
Kulkarni, Ajinkya [1 ]
Xie, Yu [2 ]
Dourou, Chrysanthi [3 ]
Felblinger, Jacques [4 ]
Isaieva, Karyna [5 ]
Vuissoz, Pierre-Andre [5 ]
Laprie, Yves [1 ]
机构
[1] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
[2] Wuhan Univ, Zhongnan Hosp, Dept Neurol, Wuhan 430071, Peoples R China
[3] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens 15773, Greece
[4] Univ Lorraine, INSERM 1433, CIC IT, CHRU Nancy, F-54000 Nancy, France
[5] Univ Lorraine, INSERM, IADI, U1254, F-54000 Nancy, France
关键词
image transformation; rtMRI data; speech resources enrichment; vocal tract; REAL-TIME MRI; RESOLUTION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose an algorithm for estimating vocal tract para sagittal slices in order to have a better overview of the behaviour of the articulators during speech production. The first step is to align the consonant-vowel (CV) data of the sagittal plains between them for the train speaker. Sets of transformations that connect the midsagittal frames with the neighbouring ones is acquired for the train speaker. Another set of transformations is calculated which transforms the midsagittal frames of the train speaker to the corresponding midsagittal frames of the test speaker and is used to adapt to the test speaker domain the previously computed sets of transformations. The newly adapted transformations are applied to the midsagittal frames of the test speaker in order to estimate the neighbouring sagittal frames. Several mono speaker models are combined to produce the final frame estimation. To evaluate the results, image cross-correlation between the original and the estimated frames was used. Results show good agreement between the original and the estimated frames.
引用
收藏
页码:1115 / 1119
页数:5
相关论文
共 50 条
  • [21] The interrelationship between the face and vocal tract configuration during audiovisual speech
    Scholes, Chris
    Skipper, Jeremy, I
    Johnston, Alan
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (51) : 32791 - 32798
  • [22] Development and implementation of fiducial markers for vocal tract MRI imaging and speech articulatory modelling
    Badin, Pierre
    Vargas, Julian Andres Valdes
    Koncki, Arielle
    Lamalle, Laurent
    Savariaux, Christophe
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1320 - 1324
  • [23] Estimation of Vocal-Tract Shape from Speech Spectrum and Speech Resynthesis Based on a Generative Model
    Kaburagi, Tokihiko
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 422 - 426
  • [24] Estimation of Vocal Tract Area Function for Mandarin Vowel Sequences Using MRI
    Wang, Gaowu
    Dang, Jianwu
    Kong, Jiangping
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1182 - +
  • [26] Vocal-tract models and their applications in education for intuitive understanding of speech production
    Arai, Takayuki
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2016, 37 (04) : 148 - 156
  • [27] MR IMAGING OF THE VOCAL-TRACT DURING VOWEL PRODUCTION
    LAKSHMINARAYANAN, AV
    LEE, S
    MCCUTCHEON, MJ
    JMRI-JOURNAL OF MAGNETIC RESONANCE IMAGING, 1991, 1 (01): : 71 - 76
  • [28] A novel approach to the estimation of voice source and vocal tract parameters from speech signals
    Ding, W
    Kasuya, H
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1257 - 1260
  • [29] DIRECT ESTIMATION OF VOCAL-TRACT SHAPE BY INVERSE FILTERING OF ACOUSTIC SPEECH WAVEFORMS
    WAKITA, H
    IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1973, AU21 (05): : 417 - 427
  • [30] Dynamic 3-D Visualization of Vocal Tract Shaping During Speech
    Zhu, Yinghua
    Kim, Yoon-Chul
    Proctor, Michael I.
    Narayanan, Shrikanth S.
    Nayak, Krishna S.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2013, 32 (05) : 838 - 848