MRI Vocal Tract Sagittal Slices Estimation During Speech Production of CV

Cited by: 0
Authors
Douros, Ioannis K. [1]
Kulkarni, Ajinkya [1]
Xie, Yu [2]
Dourou, Chrysanthi [3]
Felblinger, Jacques [4]
Isaieva, Karyna [5]
Vuissoz, Pierre-Andre [5]
Laprie, Yves [1]
Affiliations
[1] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
[2] Wuhan Univ, Zhongnan Hosp, Dept Neurol, Wuhan 430071, Peoples R China
[3] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens 15773, Greece
[4] Univ Lorraine, INSERM 1433, CIC IT, CHRU Nancy, F-54000 Nancy, France
[5] Univ Lorraine, INSERM, IADI, U1254, F-54000 Nancy, France
Keywords
image transformation; rtMRI data; speech resources enrichment; vocal tract; real-time MRI; resolution
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper we propose an algorithm for estimating parasagittal slices of the vocal tract, in order to give a better overview of the behaviour of the articulators during speech production. The first step is to align the consonant-vowel (CV) data across the sagittal planes for the training speaker. Sets of transformations that connect the midsagittal frames with the neighbouring ones are then acquired for the training speaker. Another set of transformations, which maps the midsagittal frames of the training speaker to the corresponding midsagittal frames of the test speaker, is calculated and used to adapt the previously computed sets of transformations to the test speaker domain. The newly adapted transformations are applied to the midsagittal frames of the test speaker in order to estimate the neighbouring sagittal frames. Several mono-speaker models are combined to produce the final frame estimation. To evaluate the results, image cross-correlation between the original and the estimated frames was used. Results show good agreement between the original and the estimated frames.
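The abstract names image cross-correlation as the evaluation measure but does not give the exact formula. As a minimal sketch, assuming the zero-lag, zero-mean normalized variant (a common choice, not confirmed by this record), the score between an original and an estimated frame could be computed as follows. The function name `normalized_cross_correlation` and the synthetic frames are illustrative only.

```python
import numpy as np


def normalized_cross_correlation(original, estimated):
    """Zero-lag, zero-mean normalized cross-correlation of two frames.

    Assumption: this particular normalization is chosen for illustration;
    the paper's exact definition is not stated in the abstract.
    Returns a value in [-1, 1]; 1 means identical up to gain and offset.
    """
    a = np.asarray(original, dtype=np.float64)
    b = np.asarray(estimated, dtype=np.float64)
    if a.shape != b.shape:
        raise ValueError("frames must have the same shape")

    # Remove the mean intensity so a constant brightness offset is ignored.
    a = a - a.mean()
    b = b - b.mean()

    # Normalize by the energy of both frames so a global gain is ignored.
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    if denom == 0.0:
        # At least one frame is constant; correlation is undefined, report 0.
        return 0.0
    return float((a * b).sum() / denom)


# Synthetic stand-ins for an original and an estimated sagittal frame.
rng = np.random.default_rng(0)
frame = rng.random((8, 8))
score_same = normalized_cross_correlation(frame, 2.0 * frame + 3.0)
score_inverted = normalized_cross_correlation(frame, -frame)
```

With this definition, a score near 1 indicates that the estimated frame matches the original up to a global intensity scaling and offset, which is one way to read "good agreement" between frames.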
Pages: 1115-1119
Page count: 5