MRI Vocal Tract Sagittal Slices Estimation During Speech Production of CV

Authors
Douros, Ioannis K. [1]
Kulkarni, Ajinkya [1]
Xie, Yu [2]
Dourou, Chrysanthi [3]
Felblinger, Jacques [4]
Isaieva, Karyna [5]
Vuissoz, Pierre-Andre [5]
Laprie, Yves [1]
Affiliations
[1] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
[2] Wuhan Univ, Zhongnan Hosp, Dept Neurol, Wuhan 430071, Peoples R China
[3] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens 15773, Greece
[4] Univ Lorraine, INSERM 1433, CIC IT, CHRU Nancy, F-54000 Nancy, France
[5] Univ Lorraine, INSERM, IADI, U1254, F-54000 Nancy, France
Keywords
image transformation; rtMRI data; speech resources enrichment; vocal tract; REAL-TIME MRI; RESOLUTION;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
In this paper, we propose an algorithm for estimating parasagittal slices of the vocal tract in order to give a better overview of the behaviour of the articulators during speech production. The first step is to align the consonant-vowel (CV) data across the sagittal planes for the training speaker. Sets of transformations that connect the midsagittal frames with the neighbouring ones are then acquired for the training speaker. Another set of transformations, which maps the midsagittal frames of the training speaker to the corresponding midsagittal frames of the test speaker, is calculated and used to adapt the previously computed sets of transformations to the test speaker domain. The newly adapted transformations are applied to the midsagittal frames of the test speaker in order to estimate the neighbouring sagittal frames. Several mono-speaker models are combined to produce the final frame estimation. To evaluate the results, image cross-correlation between the original and the estimated frames was used. Results show good agreement between the original and the estimated frames.
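The abstract does not say which form of image cross-correlation was used for evaluation. As an illustration only, the sketch below computes a zero-mean normalized cross-correlation at zero shift between an original and an estimated frame. The function name `normalized_cross_correlation` and the representation of a frame as a list of pixel rows are assumptions made for this example, not details taken from the paper.

```python
from math import sqrt


def normalized_cross_correlation(original, estimated):
    """Zero-mean normalized cross-correlation of two frames at zero shift.

    Each frame is a list of rows of pixel intensities (an assumed format).
    The score lies in [-1, 1]; values near 1 indicate close agreement.
    """
    # Flatten both frames into one list of float intensities each.
    a = [float(p) for row in original for p in row]
    b = [float(p) for row in estimated for p in row]
    if not a or len(a) != len(b):
        raise ValueError("frames must be non-empty and have the same pixel count")

    # Remove the mean so that a constant brightness offset does not matter.
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    da = [x - mean_a for x in a]
    db = [y - mean_b for y in b]

    # Normalize by the product of the two standard-deviation-like terms.
    denom = sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    if denom == 0.0:
        # At least one frame is constant; the correlation is undefined, return 0.
        return 0.0
    return sum(x * y for x, y in zip(da, db)) / denom
```

Because the mean is subtracted and the result is normalized, the score is insensitive to a global gain or brightness offset between the two frames, which is one reason this variant is a common choice for comparing images; whether the authors used this exact variant is not stated.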
Pages: 1115-1119
Page count: 5