A neural network model of the articulatory-acoustic forward mapping trained on recordings of articulatory parameters

被引:44
|
作者
Kello, CT [1 ]
Plaut, DC
机构
[1] George Mason Univ, Dept Psychol, Fairfax, VA 22030 USA
[2] Carnegie Mellon Univ, Dept Psychol, Ctr Neural Basis Cognit, Pittsburgh, PA 15213 USA
来源
关键词
D O I
10.1121/1.1715112
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Three neural network models were trained on the forward mapping from articulatory positions to acoustic outputs for a single speaker of the Edinburgh multi-channel articulatory speech database. The model parameters (i.e., connection weights) were learned via the backpropagation of error signals generated by the difference between acoustic outputs of the models, and their acoustic targets. Efficacy of the trained models was assessed by subjecting the models' acoustic outputs to speech intelligibility tests. The results of these tests showed that enough phonetic information was captured by the models to support rates of word identification as high as 84%, approaching an identification rate of 92% for the actual target stimuli. These forward models could serve as one component of a data-driven articulatory synthesizer. The models also provide the first step toward building a model of spoken word acquisition and phonological development trained on real speech. (C) 2004 Acoustical Society of America.
引用
收藏
页码:2354 / 2364
页数:11
相关论文
共 50 条
  • [31] Compact representations of the articulatory-to-acoustic mapping
    Potard, Blaise
    Laprie, Yves
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2884 - 2887
  • [32] Articulatory-to-acoustic mapping for inverse problem
    Sorokin, VN
    Trushkin, AV
    SPEECH COMMUNICATION, 1996, 19 (02) : 105 - 118
  • [33] ARTICULATORY MODEL AND ESTIMATION OF ARTICULATORY PARAMETERS BY NONLINEAR-REGRESSION METHOD
    SHIRAI, K
    HONDA, M
    ELECTRONICS & COMMUNICATIONS IN JAPAN, 1977, 59 (08): : 35 - 43
  • [34] Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models
    Shahrebabaki, Abdolreza Sabzi
    Salvi, Giampiero
    Svendsen, Torbjorn
    Siniscalchi, Sabato Marco
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 135 - 147
  • [35] Effect of Articulatory Δ and ΔΔ Parameters on Multilayer Neural Network based Speech Recognition
    Banik, Manoj
    Kotwal, Mohammed Rokibul Alam
    Hassan, Foyzul
    Islam, Gazi Md. Moshfiqul
    Rahman, Sharif Mohammad Musfiqur
    Hasan, Mohammad Mahedi
    Muhammad, Ghulam
    Huda, Mohammad Nurul
    PROCEEDINGS OF THE 2010 IEEE ASIA PACIFIC CONFERENCE ON CIRCUIT AND SYSTEM (APCCAS), 2010, : 624 - 627
  • [36] An elitist approach to automatic articulatory-acoustic feature classification for phonetic characterization of spoken language
    Chang, SY
    Wester, M
    Greenberg, S
    SPEECH COMMUNICATION, 2005, 47 (03) : 290 - 311
  • [37] Speech modelling based on acoustic-to-articulatory mapping
    Schoentgen, J
    NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 114 - 135
  • [38] Articulatory-acoustic vowel space: Application to clear speech in individuals with Parkinson's disease
    Whitfield, Jason A.
    Goberman, Alexander M.
    JOURNAL OF COMMUNICATION DISORDERS, 2014, 51 : 19 - 28
  • [39] An Empirical Investigation of the Nonuniqueness in the Acoustic-to-Articulatory Mapping
    Qin, Chao
    Carreira-Perpinan, Miguel A.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2300 - 2303
  • [40] REPRESENTATION LEARNING USING CONVOLUTION NEURAL NETWORK FOR ACOUSTIC-TO-ARTICULATORY INVERSION
    Illa, Aravind
    Ghosh, Prasanta Kumar
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5931 - 5935