A neural network model of the articulatory-acoustic forward mapping trained on recordings of articulatory parameters

Cited by: 44

Authors:

Kello, CT [1]

Plaut, DC

Affiliations:

[1] George Mason Univ, Dept Psychol, Fairfax, VA 22030 USA

[2] Carnegie Mellon Univ, Dept Psychol, Ctr Neural Basis Cognit, Pittsburgh, PA 15213 USA

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2004 / Vol. 116 / No. 4

Keywords:

DOI:

10.1121/1.1715112

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Three neural network models were trained on the forward mapping from articulatory positions to acoustic outputs for a single speaker of the Edinburgh multi-channel articulatory speech database. The model parameters (i.e., connection weights) were learned via the backpropagation of error signals generated by the difference between the acoustic outputs of the models and their acoustic targets. Efficacy of the trained models was assessed by subjecting the models' acoustic outputs to speech intelligibility tests. The results of these tests showed that enough phonetic information was captured by the models to support rates of word identification as high as 84%, approaching an identification rate of 92% for the actual target stimuli. These forward models could serve as one component of a data-driven articulatory synthesizer. The models also provide the first step toward building a model of spoken word acquisition and phonological development trained on real speech. © 2004 Acoustical Society of America.


Pages: 2354 - 2364

Page count: 11

50 related records in total

[31] Compact representations of the articulatory-to-acoustic mapping
Potard, Blaise
Laprie, Yves
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007: 2884 - 2887
[32] Articulatory-to-acoustic mapping for inverse problem
Sorokin, VN
Trushkin, AV
SPEECH COMMUNICATION, 1996, 19 (02) : 105 - 118
[33] ARTICULATORY MODEL AND ESTIMATION OF ARTICULATORY PARAMETERS BY NONLINEAR-REGRESSION METHOD
SHIRAI, K
HONDA, M
ELECTRONICS & COMMUNICATIONS IN JAPAN, 1977, 59 (08) : 35 - 43
[34] Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models
Shahrebabaki, Abdolreza Sabzi
Salvi, Giampiero
Svendsen, Torbjorn
Siniscalchi, Sabato Marco
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 135 - 147
[35] Effect of Articulatory Δ and ΔΔ Parameters on Multilayer Neural Network based Speech Recognition
Banik, Manoj
Kotwal, Mohammed Rokibul Alam
Hassan, Foyzul
Islam, Gazi Md. Moshfiqul
Rahman, Sharif Mohammad Musfiqur
Hasan, Mohammad Mahedi
Muhammad, Ghulam
Huda, Mohammad Nurul
PROCEEDINGS OF THE 2010 IEEE ASIA PACIFIC CONFERENCE ON CIRCUIT AND SYSTEM (APCCAS), 2010: 624 - 627
[36] An elitist approach to automatic articulatory-acoustic feature classification for phonetic characterization of spoken language
Chang, SY
Wester, M
Greenberg, S
SPEECH COMMUNICATION, 2005, 47 (03) : 290 - 311
[37] Speech modelling based on acoustic-to-articulatory mapping
Schoentgen, J
NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 114 - 135
[38] Articulatory-acoustic vowel space: Application to clear speech in individuals with Parkinson's disease
Whitfield, Jason A.
Goberman, Alexander M.
JOURNAL OF COMMUNICATION DISORDERS, 2014, 51 : 19 - 28
[39] An Empirical Investigation of the Nonuniqueness in the Acoustic-to-Articulatory Mapping
Qin, Chao
Carreira-Perpinan, Miguel A.
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007: 2300 - 2303
[40] REPRESENTATION LEARNING USING CONVOLUTION NEURAL NETWORK FOR ACOUSTIC-TO-ARTICULATORY INVERSION
Illa, Aravind
Ghosh, Prasanta Kumar
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019: 5931 - 5935
