A novel neural-based model for acoustic-articulatory inversion mapping

被引：0

作者：

Hossein Behbood

Seyyed Ali Seyyedsalehi

Hamid Reza Tohidypour

Mojtaba Najafi

Shahriar Gharibzadeh

机构：

[1] Amirkabir University of Technology,Department of Biomedical Engineering

[2] Azad University,undefined

来源：

Neural Computing and Applications | 2012年 / 21卷

关键词：

Bidirectional neural networks (BNNs); Feed-forward networks (FFNs); Time delay neural networks (TDNNs); MOCHA-TIMIT database; Acoustic-articulatory inversion mapping;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, a new bidirectional neural network for better acoustic-articulatory inversion mapping is proposed. The model is motivated by the parallel structure of human brain, processing information by having forward--reverse connections. In other words, there would be a feedback from articulatory system to the acoustic signals emitted from that organ. Inspired by this mechanism, a new bidirectional model is developed to map speech representations to articulatory features. Formation of attractor dynamics in such bidirectional model is first carried out by training the reference speaker subspace as the continuous attractor. Then, it is used to recognize the other speaker’s speech. In fact, the structure and training of this bidirectional model is designed in such a way that the network learns to denoise the signal step by step, using properties of attractors it has formed. In this work, the efficiency of a nonlinear feedforward network is compared to the same one with a bidirectional connection. The bidirectional model increases the accuracy up to approximately 3% (from 62.09 to 64.91%) in the phone recognition process.

引用

页码：935 / 943

页数：8

共 50 条

[1] A novel neural-based model for acoustic-articulatory inversion mapping
Behbood, Hossein
Seyyedsalehi, Seyyed Ali
Tohidypour, Hamid Reza
Najafi, Mojtaba
Gharibzadeh, Shahriar
[J]. NEURAL COMPUTING & APPLICATIONS, 2012, 21 (05): : 935 - 943
[2] A Trajectory Mixture Density Network for the Acoustic-Articulatory Inversion Mapping
Richmond, Korin
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 577 - 580
[3] A Multitask Learning Perspective on Acoustic-Articulatory Inversion
Richmond, Korin
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2356 - 2359
[4] ARTICULATORY COMPENSATION - STUDY OF AMBIGUITIES IN ACOUSTIC-ARTICULATORY MAPPING
ATAL, B
CHANG, JJ
MATHEWS, MV
TUKEY, JW
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 60 : S77 - S77
[5] Acoustic-articulatory mapping in vowels by locally weighted regression
McGowan, Richard S.
Berger, Michael A.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (04): : 2011 - 2032
[6] Speaker Adaptation of an Acoustic-Articulatory Inversion Model using Cascaded Gaussian Mixture Regressions
Hueber, Thomas
Bailly, Gerard
Badin, Pierre
Elisei, Frederic
[J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2752 - 2756
[7] ACOUSTIC-TO-ARTICULATORY INVERSION FOR DYSARTHRIC SPEECH BY USING CROSS-CORPUS ACOUSTIC-ARTICULATORY DATA
Maharana, Sarthak Kumar
Illa, Aravind
Mannem, Renuka
Belur, Yamini
Shetty, Preetie
Kumar, Veeramani Preethish
Vengalil, Seena
Polavarapu, Kiran
Atchayaram, Nalini
Ghosh, Prasanta Kumar
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6458 - 6462
[8] Trajectory mixture density networks with multiple mixtures for acoustic-articulatory inversion
Richmond, Korin
[J]. ADVANCES IN NONLINEAR SPEECH PROCESSING, 2007, 4885 : 263 - 272
[9] Acoustic-to-Articulatory Inversion Mapping based on Latent Trajectory Gaussian Mixture Model
Tobing, Patrick Lumban
Toda, Tomoki
Kameoka, Hirokazu
Nakamura, Satoshi
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 953 - 957
[10] A Comparison of Fitness Functions in a Genetic Algorithm for Acoustic-Articulatory Parameter Inversion of Vowels
Drayton, Jared
Miranda, Eduardo
Kirke, Alexis
[J]. PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 271 - 272

← 1 2 3 4 5 →