A novel neural-based model for acoustic-articulatory inversion mapping

被引:0
|
作者
Hossein Behbood
Seyyed Ali Seyyedsalehi
Hamid Reza Tohidypour
Mojtaba Najafi
Shahriar Gharibzadeh
机构
[1] Amirkabir University of Technology,Department of Biomedical Engineering
[2] Azad University,undefined
来源
关键词
Bidirectional neural networks (BNNs); Feed-forward networks (FFNs); Time delay neural networks (TDNNs); MOCHA-TIMIT database; Acoustic-articulatory inversion mapping;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a new bidirectional neural network for better acoustic-articulatory inversion mapping is proposed. The model is motivated by the parallel structure of human brain, processing information by having forward--reverse connections. In other words, there would be a feedback from articulatory system to the acoustic signals emitted from that organ. Inspired by this mechanism, a new bidirectional model is developed to map speech representations to articulatory features. Formation of attractor dynamics in such bidirectional model is first carried out by training the reference speaker subspace as the continuous attractor. Then, it is used to recognize the other speaker’s speech. In fact, the structure and training of this bidirectional model is designed in such a way that the network learns to denoise the signal step by step, using properties of attractors it has formed. In this work, the efficiency of a nonlinear feedforward network is compared to the same one with a bidirectional connection. The bidirectional model increases the accuracy up to approximately 3% (from 62.09 to 64.91%) in the phone recognition process.
引用
收藏
页码:935 / 943
页数:8
相关论文
共 50 条
  • [1] A novel neural-based model for acoustic-articulatory inversion mapping
    Behbood, Hossein
    Seyyedsalehi, Seyyed Ali
    Tohidypour, Hamid Reza
    Najafi, Mojtaba
    Gharibzadeh, Shahriar
    [J]. NEURAL COMPUTING & APPLICATIONS, 2012, 21 (05): : 935 - 943
  • [2] A Trajectory Mixture Density Network for the Acoustic-Articulatory Inversion Mapping
    Richmond, Korin
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 577 - 580
  • [3] A Multitask Learning Perspective on Acoustic-Articulatory Inversion
    Richmond, Korin
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2356 - 2359
  • [4] ARTICULATORY COMPENSATION - STUDY OF AMBIGUITIES IN ACOUSTIC-ARTICULATORY MAPPING
    ATAL, B
    CHANG, JJ
    MATHEWS, MV
    TUKEY, JW
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 60 : S77 - S77
  • [5] Acoustic-articulatory mapping in vowels by locally weighted regression
    McGowan, Richard S.
    Berger, Michael A.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (04): : 2011 - 2032
  • [6] Speaker Adaptation of an Acoustic-Articulatory Inversion Model using Cascaded Gaussian Mixture Regressions
    Hueber, Thomas
    Bailly, Gerard
    Badin, Pierre
    Elisei, Frederic
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2752 - 2756
  • [7] ACOUSTIC-TO-ARTICULATORY INVERSION FOR DYSARTHRIC SPEECH BY USING CROSS-CORPUS ACOUSTIC-ARTICULATORY DATA
    Maharana, Sarthak Kumar
    Illa, Aravind
    Mannem, Renuka
    Belur, Yamini
    Shetty, Preetie
    Kumar, Veeramani Preethish
    Vengalil, Seena
    Polavarapu, Kiran
    Atchayaram, Nalini
    Ghosh, Prasanta Kumar
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6458 - 6462
  • [8] Trajectory mixture density networks with multiple mixtures for acoustic-articulatory inversion
    Richmond, Korin
    [J]. ADVANCES IN NONLINEAR SPEECH PROCESSING, 2007, 4885 : 263 - 272
  • [9] Acoustic-to-Articulatory Inversion Mapping based on Latent Trajectory Gaussian Mixture Model
    Tobing, Patrick Lumban
    Toda, Tomoki
    Kameoka, Hirokazu
    Nakamura, Satoshi
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 953 - 957
  • [10] A Comparison of Fitness Functions in a Genetic Algorithm for Acoustic-Articulatory Parameter Inversion of Vowels
    Drayton, Jared
    Miranda, Eduardo
    Kirke, Alexis
    [J]. PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 271 - 272