REVERBERANT SPEECH RECOGNITION: A PHONEME ANALYSIS

被引:0
|
作者
Parada, Pablo Peso [1 ]
Sharma, Dushyant [1 ]
Naylor, Patrick A. [2 ]
van Waterschoot, Toon [3 ]
机构
[1] Nuance Commun Inc, Marlow, Bucks, England
[2] Imperial Coll London, Dept Elect & Elect Engn, London, England
[3] Katholieke Univ Leuven, Dept Elect Engn ESAT STADIUS ETC, Leuven, Belgium
关键词
phone recognition; reverberation; confusability factor;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a phoneme confusion analysis that models the impact of reverberation on automatic speech recognition performance by formulating the problem in a Bayesian framework. Our analysis under reverberant conditions shows the relative robustness to reverberation of each phoneme and also indicates that substitutions and deletions correspond to the most common errors in a phoneme recognition task. Finally, a model is proposed to estimate the confusability of each phoneme depending on the reverberation level which is evaluated using two independent data sets.
引用
收藏
页码:567 / 571
页数:5
相关论文
共 50 条
  • [1] Modulation spectrum analysis for recognition of reverberant speech
    Mallidi, Sri Harish
    Ganapathy, Sriram
    Hermansky, Hynek
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 196 - 199
  • [2] PHONEME GROUPING FOR SPEECH RECOGNITION
    REDDY, DR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1967, 41 (05): : 1295 - &
  • [3] Confusion analysis in phoneme based speech recognition in Hindi
    Bhatt, Shobha
    Dev, Amita
    Jain, Anurag
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (10) : 4213 - 4238
  • [4] Confusion analysis in phoneme based speech recognition in Hindi
    Bhatt, Shobha
    Dev, Amita
    Jain, Anurag
    [J]. Bhatt, Shobha (bhattsho@gmail.com), 1600, Springer Science and Business Media Deutschland GmbH (11): : 4213 - 4238
  • [5] Confusion analysis in phoneme based speech recognition in Hindi
    Shobha Bhatt
    Amita Dev
    Anurag Jain
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 4213 - 4238
  • [6] MULTIRESOLUTION CNN FOR REVERBERANT SPEECH RECOGNITION
    Park, Sunchan
    Jeong, Yongwon
    Kim, Hyung Soon
    [J]. 2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA), 2017, : 214 - 217
  • [7] Clustering Approach in Speech Phoneme Recognition Based on Statistical Analysis
    Tak, Gaurav Kumar
    Bhargava, Vaibhav
    [J]. RECENT TRENDS IN NETWORK SECURITY AND APPLICATIONS, 2010, 89 : 483 - 489
  • [8] The Gamma MLP for speech phoneme recognition
    Lawrence, S
    Tsoi, AC
    Back, AD
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 785 - 791
  • [9] Reverberant Speech Recognition Based on Denoising Autoencoder
    Ishii, Takaaki
    Komiyama, Hiroki
    Shinozaki, Takahiro
    Horiuchi, Yasuo
    Kuroiwa, Shingo
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3479 - 3483
  • [10] A STUDY ON DATA AUGMENTATION OF REVERBERANT SPEECH FOR ROBUST SPEECH RECOGNITION
    Ko, Tom
    Peddinti, Vijayaditya
    Povey, Daniel
    Seltzer, Michael L.
    Khudanpur, Sanjeev
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5220 - 5224