REVERBERANT SPEECH RECOGNITION: A PHONEME ANALYSIS

被引:0
|
作者
Parada, Pablo Peso [1 ]
Sharma, Dushyant [1 ]
Naylor, Patrick A. [2 ]
van Waterschoot, Toon [3 ]
机构
[1] Nuance Commun Inc, Marlow, Bucks, England
[2] Imperial Coll London, Dept Elect & Elect Engn, London, England
[3] Katholieke Univ Leuven, Dept Elect Engn ESAT STADIUS ETC, Leuven, Belgium
关键词
phone recognition; reverberation; confusability factor;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a phoneme confusion analysis that models the impact of reverberation on automatic speech recognition performance by formulating the problem in a Bayesian framework. Our analysis under reverberant conditions shows the relative robustness to reverberation of each phoneme and also indicates that substitutions and deletions correspond to the most common errors in a phoneme recognition task. Finally, a model is proposed to estimate the confusability of each phoneme depending on the reverberation level which is evaluated using two independent data sets.
引用
下载
收藏
页码:567 / 571
页数:5
相关论文
共 50 条
  • [41] HCRF-DRIVEN BEAMFORMING FOR REVERBERANT SPEECH RECOGNITION
    Hong, Wei-Tyng
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOL. 2, 2015, : 578 - 581
  • [42] Deep belief networks for phoneme recognition in continuous Tamil speech-an analysis
    Raguram, Laxmi Sree Baskaran
    Shanmugam, Vijaya Madhaya
    TRAITEMENT DU SIGNAL, 2017, 34 (3-4) : 137 - 151
  • [43] SPEECH RECOGNITION IN A NOISY AND REVERBERANT ENVIRONMENT WITH AND WITHOUT EARMUFFS
    PEKKARINEN, E
    VILJANEN, V
    SALMIVALLI, A
    SUONPAA, J
    AUDIOLOGY, 1990, 29 (05): : 286 - 293
  • [44] Speech/Non-Speech Segmentation Based on Phoneme Recognition Features
    Janez Žibert
    Nikola Pavešić
    France Mihelič
    EURASIP Journal on Advances in Signal Processing, 2006
  • [45] Speech/non-speech segmentation based on phoneme recognition features
    Zibert, Janez
    Pavesic, Nikola
    Mihelic, France
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
  • [46] Speech recognition through phoneme segmentation and neural classification
    Maeran, O
    Piuri, V
    Gajani, GS
    IMTC/97 - IEEE INSTRUMENTATION & MEASUREMENT TECHNOLOGY CONFERENCE: SENSING, PROCESSING, NETWORKING, PROCEEDINGS VOLS 1 AND 2, 1997, : 1215 - 1220
  • [47] Phoneme Set Design for Speech Recognition of English by Japanese
    Wang, Xiaoyun
    Zhang, Jinsong
    Nishida, Masafumi
    Yamamoto, Seiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (01): : 148 - 156
  • [48] Improved Phoneme-Based Myoelectric Speech Recognition
    Zhou, Quan
    Jiang, Ning
    Englehart, Kevin
    Hudgins, Bernard
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2009, 56 (08) : 2016 - 2023
  • [49] Phoneme and Sentence-Level Ensembles for Speech Recognition
    Dimitrakakis, Christos
    Bengio, Samy
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011,
  • [50] Neural networks for text-to-speech phoneme recognition
    Embrechts, MJ
    Arciniegas, F
    SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 3582 - 3587