REVERBERANT SPEECH RECOGNITION: A PHONEME ANALYSIS

被引：0

作者：

Parada, Pablo Peso ^{[1
]}

Sharma, Dushyant ^{[1
]}

Naylor, Patrick A. ^{[2
]}

van Waterschoot, Toon ^{[3
]}

机构：

[1] Nuance Commun Inc, Marlow, Bucks, England

[2] Imperial Coll London, Dept Elect & Elect Engn, London, England

[3] Katholieke Univ Leuven, Dept Elect Engn ESAT STADIUS ETC, Leuven, Belgium

来源：

2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP) | 2014年

关键词：

phone recognition; reverberation; confusability factor;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present a phoneme confusion analysis that models the impact of reverberation on automatic speech recognition performance by formulating the problem in a Bayesian framework. Our analysis under reverberant conditions shows the relative robustness to reverberation of each phoneme and also indicates that substitutions and deletions correspond to the most common errors in a phoneme recognition task. Finally, a model is proposed to estimate the confusability of each phoneme depending on the reverberation level which is evaluated using two independent data sets.

引用

下载

页码：567 / 571

页数：5

共 50 条

[41] HCRF-DRIVEN BEAMFORMING FOR REVERBERANT SPEECH RECOGNITION
Hong, Wei-Tyng
PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOL. 2, 2015, : 578 - 581
[42] Deep belief networks for phoneme recognition in continuous Tamil speech-an analysis
Raguram, Laxmi Sree Baskaran
Shanmugam, Vijaya Madhaya
TRAITEMENT DU SIGNAL, 2017, 34 (3-4) : 137 - 151
[43] SPEECH RECOGNITION IN A NOISY AND REVERBERANT ENVIRONMENT WITH AND WITHOUT EARMUFFS
PEKKARINEN, E
VILJANEN, V
SALMIVALLI, A
SUONPAA, J
AUDIOLOGY, 1990, 29 (05): : 286 - 293
[44] Speech/Non-Speech Segmentation Based on Phoneme Recognition Features
Janez Žibert
Nikola Pavešić
France Mihelič
EURASIP Journal on Advances in Signal Processing, 2006
[45] Speech/non-speech segmentation based on phoneme recognition features
Zibert, Janez
Pavesic, Nikola
Mihelic, France
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
[46] Speech recognition through phoneme segmentation and neural classification
Maeran, O
Piuri, V
Gajani, GS
IMTC/97 - IEEE INSTRUMENTATION & MEASUREMENT TECHNOLOGY CONFERENCE: SENSING, PROCESSING, NETWORKING, PROCEEDINGS VOLS 1 AND 2, 1997, : 1215 - 1220
[47] Phoneme Set Design for Speech Recognition of English by Japanese
Wang, Xiaoyun
Zhang, Jinsong
Nishida, Masafumi
Yamamoto, Seiichi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (01): : 148 - 156
[48] Improved Phoneme-Based Myoelectric Speech Recognition
Zhou, Quan
Jiang, Ning
Englehart, Kevin
Hudgins, Bernard
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2009, 56 (08) : 2016 - 2023
[49] Phoneme and Sentence-Level Ensembles for Speech Recognition
Dimitrakakis, Christos
Bengio, Samy
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011,
[50] Neural networks for text-to-speech phoneme recognition
Embrechts, MJ
Arciniegas, F
SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 3582 - 3587

← 1 2 3 4 5 →