COMPARISON OF NOISE ROBUST METHODS IN LARGE VOCABULARY SPEECH RECOGNITION

Cited by: 0

Authors:

Keronen, Sami [1]

Remes, Ulpu [1]

Palomaki, Kalle J. [1]

Virtanen, Tuomas [2]

Kurimo, Mikko [1]

Affiliations:

[1] Aalto Univ, Adapt Informat Res Ctr, FI-00076 Aalto, Finland

[2] Tampere Univ Technol, Dept Signal Proc, FI-33101 Tampere, Finland

Source:

18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010) | 2010

Keywords:

TAYLOR-SERIES APPROACH;

DOI:

Not available

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

This paper compares three fundamentally different noise-robust approaches: multicondition training, Data-driven Parallel Model Combination (DPMC), and cluster-based missing data reconstruction. Each method is implemented in a large vocabulary continuous speech recognition system and evaluated on Finnish speech data consisting of real recordings in noisy environments. All three methods substantially improve recognition accuracy in poor signal-to-noise ratio (SNR) conditions compared with a baseline system trained on clean speech. The DPMC and missing data reconstruction systems give the best performance in high-SNR conditions. In low-SNR conditions, the multicondition-trained system ranks best, DPMC second, and missing data reconstruction third.

Pages: 1973-1977

Page count: 5
