COMPARISON OF NOISE ROBUST METHODS IN LARGE VOCABULARY SPEECH RECOGNITION

被引:0
|
作者
Keronen, Sami [1 ]
Remes, Ulpu [1 ]
Palomaki, Kalle J. [1 ]
Virtanen, Tuomas [2 ]
Kurimo, Mikko [1 ]
机构
[1] Aalto Univ, Adapt Informat Res Ctr, FI-00076 Aalto, Finland
[2] Tampere Univ Technol, Dept Signal Proc, FI-33101 Tampere, Finland
关键词
TAYLOR-SERIES APPROACH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a comparison of three fundamentally different noise robust approaches is carried out. The recognition performances of multicondition training, Data-driven Parallel Model Combination (DPMC), and cluster-based missing data reconstruction methods implemented in a large vocabulary continuous speech recognition system are evaluated with Finnish language speech data consisting of real recordings in noisy environments. All three methods improve the recognition accuracy substantially in poor signal-to-noise ratio (SNR) conditions when compared to a baseline system trained on clean speech. DPMC and missing data reconstruction systems give the best performance on high SNR conditions. On low SNR conditions, the performance of multicondition trained system is ranked the best, DPMC the second best and missing data reconstruction the third.
引用
收藏
页码:1973 / 1977
页数:5
相关论文
共 50 条
  • [1] Quantile based histogram equalization for noise robust large vocabulary speech recognition
    Hilger, F
    Ney, H
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 845 - 854
  • [2] NORMALIZED AMPLITUDE MODULATION FEATURES FOR LARGE VOCABULARY NOISE-ROBUST SPEECH RECOGNITION
    Mitra, Vikramjit
    Franco, Horacio
    Graciarena, Martin
    Mandal, Arindam
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4117 - 4120
  • [3] A Comparison of Deep Neural Network Training Methods for Large Vocabulary Speech Recognition
    Toth, Laszlo
    Grosz, Tamas
    [J]. TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 36 - 43
  • [4] Robust noise suppression methods in speech recognition
    Cui, Yi
    Zhang, Dong
    Shi, Liangping
    Chen, Liyuan
    [J]. Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 1998, 21 (02): : 10 - 14
  • [5] METHODS TOWARDS THE VERY LARGE VOCABULARY CHINESE SPEECH RECOGNITION
    Department of Electronic Engineering, Tsinghna University, Beijing
    100084, China
    [J]. Eur. Conf. Speech Commun. Technol., EUROSPEECH, 1600, (215-218):
  • [6] Large vocabulary speech recognition in French
    Adda-Decker, M
    Adda, G
    Gauvain, JL
    Lamel, L
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 45 - 48
  • [7] Advances in Large Vocabulary Speech Recognition
    Gauvain, JL
    De Mori, R
    Lamel, L
    [J]. COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01): : 1 - 3
  • [8] Large vocabulary speech recognition in French
    Adda-Decker, Martine
    Adda, Gilles
    Gauvain, Jean-Luc
    Lamel, Lori
    [J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 45 - 48
  • [9] Methods for Robust Speech Recognition in Reverberant Environments: A Comparison
    Petrick, Rico
    Feher, Thomas
    Unoki, Masashi
    Hoffmann, Ruediger
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 582 - +
  • [10] Evaluation of Modulation Spectrum Equalization Techniques for Large Vocabulary Robust Speech Recognition
    Sun, Liang-che
    Hsu, Chang-wen
    Lee, Lin-shan
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1004 - 1007