Improved robustness of noisy speech HMMs based on weighted variance expansion

Cited by: 0
Authors
Kanno, S [1]
Funada, T [1]
Affiliations
[1] Kanazawa Univ, Ind Res Inst Ishikawa, Kanazawa, Ishikawa 9200223, Japan
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Under field conditions, non-stationary noise often causes the noise spectrum and the SNR to vary abruptly. Speech recognition performance degrades rapidly when the noise conditions during recognition differ from those during training or adaptation, so HMMs must be made robust to abrupt variations in noise. In this paper, we propose a method that modifies the output probability at noise-sensitive states by using weighted variance expansion based on the power of the state or of the probability distribution, in order to improve performance. The effectiveness of the method was examined for two types of noisy speech HMMs (one trained with a specific SNR, the other trained with five kinds of SNRs) through speaker-independent word recognition experiments using noise from two factories. The results show that the method improved the robustness of the HMMs against variations in noise conditions (noise type and SNR).
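The abstract does not give the expansion formula, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a diagonal-covariance Gaussian output distribution and a hypothetical rule that scales each variance by (1 + alpha * weight); the names `expand_variance`, `weight`, and `alpha` are illustrative assumptions. It shows that a broadened distribution penalizes a mismatched observation less, at the cost of a lower score for a well-matched one.

```python
import math

def diag_gauss_loglik(x, mean, var):
    """Log-likelihood of x under a diagonal-covariance Gaussian."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def expand_variance(var, weight, alpha=1.0):
    """Hypothetical expansion rule (an assumption, not taken from the paper):
    scale every variance by (1 + alpha * weight), where weight would be
    derived from the power of the state or distribution."""
    return [v * (1.0 + alpha * weight) for v in var]

# Toy state: zero mean, unit variance in two dimensions.
mean = [0.0, 0.0]
var = [1.0, 1.0]
expanded = expand_variance(var, weight=1.0)  # variances become 2.0

matched = [0.0, 0.0]      # observation consistent with training conditions
mismatched = [3.0, -3.0]  # observation shifted by unseen noise

ll_orig_matched = diag_gauss_loglik(matched, mean, var)
ll_exp_matched = diag_gauss_loglik(matched, mean, expanded)
ll_orig_mismatched = diag_gauss_loglik(mismatched, mean, var)
ll_exp_mismatched = diag_gauss_loglik(mismatched, mean, expanded)

print(ll_orig_matched, ll_exp_matched)
print(ll_orig_mismatched, ll_exp_mismatched)
```

In this toy setting the expanded variance raises the log-likelihood of the mismatched observation and lowers that of the matched one, which is the trade-off a per-state weight would have to balance.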
Pages: 556-559
Number of pages: 4
Related papers
50 in total
  • [41] Pitch Detection Method for Noisy Speech Signals Based on Pre-Filter and Weighted Wavelet coefficients
    Li, Ru-wei
    Bao, Chang-chun
    Dou, Hui-jing
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 530 - 533
  • [42] Improved mean and variance normalization for robust speech recognition
    Jain, P
    Hermansky, H
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4015 - 4015
  • [43] Speech recognition in nonstationary noise based on parallel HMMs and spectral subtraction
    Mine, R
    Kobayashi, T
    Shirai, K
    SYSTEMS AND COMPUTERS IN JAPAN, 1996, 27 (14) : 37 - 44
  • [44] An Improved Algorithm for Partial Fraction Expansion Based Frequency Weighted Balanced Truncation
    Muda, Wan Mariam Wan
    Sreeram, Victor
    Iu, Herbert Ho-Ching
    2011 AMERICAN CONTROL CONFERENCE, 2011, : 5037 - 5042
  • [45] Improved tactile speech perception and noise robustness using audio-to-tactile sensory substitution with amplitude envelope expansion
    Fletcher, Mark D.
    Akis, Esma
    Verschuur, Carl A.
    Perry, Samuel W.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [46] An Improved Parallel Model Combination Method for Noisy Speech Recognition
    Veisi, Hadi
    Sameti, Hossein
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 237 - 242
  • [47] An improved noisy channel model for speech recognition error correction
    Li, Baoxiang
    Liu, Gang
    Guo, Jun
    Lu, Yueming
    International Journal of Advancements in Computing Technology, 2012, 4 (12) : 110 - 118
  • [48] Noisy speech enhancement based on improved minimum statistics incorporating acoustic environment-awareness
    Chang, Joon-Hyuk
    DIGITAL SIGNAL PROCESSING, 2013, 23 (04) : 1233 - 1238
  • [49] Prediction of Intelligibility of Noisy and Time-Frequency Weighted Speech based on Mutual Information Between Amplitude Envelopes
    Jensen, Jesper
    Taal, Cees H.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1173 - 1177
  • [50] Enhancement of noisy speech signal based on variance and modified gain function with PDE preprocessing technique for digital hearing aid
    Deepa, D.
    Shanmugam, A.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2011, 70 (05) : 332 - 337