Combining feature compensation and Weighted Viterbi Decoding for noise robust speech recognition with limited adaptation data

Cited by: 0
Authors
Cui, XD [1]
Alwan, A [1]
Affiliations
[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 USA
Keywords
DOI
Not available
CLC classification number
O42 [Acoustics];
Subject classification codes
070206; 082403;
Abstract
Acoustic models trained with clean speech signals suffer in the presence of background noise. In some situations, only a limited amount of noisy data from the new environment is available for adapting the clean models. This paper proposes a feature compensation approach employing polynomial regression of the signal-to-noise ratio (SNR). While the clean acoustic models remain unchanged, a bias that is a polynomial function of utterance SNR is estimated and removed from the noisy features. Depending on the amount of noisy data available, the algorithm can be carried out flexibly at different levels of granularity. Based on the Euclidean distance, the similarity between the residual distribution and the clean models is estimated and used as the confidence factor in a back-end Weighted Viterbi Decoding (WVD) algorithm. With limited amounts of noisy data, the feature compensation algorithm outperforms Maximum Likelihood Linear Regression (MLLR) on the Aurora2 database. Weighted Viterbi decoding further improves recognition accuracy.
Pages: 969-972
Number of pages: 4
Related papers
50 in total
  • [41] The perceptual wavelet feature for noise robust Vietnamese speech recognition
    Trung, Nguyen Quoc
    Nghia, Phung Trung
    2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 255 - +
  • [42] Combining log-spectral domain compensation with MVA feature post-processing for robust speech recognition
    Lei, Jianjun
    Wang, Jian
    Guo, Jun
    Liu, Gang
    Shen, Haifeng
    IIH-MSP: 2006 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PROCEEDINGS, 2006, : 663 - +
  • [43] A data-driven model parameter compensation method for noise-robust speech recognition
    Chung, YJ
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 432 - 434
  • [44] Confusion-Based Entropy-Weighted Decoding for Robust Speech Recognition
    Chen, Yi
    Wan, Chia-yu
    Lee, Lin-shan
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1008 - 1011
  • [45] A novel HMM model adaptation and compensation method for robust speech recognition
    Ning, GX
    Wei, G
    INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2005, VOLS 1 AND 2, PROCEEDINGS, 2005, : 274 - 277
  • [46] Robust speech recognition with on-line unsupervised acoustic feature compensation
    Buera, Luis
    Miguel, Antonio
    Lleida, Eduardo
    Saz, Oscar
    Ortega, Alfonso
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 105 - 110
  • [47] Nonlinear noise compensation in feature domain for speech recognition with numerical methods
    Jiang, H
    Wang, Q
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 985 - 988
  • [48] COMBINING SPEAKER AND NOISE FEATURE NORMALIZATION TECHNIQUES FOR AUTOMATIC SPEECH RECOGNITION
    Garcia, L.
    Benitez, C.
    Segura, J. C.
    Umesh, S.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5496 - 5499
  • [49] Psychoacoustic Model Compensation for Robust Continuous Speech Recognition in Additive Noise
    Das, Biswajit
    Panda, Ashish
    2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 511 - 515
  • [50] Target Speech GMM-based Spectral Compensation for Noise Robust Speech Recognition
    Shinozaki, Takahiro
    Furui, Sadaoki
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1223 - 1226