Combining feature compensation and Weighted Viterbi Decoding for noise robust speech recognition with limited adaptation data

Authors
Cui, XD [1]
Alwan, A [1]
Affiliations
[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 USA
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
Acoustic models trained with clean speech signals suffer in the presence of background noise. In some situations, only a limited amount of noisy data from the new environment is available for adapting the clean models. A feature compensation approach employing polynomial regression of the signal-to-noise ratio (SNR) is proposed in this paper. While the clean acoustic models remain unchanged, a bias that is a polynomial function of the utterance SNR is estimated and removed from the noisy features. Depending on the amount of noisy data available, the algorithm can be carried out flexibly at different levels of granularity. Based on the Euclidean distance, the similarity between the residual distribution and the clean models is estimated and used as the confidence factor in a back-end Weighted Viterbi Decoding (WVD) algorithm. With limited amounts of noisy data, the feature compensation algorithm outperforms Maximum Likelihood Linear Regression (MLLR) on the Aurora2 database. Weighted Viterbi decoding further improves recognition accuracy.
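The abstract does not give the formulas, so the following is only a minimal sketch of the two ideas it names, not the authors' implementation. It assumes that the bias is a separate polynomial in the utterance SNR for each feature dimension, and that the confidence factor enters the Viterbi search as a per-frame weight in [0, 1] multiplying the emission log-likelihood (a common form of weighted Viterbi decoding). The function names `remove_bias` and `weighted_viterbi`, the coefficient layout, and all numbers are hypothetical; estimating the coefficients and the Euclidean-distance-based confidence is not shown.

```python
import math


def remove_bias(feature, snr_db, coeffs):
    """Subtract a polynomial-in-SNR bias from one feature vector.

    coeffs[d][k] is the assumed k-th order coefficient for dimension d.
    """
    return [
        x - sum(c * snr_db ** k for k, c in enumerate(coeffs[d]))
        for d, x in enumerate(feature)
    ]


def weighted_viterbi(log_init, log_trans, log_emit, weights):
    """Viterbi search where frame t's emission log-likelihood is scaled by weights[t].

    Scaling in the log domain corresponds to raising the emission
    likelihood to the power weights[t]; a weight of 0 ignores the frame.
    """
    n_states = len(log_init)
    score = [log_init[j] + weights[0] * log_emit[0][j] for j in range(n_states)]
    backptr = []
    for t in range(1, len(log_emit)):
        new_score, ptr = [], []
        for j in range(n_states):
            # Best predecessor of state j, judged on path score plus transition.
            best_i = max(range(n_states), key=lambda i: score[i] + log_trans[i][j])
            ptr.append(best_i)
            new_score.append(
                score[best_i] + log_trans[best_i][j] + weights[t] * log_emit[t][j]
            )
        score = new_score
        backptr.append(ptr)
    # Trace the best path back from the best final state.
    state = max(range(n_states), key=lambda j: score[j])
    best_score = score[state]
    path = [state]
    for ptr in reversed(backptr):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path, best_score


def _log_matrix(rows):
    return [[math.log(p) for p in row] for row in rows]


# Toy two-state example: the middle frame is corrupted and strongly favours state 1.
LOG_INIT = [math.log(0.5), math.log(0.5)]
LOG_TRANS = _log_matrix([[0.9, 0.1], [0.1, 0.9]])
LOG_EMIT = _log_matrix([[0.8, 0.2], [0.001, 0.999], [0.8, 0.2]])

unweighted_path, _ = weighted_viterbi(LOG_INIT, LOG_TRANS, LOG_EMIT, [1.0, 1.0, 1.0])
weighted_path, _ = weighted_viterbi(LOG_INIT, LOG_TRANS, LOG_EMIT, [1.0, 0.0, 1.0])
```

In this toy setting, giving the unreliable middle frame a weight of zero changes the decoded state sequence from all state 1 to all state 0, which illustrates how a low confidence factor limits the influence of a poorly compensated frame on the search.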
Pages: 969-972
Page count: 4