Combining feature compensation and Weighted Viterbi Decoding for noise robust speech recognition with limited adaptation data

Cited by: 0
Authors
Cui, XD [1]
Alwan, A [1]
Affiliations
[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Acoustic models trained on clean speech signals degrade in the presence of background noise. In some situations, only a limited amount of noisy data from the new environment is available for adapting the clean models. This paper proposes a feature compensation approach that employs polynomial regression of the signal-to-noise ratio (SNR). While the clean acoustic models remain unchanged, a bias that is a polynomial function of the utterance SNR is estimated and removed from the noisy features. Depending on the amount of noisy data available, the algorithm can be carried out flexibly at different levels of granularity. The similarity between the residual distribution and the clean models is estimated from the Euclidean distance and used as the confidence factor in a back-end Weighted Viterbi Decoding (WVD) algorithm. With limited amounts of noisy data, the feature compensation algorithm outperforms Maximum Likelihood Linear Regression (MLLR) on the Aurora2 database. Weighted Viterbi decoding further improves recognition accuracy.
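The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the bias polynomial is estimated, so the sketch assumes paired clean and noisy feature vectors and fits the polynomial by ordinary least squares. The decoder scales each frame's observation log-likelihood by a confidence weight, which is one common formulation of weighted Viterbi decoding. The weights are taken as given rather than derived from a Euclidean distance, and every function name and parameter here is hypothetical.

```python
import numpy as np


def fit_bias_polynomial(snr_db, noisy, clean, order=2):
    """Fit bias(snr) = sum_k c_k * snr**k per feature dimension.

    Assumption: paired noisy/clean features are available and the fit is
    ordinary least squares (a stand-in, not the paper's estimator).
    snr_db: (N,) utterance SNRs; noisy, clean: (N, D) features.
    Returns coefficients of shape (order + 1, D), lowest power first.
    """
    powers = np.vander(snr_db, order + 1, increasing=True)  # (N, order + 1)
    coeffs, *_ = np.linalg.lstsq(powers, noisy - clean, rcond=None)
    return coeffs


def compensate(noisy, snr_db, coeffs):
    """Remove the SNR-dependent bias from the noisy features."""
    powers = np.vander(np.atleast_1d(snr_db), coeffs.shape[0], increasing=True)
    return noisy - powers @ coeffs


def weighted_viterbi(log_obs, log_trans, log_init, weights):
    """Viterbi decoding with per-frame confidence weights.

    log_obs: (T, S) frame log-likelihoods; log_trans: (S, S), indexed
    [previous, current]; log_init: (S,); weights: (T,) values in [0, 1].
    A weight of 0 makes a frame uninformative, 1 leaves it unchanged.
    Returns the best state sequence as a list of ints.
    """
    T, S = log_obs.shape
    delta = log_init + weights[0] * log_obs[0]
    back = np.zeros((T, S), dtype=int)
    for t in range(1, T):
        scores = delta[:, None] + log_trans  # rows: previous, cols: current
        back[t] = np.argmax(scores, axis=0)
        delta = scores[back[t], np.arange(S)] + weights[t] * log_obs[t]
    path = [int(np.argmax(delta))]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return path[::-1]
```

In this sketch, a frame with a low weight contributes little to the path score, so the transition probabilities dominate the decision at that frame.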
Pages: 969-972
Number of pages: 4