Estimating Uncertainty to Improve Exemplar-Based Feature Enhancement for Noise Robust Speech Recognition

被引:6
|
作者
Kallasjoki, Heikki [1 ]
Gemmeke, Jort F. [2 ]
Palomaki, Kalle J. [1 ]
机构
[1] Aalto Univ, Sch Elect Engn, Dept Signal Proc & Acoust, FI-00076 Aalto, Finland
[2] KU Leuven ESAT PSI, Dept Elect Engn, B-3001 Heverlee, Belgium
基金
芬兰科学院;
关键词
Exemplar-based; noise robustness; observation uncertainty; speech recognition; uncertainty estimation;
D O I
10.1109/TASLP.2013.2292328
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a method of improving automatic speech recognition performance under noisy conditions by using a source separation approach to extract the underlying clean speech signal. The feature enhancement processing is complemented with heuristic estimates of the uncertainty of the source separation, that are used to further assist the recognition. The uncertainty heuristics are converted to estimates of variance for the extracted clean speech using a Gaussian Mixture Model based mapping, and applied in the decoding stage under the observation uncertainty framework. We propose six heuristics, and evaluate them using both artificial and real-world noisy data, and with acoustic models trained on clean speech, a multi-condition noisy data set, and the multi-condition set processed with the source separation front-end. Taking the uncertainty of the enhanced features into account is shown to improve recognition performance when the acoustic models are trained on unenhanced data, while training on enhanced noisy data yields the lowest error rates.
引用
收藏
页码:368 / 380
页数:13
相关论文
共 50 条
  • [1] Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition
    Gemmeke, Jort F.
    Virtanen, Tuomas
    Hurmalainen, Antti
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 2067 - 2080
  • [2] NOISE ROBUST EXEMPLAR-BASED CONNECTED DIGIT RECOGNITION
    Gemmeke, Jort F.
    Virtanen, Tuomas
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4546 - 4549
  • [3] SEMI-SUPERVISED NOISE DICTIONARY ADAPTATION FOR EXEMPLAR-BASED NOISE ROBUST SPEECH RECOGNITION
    Luan, Yi
    Saito, Daisuke
    Kashiwagi, Yosuke
    Minematsu, Nobuaki
    Hirose, Keikichi
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] EXEMPLAR-BASED NOISE ROBUST AUTOMATIC SPEECH RECOGNITION USING MODULATION SPECTROGRAM FEATURES
    Baby, Deepak
    Virtanen, Tuomas
    Gemmeke, Jort F.
    Barker, Tom
    Van Hamme, Hugo
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 519 - 524
  • [5] Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition
    Baby, Deepak
    Virtanen, Tuomas
    Gemmeke, Jort F.
    van Hamme, Hugo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1788 - 1799
  • [6] HYBRID INPUT SPACES FOR EXEMPLAR-BASED NOISE ROBUST SPEECH RECOGNITION USING COUPLED DICTIONARIES
    Baby, Deepak
    Van Hamme, Hugo
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1676 - 1680
  • [7] Exemplar-Based Processing for Speech Recognition
    Sainath, Tara N.
    Ramabhadran, Bhuvana
    Nahamoo, David
    Kanevsky, Dimitri
    Van Compernolle, Dirk
    Demuynck, Kris
    Gemmeke, Jort Florent
    Bellegarda, Jerome R.
    Sundaram, Shiva
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 98 - 113
  • [8] EXEMPLAR-BASED SPEECH ENHANCEMENT FOR DEEP NEURAL NETWORK BASED AUTOMATIC SPEECH RECOGNITION
    Baby, Deepak
    Gemmeke, Jort F.
    Virtanen, Tuomas
    Van hamme, Hugo
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4485 - 4489
  • [9] Noise Robust Exemplar Matching for Speech Enhancement: Applications to Automatic Speech Recognition
    Yilmaz, Emre
    Baby, Deepak
    Van Hannne, Hugo
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 688 - 692
  • [10] ROBUST INTERNAL EXEMPLAR-BASED IMAGE ENHANCEMENT
    Xian, Yang
    Tian, Yingli
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2379 - 2383