DISCRIMINATIVE UNCERTAINTY ESTIMATION FOR NOISE ROBUST ASR

Cited by: 0
Authors
Tran, Dung T. [1, 2, 3]
Vincent, Emmanuel [1, 2, 3]
Jouvet, Denis [1, 2, 3]
Affiliations
[1] Inria, F-54600 Villers Les Nancy, France
[2] CNRS, LORIA, UMR 7503, F-54600 Villers Les Nancy, France
[3] Univ Lorraine, LORIA, UMR 7503, F-54600 Villers Les Nancy, France
Keywords
Automatic speech recognition; noise robustness; uncertainty handling; discriminative adaptation; SPEECH ENHANCEMENT; COMPENSATION; RECOGNITION;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
We consider the problem of uncertainty estimation for noise-robust ASR. Existing uncertainty estimation techniques improve ASR accuracy but they still exhibit a gap compared to the use of oracle uncertainty. This comes partly from the highly non-linear feature transformation and from additional assumptions such as Gaussian distribution and independence between frequency bins in the spectral domain. In this paper, we propose a method to rescale the estimated feature-domain full uncertainty covariance matrix in a state-dependent fashion according to a discriminative criterion. The state-dependent and feature index-dependent scaling factors are learned from development data. Experimental evaluation on Track 1 of the 2nd CHiME challenge data shows that discriminative rescaling leads to better results than generative rescaling. Moreover, discriminative rescaling of the Wiener uncertainty estimator leads to 12% relative word error rate reduction compared to discriminative rescaling of the alternative estimator in [1].
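The rescaling idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the standard uncertainty-decoding likelihood, in which the uncertainty covariance is added to the covariance of the acoustic model state, and it assumes that the state-dependent, feature index-dependent factors act as a diagonal transform on the full uncertainty covariance. The function names and the `state_scale` parameter are hypothetical, and the discriminative learning of the factors on development data is not shown.

```python
import numpy as np


def log_gaussian(x, mean, cov):
    """Log-density of a multivariate Gaussian N(x; mean, cov)."""
    d = x.shape[0]
    diff = x - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + maha)


def rescale_uncertainty(sigma_unc, scale):
    """Rescale a full uncertainty covariance with per-feature factors.

    Assumed form (not taken from the paper): diag(scale) @ sigma_unc @ diag(scale),
    which keeps the matrix symmetric and positive semi-definite.
    """
    return sigma_unc * np.outer(scale, scale)


def uncertainty_decoding_loglik(x_hat, sigma_unc, state_mean, state_cov, state_scale):
    """Log-likelihood of an enhanced feature vector under one Gaussian state.

    The rescaled uncertainty covariance is added to the state covariance,
    so features with larger uncertainty contribute less sharply.
    """
    sigma_rescaled = rescale_uncertainty(sigma_unc, state_scale)
    return log_gaussian(x_hat, state_mean, state_cov + sigma_rescaled)


# Toy usage with 2-dimensional features and a single Gaussian state.
x_hat = np.zeros(2)
sigma_unc = np.eye(2)
state_mean = np.zeros(2)
state_cov = np.eye(2)
state_scale = np.ones(2)  # hypothetical factors; learned from data in practice
loglik = uncertainty_decoding_loglik(x_hat, sigma_unc, state_mean, state_cov, state_scale)
```

Setting every factor to zero recovers conventional decoding without uncertainty, and setting every factor to one uses the estimated uncertainty unchanged.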
Pages: 5038-5042
Number of pages: 5
Related papers
50 in total
  • [1] Nonparametric Uncertainty Estimation and Propagation for Noise Robust ASR
    Tran, Dung T.
    Vincent, Emmanuel
    Jouvet, Denis
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1835 - 1846
  • [2] Discriminative Classifiers with Generative Kernels for Noise Robust ASR
    Gales, M. J. F.
    Longworth, C.
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1996 - 1999
  • [3] FUSION OF MULTIPLE UNCERTAINTY ESTIMATORS AND PROPAGATORS FOR NOISE ROBUST ASR
    Tran, Dung T.
    Vincent, Emmanuel
    Jouvet, Denis
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] EXTENSION OF UNCERTAINTY PROPAGATION TO DYNAMIC MFCCS FOR NOISE ROBUST ASR
    Tran, Dung T.
    Vincent, Emmanuel
    Jouvet, Denis
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [5] Noise robust ASR
    Viikki, O
    [J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 1 - 2
  • [6] AN EXTENDED EXPERIMENTAL INVESTIGATION OF DNN UNCERTAINTY PROPAGATION FOR NOISE ROBUST ASR
    Nathwani, Karan
    Morales-Cordovilla, Juan A.
    Sivasankaran, Sunit
    Illina, Irina
    Vincent, Emmanuel
    [J]. 2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 26 - 30
  • [7] Assessing local noise level estimation methods: Application to noise robust ASR
    Ris, C
    Dupont, S
    [J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 141 - 158
  • [8] DNN Uncertainty Propagation Using GMM-Derived Uncertainty Features for Noise Robust ASR
    Nathwani, Karan
    Vincent, Emmanuel
    Illina, Irina
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (03) : 338 - 342
  • [9] Model-based feature enhancement with uncertainty decoding for noise robust ASR
    Stouten, Veronique
    Van hamme, Hugo
    Wambacq, Patrick
    [J]. SPEECH COMMUNICATION, 2006, 48 (11) : 1502 - 1514
  • [10] Noise robust discriminative models
    Le, Q
    Bengio, S
    [J]. Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, Vols 1 and 2, 2004, : 375 - 378