DISCRIMINATIVE UNCERTAINTY ESTIMATION FOR NOISE ROBUST ASR

Cited by: 0

Authors:

Tran, Dung T. [1,2,3]

Vincent, Emmanuel [1,2,3]

Jouvet, Denis [1,2,3]

Affiliations:

[1] Inria, F-54600 Villers Les Nancy, France

[2] CNRS, LORIA, UMR 7503, F-54600 Villers Les Nancy, France

[3] Univ Lorraine, LORIA, UMR 7503, F-54600 Villers Les Nancy, France

Source:

2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015

Keywords:

Automatic speech recognition; noise robustness; uncertainty handling; discriminative adaptation; SPEECH ENHANCEMENT; COMPENSATION; RECOGNITION

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

We consider the problem of uncertainty estimation for noise-robust ASR. Existing uncertainty estimation techniques improve ASR accuracy but they still exhibit a gap compared to the use of oracle uncertainty. This comes partly from the highly non-linear feature transformation and from additional assumptions such as Gaussian distribution and independence between frequency bins in the spectral domain. In this paper, we propose a method to rescale the estimated feature-domain full uncertainty covariance matrix in a state-dependent fashion according to a discriminative criterion. The state-dependent and feature index-dependent scaling factors are learned from development data. Experimental evaluation on Track 1 of the 2nd CHiME challenge data shows that discriminative rescaling leads to better results than generative rescaling. Moreover, discriminative rescaling of the Wiener uncertainty estimator leads to 12% relative word error rate reduction compared to discriminative rescaling of the alternative estimator in [1].
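The rescaling idea in the abstract can be illustrated with a minimal sketch. It assumes standard uncertainty decoding, where the estimated uncertainty covariance is added to the covariance of a Gaussian state, and it assumes the per-feature scaling factors are applied as D Σ D with D = diag(sqrt(scale)). That exact form, the function names, and the parameter names are illustrative assumptions and are not taken from the paper; the discriminative learning of the factors from development data is not shown.

```python
import numpy as np

def rescale_uncertainty(sigma_hat, scale):
    """Rescale a full uncertainty covariance with per-feature factors.

    Assumed form (not from the paper): D @ sigma_hat @ D with
    D = diag(sqrt(scale)), which keeps the matrix symmetric and PSD.
    """
    d = np.sqrt(np.asarray(scale, dtype=float))
    # Element-wise product with the outer product equals D @ sigma_hat @ D.
    return np.asarray(sigma_hat, dtype=float) * np.outer(d, d)

def uncertainty_decoding_loglik(x_hat, sigma_hat, mean, cov, scale):
    """Log-likelihood of an enhanced feature vector under one Gaussian state.

    The rescaled uncertainty is added to the state covariance; `scale`
    stands for the hypothetical scaling factors of that state.
    """
    total = np.asarray(cov, dtype=float) + rescale_uncertainty(sigma_hat, scale)
    diff = np.asarray(x_hat, dtype=float) - np.asarray(mean, dtype=float)
    _, logdet = np.linalg.slogdet(total)
    maha = diff @ np.linalg.solve(total, diff)
    return -0.5 * (diff.size * np.log(2.0 * np.pi) + logdet + maha)
```

In this sketch, a scale of all ones leaves the estimated uncertainty unchanged, and a scale of all zeros falls back to conventional decoding without uncertainty.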

Pages: 5038-5042

Number of pages: 5

50 items in total

[1] Nonparametric Uncertainty Estimation and Propagation for Noise Robust ASR
Tran, Dung T.
Vincent, Emmanuel
Jouvet, Denis
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1835 - 1846
[2] Discriminative Classifiers with Generative Kernels for Noise Robust ASR
Gales, M. J. F.
Longworth, C.
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1996 - 1999
[3] FUSION OF MULTIPLE UNCERTAINTY ESTIMATORS AND PROPAGATORS FOR NOISE ROBUST ASR
Tran, Dung T.
Vincent, Emmanuel
Jouvet, Denis
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[4] EXTENSION OF UNCERTAINTY PROPAGATION TO DYNAMIC MFCCS FOR NOISE ROBUST ASR
Tran, Dung T.
Vincent, Emmanuel
Jouvet, Denis
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[5] Noise robust ASR
Viikki, O
[J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 1 - 2
[6] AN EXTENDED EXPERIMENTAL INVESTIGATION OF DNN UNCERTAINTY PROPAGATION FOR NOISE ROBUST ASR
Nathwani, Karan
Morales-Cordovilla, Juan A.
Sivasankaran, Sunit
Illina, Irina
Vincent, Emmanuel
[J]. 2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 26 - 30
[7] Assessing local noise level estimation methods: Application to noise robust ASR
Ris, C
Dupont, S
[J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 141 - 158
[8] DNN Uncertainty Propagation Using GMM-Derived Uncertainty Features for Noise Robust ASR
Nathwani, Karan
Vincent, Emmanuel
Illina, Irina
[J]. IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (03) : 338 - 342
[9] Model-based feature enhancement with uncertainty decoding for noise robust ASR
Stouten, Veronique
Van hamme, Hugo
Wambacq, Patrick
[J]. SPEECH COMMUNICATION, 2006, 48 (11) : 1502 - 1514
[10] Noise robust discriminative models
Le, Q
Bengio, S
[J]. Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, Vols 1 and 2, 2004, : 375 - 378