Upper and lower bounds on the mean of noisy speech: Application to minimax classification

被引:3
|
作者
Afify, M [1 ]
Siohan, O [1 ]
Lee, CH [1 ]
机构
[1] Bell Labs, Lucent Technol, Multimedia Commun Res Lab, Murray Hill, NJ 07974 USA
来源
关键词
minimax classification; robust decision rules; speech recognition;
D O I
10.1109/89.985545
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we derive upper and lower bounds on the mean of speech corrupted by additive noise. The bounds are derived in the log spectral domain. Also approximate bounds on the first and second order time derivatives are developed. It is also shown how to transform these bounds to the Mel frequency cepstral coefficient (MFCC) domain. The proposed bounds are used to define the mismatch neighborhood for minimax classification. It is shown that this parametric neighborhood works quite well for artificially added noise and for a real-life mismatch scenario (moving car environment) which does not fully conform with the theoretical conditions used to derive the bounds. In contrast to traditional neighborhood structure for minimax classification, no empirical tuning of the bounds is required. It is believed that the applicability of the derived bounds is not limited to a minimax setting and can be potentially used to develop various compensation scenarios in the log spectral domain.
引用
收藏
页码:79 / 88
页数:10
相关论文
共 50 条