On the Combination of Auditory and Modulation Frequency Channels for ASR applications

被引:0
|
作者
Valente, Fabio [1 ]
Hermansky, Hynek [1 ]
机构
[1] IDIAP Res Inst, Martigny, Switzerland
关键词
Modulation spectrum; Neural Network; LVCSR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the combination of evidence coming from different frequency channels obtained filtering the speech signal at different auditory and modulation frequencies. In our previous work [1], we showed that combination of classifiers trained on different ranges of modulation frequencies is more effective if performed in sequential (hierarchical) fashion. In this work we verity that combination of classifiers trained on different ranges of auditory frequencies is more effective if performed in parallel fashion. Furthermore we propose an architecture based on neural networks for combining evidence coming from different auditory-modulation frequency sub-bands that takes advantages of previous findings. This reduces the final WER by 6.2% (from 45.8% to 39.6%) w.r.t the single classifier approach in a LVCSR task.
引用
收藏
页码:2242 / +
页数:2
相关论文
共 50 条