Graphical models for integrating syllabic information

被引:4
|
作者
Bartels, Chris D. [1 ]
Bilmes, Jeff A. [1 ]
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
来源
COMPUTER SPEECH AND LANGUAGE | 2010年 / 24卷 / 04期
关键词
Speech recognition; Graphical models; Dynamic Bayesian networks; Syllables; SPEECH;
D O I
10.1016/j.csl.2009.11.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present graphical model based methodology that enhances a speech recognizer with information about syllabic segmentations. The segmentations are specified by locations of syllable nuclei, and the graphical models are able to consider these locations as "soft" information. The graphs give improved discrimination between speech and noise when compared to a baseline model. When using locations derived from oracle information an overall improvement is shown, and when the oracle syllable nuclei are augmented with information about lexical stress the methods give additional improvements over locations alone. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:685 / 697
页数:13
相关论文
共 50 条