Experiments with linguistic categories for language model optimization

Cited by: 0
Authors
Casillas, A [1]
Varona, A [1]
Torres, I [1]
Affiliations
[1] Univ Basque Country, Fac Ciencias, Dpt Electricidad & Electron, E-48080 Bilbao, Spain
Source
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS | 2003 / Vol. 2588
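Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes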
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work we obtain robust category-based language models to be integrated into speech recognition systems. Deductive rules are used to select linguistic categories and to match words with categories. Statistical techniques are then used to build n-gram Language Models based on lexicons that consist of sets of categories. The categorization procedure and the language model evaluation were carried out on a task-oriented Spanish corpus. The cooperation between deductive and inductive approaches has proved efficient in building small, reliable language models for speech understanding purposes.
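The two-stage idea in the abstract (rules assign words to categories, then n-gram statistics are estimated over category sequences) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the category set, the word-to-category table, the two toy Spanish sentences, the choice of bigrams, and the add-one smoothing.

```python
from collections import Counter

# Hypothetical hand-written mapping standing in for the deductive rules
# (the paper's actual categories and rules are not given in this record).
WORD_TO_CATEGORY = {
    "quiero": "VERB", "salir": "VERB",
    "un": "DET", "el": "DET",
    "billete": "NOUN", "tren": "NOUN",
    "a": "PREP", "de": "PREP",
    "bilbao": "CITY", "madrid": "CITY",
}

def categorize(sentence):
    """Replace each word by its category; unknown words map to UNK."""
    return [WORD_TO_CATEGORY.get(w, "UNK") for w in sentence.lower().split()]

def train_bigram(sentences):
    """Count category unigrams (as contexts) and category bigrams."""
    unigrams, bigrams = Counter(), Counter()
    for s in sentences:
        cats = ["<s>"] + categorize(s) + ["</s>"]
        unigrams.update(cats[:-1])               # every token that has a successor
        bigrams.update(zip(cats[:-1], cats[1:]))
    return unigrams, bigrams

def bigram_prob(prev, cur, unigrams, bigrams, vocab_size):
    """Add-one smoothed estimate of P(cur | prev) over categories."""
    return (bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size)

# Toy corpus (assumed, for illustration only).
corpus = ["quiero un billete a Bilbao", "quiero salir de Madrid"]
unigrams, bigrams = train_bigram(corpus)

# Possible successor symbols: all categories, UNK, and the end marker.
categories = set(WORD_TO_CATEGORY.values()) | {"UNK", "</s>"}
p = bigram_prob("PREP", "CITY", unigrams, bigrams, len(categories))
```

Because the model's lexicon is the small set of categories rather than the full word vocabulary, the number of parameters to estimate stays small, which is consistent with the abstract's emphasis on small, reliable models.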
Pages: 511-515
Page count: 5