Models of English text

被引:7
|
作者
Teahan, WJ
Cleary, JG
机构
关键词
D O I
10.1109/DCC.1997.581953
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of constructing models of English text is considered. A number of applications of such models including cryptology, spelling correction and speech recognition are reviewed. The best current models for English text have been the result of research into compression. Not only is this an important application of such models but the amount of compression provides a measure of how well such models perform. Three main classes of models are considered: character based models, word based models, and models which use auxiliary information in the form of parts of speech. These models are compared in terms of their memory usage and compression.
引用
收藏
页码:12 / 21
页数:10
相关论文
共 50 条