Improving a lexicon-based spelling checker for Sesotho sa Leboa

被引:2
|
作者
Prinsloo, D. J. [1 ]
Eiselen, Roald [2 ]
机构
[1] Univ Pretoria, Dept African Languages, ZA-0002 Pretoria, South Africa
[2] North West Univ, Ctr Text Technol, Vanderbijlpark, South Africa
关键词
D O I
10.1080/02572117.2005.10587245
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
The aim of this article is to investigate how (i) n-gram analysis and (ii) the application of grammatical rules can improve the lexical recall of the spelling checker for Sesotho sa Leboa developed by the Centre for Text Technology. North-West University in cooperation with the Department of African Languages at the University of Pretoria. It will be shown that for a disjunctively written language like Sesotho sa Leboa lexical recall exceeding 95% can be obtained by using a list of frequently occurring words. The paper will first investigate the efficiency of using grapheme-based n-gram models in the spellchecking procedure. Second. it will discuss the utilization of grammatical rules to increase lexical recall, focusing on nominal constructions such as the diminutive. locative and augmentative. and also on verbal suffixes and suffix combinations.
引用
收藏
页码:11 / 24
页数:14
相关论文
共 50 条