A Method for Recognizing Noisy Romanized Japanese Words in Learner English

被引:0
|
作者
Nagata, Ryo [1 ]
Kakegawa, Jun-ichi [2 ]
Sugimoto, Hiromi [3 ]
Yabuta, Yukiko [3 ]
机构
[1] Konan Univ, Kobe, Hyogo 6588501, Japan
[2] Hyogo Univ Teachers Educ, Kato 6731494, Japan
[3] Japanese Inst Educ Measurement Inc, Tokyo 1628680, Japan
来源
关键词
romanized Japanese words; English writing; grammatical error detection; learner English; language learning and teaching;
D O I
10.1093/ietisy/e91-d.10.2458
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a method for recognizing romanized Japanese words in learner English. They become noise and problematic in a variety of systems and tools for language learning and teaching including text analysis. spell checking. and grammatical error detection because they are Japanese words and thus mostly unknown to such systems and tools. A problem one encounters when recognizing romanized Japanese words in learner English is that the spelling rules of romanized Japanese words are often violated. To address this problem, the described method uses it Clustering algorithm reinforced by a small set of rules. Experiments show that it achieves an F-measure of 0.879 and outperforms other methods. They also show that it only requires, the target text and an English word list of reasonable size.
引用
收藏
页码:2458 / 2466
页数:9
相关论文
共 50 条