English grammar intelligent error correction technology based on the n-gram language model

Cited by: 0

Authors:

Xiao, Fan ^{[2]}

Yin, Shehui ^{[1]}

Affiliations:

[1] Henan Polytechnic Institute, Fundamental Teaching Section, Nanyang 473000, People's Republic of China

[2] Henan Polytechnic Institute, College of International Education and Cultural Tourism, Nanyang 473000, People's Republic of China

Source:

Journal of Intelligent Systems | 2024 / Vol. 33 / Issue 1

Keywords:

grammar error correction; move the window; n-gram algorithm; linear interpolation smoothing algorithm

DOI:

10.1515/jisys-2023-0259

Chinese Library Classification:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

With the development of the Internet, the number of electronic texts has increased rapidly, and automatic grammar error correction is an effective safeguard for their quality. To improve the quality of electronic text, this study introduces a moving window algorithm and a linear interpolation smoothing algorithm to build a Cn-gram language model. On this basis, a syntactic analysis strategy is introduced to construct a syntactic error correction model that integrates Cn-gram and syntactic analysis, and the model is used for intelligent English grammar error correction. The results show that, compared with the Bi-gram and Tri-gram models, the precision of the Cn-gram model is 0.85% and 0.91% higher, and the F1 value is 0.97% and 1.14% higher, respectively. On verb error correction, the Cn-gram model performs better on the Short test set than on the Long test set: precision, recall, and F1 value are higher by 0.86%, 3.94%, and 1.87%, respectively. In the comparison on the complete test set, for subject-verb agreement errors, the precision of the proposed grammar error correction model is 19.10% and 5.41% higher, the recall is 9.55% and 10.77% higher, and the F1 value is 12.65% and 10.59% higher, respectively. These results indicate that the proposed error correction technique performs well. It is hoped that this work can provide a reference for research on automatic error correction of electronic text.
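
The abstract names linear interpolation smoothing but does not give the Cn-gram formulation, so the following is only a minimal sketch of the general technique on a plain trigram model, not the authors' model. It mixes maximum-likelihood unigram, bigram, and trigram estimates as P(w3 | w1, w2) = l1 * P(w3) + l2 * P(w3 | w2) + l3 * P(w3 | w1, w2). The class name `InterpolatedTrigramModel`, the weights `(0.1, 0.3, 0.6)`, and the toy corpus are all assumptions made for illustration.

```python
from collections import Counter


class InterpolatedTrigramModel:
    """Trigram language model with linear interpolation smoothing (illustrative sketch)."""

    def __init__(self, lambdas=(0.1, 0.3, 0.6)):
        # Hypothetical fixed weights for unigram, bigram, trigram; they must sum to 1.
        if abs(sum(lambdas) - 1.0) > 1e-9:
            raise ValueError("interpolation weights must sum to 1")
        self.l1, self.l2, self.l3 = lambdas
        self.uni, self.bi, self.tri = Counter(), Counter(), Counter()
        self.ctx1, self.ctx2 = Counter(), Counter()  # context counts (denominators)
        self.total = 0

    def fit(self, sentences):
        for sent in sentences:
            # Pad with two start symbols so every word has a trigram context.
            toks = ["<s>", "<s>"] + sent.lower().split() + ["</s>"]
            for i in range(2, len(toks)):
                w1, w2, w3 = toks[i - 2], toks[i - 1], toks[i]
                self.uni[w3] += 1
                self.bi[(w2, w3)] += 1
                self.tri[(w1, w2, w3)] += 1
                self.ctx1[w2] += 1
                self.ctx2[(w1, w2)] += 1
                self.total += 1
        return self

    def prob(self, w1, w2, w3):
        # Maximum-likelihood estimates; an unseen context contributes 0.
        p1 = self.uni[w3] / self.total if self.total else 0.0
        p2 = self.bi[(w2, w3)] / self.ctx1[w2] if self.ctx1[w2] else 0.0
        p3 = (self.tri[(w1, w2, w3)] / self.ctx2[(w1, w2)]
              if self.ctx2[(w1, w2)] else 0.0)
        return self.l1 * p1 + self.l2 * p2 + self.l3 * p3


# Toy corpus (assumed, for illustration only).
corpus = ["he goes to school", "she goes to work", "they go to school"]
model = InterpolatedTrigramModel().fit(corpus)

# Compare two candidate verb forms after the subject "he" at sentence start.
p_goes = model.prob("<s>", "he", "goes")
p_go = model.prob("<s>", "he", "go")
print(p_goes > p_go)
```

In this sketch the unseen trigram "he go" still receives a small non-zero probability from the unigram term, which is the point of smoothing: a correction system can rank candidate replacements by such scores instead of rejecting every unseen n-gram outright. In practice the weights are usually tuned on held-out data rather than fixed by hand.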


Pages: 15
