English grammar intelligent error correction technology based on the n-gram language model

被引:0
|
作者
Xiao, Fan [2 ]
Yin, Shehui [1 ]
机构
[1] Henan Polytech Inst, Fundamental Teaching Sect, Nanyang 473000, Peoples R China
[2] Henan Polytech Inst, Coll Int Educ & Cultural Tourism, Nanyang 473000, Peoples R China
关键词
grammar error correction; move the window; n-gram algorithm; linear interpolation smoothing algorithm;
D O I
10.1515/jisys-2023-0259
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of the Internet, the number of electronic texts has increased rapidly. Automatic grammar error correction technology is an effective safeguard measure for the quality of electronic texts. To improve the quality of electronic text, this study introduces a moving window algorithm and linear interpolation smoothing algorithm to build a Cn-gram language model. On this basis, a syntactic analysis strategy is introduced to construct a syntactic error correction model integrating Cn-gram and syntactic analysis, and English grammar intelligent error correction is carried out through the model. The results show that compared with the Bi-gram and Tri-gram, the precision of the Cn-gram model is 0.85 and 0.91% higher, and the F1 value is 0.97 and 1.14% higher, respectively. Compared with the results of test set Long, the Cn-gram model has better performance on verb error correction of the Short test set, and the precision rate, recall rate, and F1 value are increased by 0.86, 3.94, and 1.87%, respectively. The comparison of the precision, recall rate, and F1 value of the proposed grammar error correction model on the complete test set shows that the precision of the study is 19.10 and 5.41% higher for subject-verb agreement errors. The recall rate is 9.55 and 10.77% higher, respectively; F1 values are higher by 12.65 and 10.59%, respectively. The above results show that the error-correcting technique of the research design has excellent error-correcting performance. It is hoped that this experiment can provide a reference for the relevant research of automatic error correction technology of electronic text.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] N-gram based filler model for robust grammar authoring
    Yu, Dong
    Ju, Yun Cheng
    Wang, Ye-Yi
    Acero, Alex
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 565 - 568
  • [2] Similar N-gram Language Model
    Gillot, Christian
    Cerisara, Christophe
    Langlois, David
    Haton, Jean-Paul
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1824 - 1827
  • [3] A unified context-free grammar and n-gram model for spoken language processing
    Wang, YY
    Mahajan, M
    Huang, XD
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1639 - 1642
  • [4] Chinese Error Correction of Searching Engine under N-gram Statistic Model
    Liu, Gang
    Chen, Zhipeng
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [5] Bangla Word Clustering Based on N-gram Language Model
    Ismail, Sabir
    Rahman, M. Shahidur
    2014 1ST INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION & COMMUNICATION TECHNOLOGY (ICEEICT 2014), 2014,
  • [6] A New Estimate of the n-gram Language Model
    Aouragh, Si Lhoussain
    Yousfi, Abdellah
    Laaroussi, Saida
    Gueddah, Hicham
    Nejja, Mohammed
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 211 - 215
  • [7] Development of the N-gram Model for Azerbaijani Language
    Bannayeva, Aliya
    Aslanov, Mustafa
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2020), 2020,
  • [8] English Grammar Error Correction Algorithm Based on Classification Model
    Zhou, Shanchun
    Liu, Wei
    COMPLEXITY, 2021, 2021
  • [9] An intelligent extension of the training set for the Persian n-gram language model: an enrichment algorithm
    Motavallian, Rezvan
    Komeily, Masoud
    ONOMAZEIN, 2023, (61): : 191 - 211
  • [10] A variant of n-gram based language classification
    Tomovic, Andrija
    Janicic, Predrag
    AI(ASTERISK)IA 2007: ARTIFICIAL INTELLIGENCE AND HUMAN-ORIENTED COMPUTING, 2007, 4733 : 410 - +