A Comprehensive Survey of Grammatical Error Correction

被引:14
|
作者
Wang, Yu [1 ]
Wang, Yuelin [1 ]
Dang, Kai [1 ]
Liu, Jie [1 ]
Liu, Zhuo [1 ]
机构
[1] Nankai Univ, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
Grammatical error correction; machine translation; natural language processing;
D O I
10.1145/3474840
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Grammatical error correction (GEC) is an important application aspect of natural language processing techniques, and GEC system is a kind of very important intelligent system that has long been explored both in academic and industrial communities. The past decade has witnessed significant progress achieved in GEC for the sake of increasing popularity of machine learning and deep learning. However, there is not a survey that untangles the large amount of research works and progress in this field. We present the first survey in GEC for a comprehensive retrospective of the literature in this area. We first give the definition of GEC task and introduce the public datasets and data annotation schema. After that, we discuss six kinds of basic approaches, six commonly applied performance boosting techniques for GEC systems, and three data augmentation methods. Since GEC is typically viewed as a sister task of Machine Translation (MI), we put more emphasis on the statistical machine translation (SMT)-based approaches and neural machine translation (NMT)-based approaches for the sake of their importance. Similarly, some performance-boosting techniques are adapted from MT and are successfully combined with GEC systems for enhancement on the final performance. More importantly, after the introduction of the evaluation in GEC, we make an in-depth analysis based on empirical results in aspects of GEC approaches and GEC systems for a clearer pattern of progress in GEC, where error type analysis and system recapitulation are clearly presented. Finally, we discuss five prospective directions for future GEC researches.
引用
收藏
页数:51
相关论文
共 50 条
  • [21] GECToR - Grammatical Error Correction: Tag, Not Rewrite
    Omelianchuk, Kostiantyn
    Atrasevych, Vitally
    Chernodub, Artem
    Skurzhanskyi, Oleksandr
    [J]. INNOVATIVE USE OF NLP FOR BUILDING EDUCATIONAL APPLICATIONS, 2020, : 163 - 170
  • [22] Enhancing Grammatical Error Correction Systems with Explanations
    Fei, Yuejiao
    Cui, Leyang
    Yang, Sen
    Lam, Wai
    Lan, Zhenzhong
    Shi, Shuming
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 7489 - 7501
  • [23] Grammatical Error Correction: Machine Translation and Classifiers
    Rozovskaya, Alla
    Roth, Dan
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 2205 - 2215
  • [24] Revisiting the Evaluation for Chinese Grammatical Error Correction
    Wang, Hongfei
    Chen, Zhousi
    Zhang, Zizheng
    Ling, Zhidong
    Pan, Xiaomeng
    Duan, Wenjie
    Mita, Masato
    Komachi, Mamoru
    [J]. Journal of Advanced Computational Intelligence and Intelligent Informatics, 2024, 28 (06) : 1380 - 1390
  • [25] A Chinese Grammatical Error Correction Model Based On Grammatical Generalization And Parameter Sharing
    Lin, Nankai
    Lin, Xiaotian
    Fu, Yingwen
    Jiang, Shengyi
    Wang, Lianxi
    [J]. COMPUTER JOURNAL, 2023, 67 (05): : 1628 - 1636
  • [26] Grammatical Error Correction with Contrastive Learning in Low Error Density Domains
    Cao, Hannan
    Yang, Wenmian
    Ng, Hwee Tou
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4867 - 4874
  • [27] ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction
    Yuan, Xun
    Pham, Derek
    Davidson, Sam
    Yu, Zhou
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 76 - 84
  • [28] Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction
    Chen, Mengyun
    Ge, Tao
    Zhang, Xingxing
    Wei, Furu
    Zhou, Ming
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7162 - 7169
  • [29] Deep Sentence Denoising beyond Grammatical Error Correction
    Liang, Zhantong
    Youssef, Abdou
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1686 - 1691
  • [30] Revisiting Meta-evaluation for Grammatical Error Correction
    Kobayashi, Masamune
    Mita, Masato
    Komachi, Mamoru
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 837 - 855