Grammatical Error Correction: A Survey of the State of the Art

被引:7
|
作者
Bryant, Christopher [1 ,2 ]
Yuan, Zheng [3 ]
Qorib, Muhammad Reza [4 ]
Cao, Hannan [4 ]
Ng, Hwee Tou [4 ]
Briscoe, Ted [5 ]
机构
[1] Univ Cambridge, ALTA Inst, Dept Comp Sci & Technol, Cambridge, England
[2] Writer Inc, San Francisco, CA 94143 USA
[3] Kings Coll London, Dept Informat, London, England
[4] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
[5] Mohamed bin Zayed Univ Artificial Intelligence, Nat Language Proc Dept, Abu Dhabi, U Arab Emirates
基金
新加坡国家研究基金会;
关键词
265;
D O I
10.1162/coli_a_00478
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Grammatical Error Correction (GEC) is the task of automatically detecting and correcting errors in text. The task not only includes the correction of grammatical errors, such as missing prepositions and mismatched subject-verb agreement, but also orthographic and semantic errors, such as misspellings and word choice errors, respectively. The field has seen significant progress in the last decade, motivated in part by a series of five shared tasks, which drove the development of rule-based methods, statistical classifiers, statistical machine translation, and finally neural machine translation systems, which represent the current dominant state of the art. In this survey paper, we condense the field into a single article and first outline some of the linguistic challenges of the task, introduce the most popular datasets that are available to researchers (for both English and other languages), and summarize the various methods and techniques that have been developed with a particular focus on artificial error generation. We next describe the many different approaches to evaluation as well as concerns surrounding metric reliability, especially in relation to subjective human judgments, before concluding with an overview of recent progress and suggestions for future work and remaining challenges. We hope that this survey will serve as a comprehensive resource for researchers who are new to the field or who want to be kept apprised of recent developments.
引用
收藏
页码:643 / 701
页数:59
相关论文
共 50 条
  • [1] A Comprehensive Survey of Grammatical Error Correction
    Wang, Yu
    Wang, Yuelin
    Dang, Kai
    Liu, Jie
    Liu, Zhuo
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (05)
  • [2] Neural Grammatical Error Correction with Finite State Transducers
    Stahlberg, Felix
    Bryant, Christopher
    Byrne, Bill
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 4033 - 4039
  • [3] Adversarial Grammatical Error Correction
    Raheja, Vipul
    Alikaniotis, Dimitris
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [4] Efficient Grammatical Error Correction with Hierarchical Error Detections and Correction
    Pan, Fayu
    Cao, Bin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, ICWS 2021, 2021, : 525 - 530
  • [5] Towards Lithuanian Grammatical Error Correction
    Stankevicius, Lukas
    Lukosevicius, Mantas
    [J]. ARTIFICIAL INTELLIGENCE TRENDS IN SYSTEMS, VOL 2, 2022, 502 : 490 - 503
  • [6] Corpora Generation for Grammatical Error Correction
    Lichtarge, Jared
    Alberti, Chris
    Kumar, Shankar
    Shazeer, Noam
    Parmar, Niki
    Tong, Simon
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3291 - 3301
  • [7] Spoken Language 'Grammatical Error Correction'
    Lu, Yiting
    Gales, Mark J. F.
    Wang, Yu
    [J]. INTERSPEECH 2020, 2020, : 3840 - 3844
  • [8] Grammatical Error Correction with Dependency Distance
    Lin, Haowen
    Li, JinLong
    Zhang, Xu
    Chen, Huanhuan
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 1018 - 1027
  • [9] Grammatical Error Correction with Denoising Autoencoder
    Pajak, Krzysztof
    Gonczarek, Adam
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (08) : 821 - 826
  • [10] Neural Grammatical Error Correction for Romanian
    Cotet, Teodor-Mihai
    Ruseti, Stefan
    Dascalu, Mihai
    [J]. 2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 625 - 631