How Far are We from Fully Automatic High Quality Grammatical Error Correction?

被引:0
|
作者
Bryant, Christopher [1 ]
Hwee Tou Ng [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, 13 Comp Dr, Singapore 117417, Singapore
关键词
AGREEMENT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we first explore the role of inter-annotator agreement statistics in grammatical error correction and conclude that they are less informative in fields where there may be more than one correct answer. We next created a dataset of 50 student essays, each corrected by 10 different annotators for all error types, and investigated how both human and GEC system scores vary when different combinations of these annotations are used as the gold standard. Upon learning that even humans are unable to score higher than 75% F-0.5, we propose a new metric based on the ratio between human and system performance. We also use this method to investigate the extent to which annotators agree on certain error categories, and find that similar results can be obtained from a smaller subset of just 10 essays.
引用
收藏
页码:697 / 707
页数:11
相关论文
共 50 条
  • [1] Automatic Metric Validation for Grammatical Error Correction
    Choshen, Leshem
    Abend, Omri
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1372 - 1382
  • [2] Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
    Suzuki, Daisuke
    Takahashi, Yujin
    Yamashita, Ikumi
    Aida, Taichi
    Hirasawa, Tosho
    Nakatsuji, Michitaka
    Mita, Masato
    Komachi, Mamoru
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5565 - 5572
  • [3] Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
    Suzuki, Daisuke
    Takahashi, Yujin
    Yamashita, Ikumi
    Aida, Taichi
    Hirasawa, Tosho
    Nakatsuji, Michitaka
    Mita, Masato
    Komachi, Mamoru
    [J]. arXiv, 2022,
  • [4] Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
    Suzuki, Daisuke
    Takahashi, Yujin
    Yamashita, Ikumi
    Aida, Taichi
    Hirasawa, Tosho
    Nakatsuji, Michitaka
    Mita, Masato
    Komachi, Mamoru
    [J]. 2022 Language Resources and Evaluation Conference, LREC 2022, 2022, : 5565 - 5572
  • [5] Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction
    Bryant, Christopher
    Felice, Mariano
    Briscoe, Ted
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 793 - 805
  • [6] Neural Quality Estimation of Grammatical Error Correction
    Chollampatt, Shamil
    Ng, Hwee Tou
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2528 - 2539
  • [7] How Good (really) are Grammatical Error Correction Systems?
    Rozovskaya, Alla
    Roth, Dan
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2686 - 2698
  • [8] Automatic Grammatical Error Correction Based on Edit Operations Information
    Wang, Quanbin
    Tan, Ying
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 : 494 - 505
  • [9] A Review of the Research on the Evaluation Metrics for Automatic Grammatical Error Correction System
    Long, Manli
    Wang, Yan
    Peng, Yifei
    Huang, Wanwu
    [J]. MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [10] Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
    Liu, Zhenghao
    Yi, Xiaoyuan
    Sun, Maosong
    Yang, Liner
    Chua, Tat-Seng
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5441 - 5452