Automatic Metric Validation for Grammatical Error Correction

被引:0
|
作者
Choshen, Leshem [1 ]
Abend, Omri [1 ,2 ]
机构
[1] Hebrew Univ Jerusalem, Sch Comp Sci & Engn, Jerusalem, Israel
[2] Hebrew Univ Jerusalem, Dept Cognit Sci, Jerusalem, Israel
基金
以色列科学基金会;
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Metric validation in Grammatical Error Correction (GEC) is currently done by observing the correlation between human and metric-induced rankings. However, such correlation studies are costly, methodologically troublesome, and suffer from low inter-rater agreement. We propose MAEGE, an automatic methodology for GEC metric validation, that overcomes many of the difficulties with existing practices. Experiments with MAEGE shed a new light on metric quality, showing for example that the standard M-2 metric fares poorly on corpus-level ranking. Moreover, we use MAEGE to perform a detailed analysis of metric behavior, showing that correcting some types of errors is consistently penalized by existing metrics.
引用
收藏
页码:1372 / 1382
页数:11
相关论文
共 50 条
  • [1] Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction
    Bryant, Christopher
    Felice, Mariano
    Briscoe, Ted
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 793 - 805
  • [2] Automatic Grammatical Error Correction Based on Edit Operations Information
    Wang, Quanbin
    Tan, Ying
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 : 494 - 505
  • [3] A Review of the Research on the Evaluation Metrics for Automatic Grammatical Error Correction System
    Long, Manli
    Wang, Yan
    Peng, Yifei
    Huang, Wanwu
    [J]. MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [4] Adversarial Grammatical Error Correction
    Raheja, Vipul
    Alikaniotis, Dimitris
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [5] Is this the end of the gold standard? A straightforward reference-less grammatical error correction metric
    Islam, Md Asadul
    Magnani, Enrico
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3009 - 3015
  • [6] Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
    Suzuki, Daisuke
    Takahashi, Yujin
    Yamashita, Ikumi
    Aida, Taichi
    Hirasawa, Tosho
    Nakatsuji, Michitaka
    Mita, Masato
    Komachi, Mamoru
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5565 - 5572
  • [7] Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
    Suzuki, Daisuke
    Takahashi, Yujin
    Yamashita, Ikumi
    Aida, Taichi
    Hirasawa, Tosho
    Nakatsuji, Michitaka
    Mita, Masato
    Komachi, Mamoru
    [J]. 2022 Language Resources and Evaluation Conference, LREC 2022, 2022, : 5565 - 5572
  • [8] Efficient Grammatical Error Correction with Hierarchical Error Detections and Correction
    Pan, Fayu
    Cao, Bin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, ICWS 2021, 2021, : 525 - 530
  • [9] Automatic Grammatical Error Correction for Sequence-to-sequence Text Generation: An Empirical Study
    Ge, Tao
    Zhang, Xingxing
    Wei, Furu
    Zhou, Ming
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 6059 - 6064
  • [10] How Far are We from Fully Automatic High Quality Grammatical Error Correction?
    Bryant, Christopher
    Hwee Tou Ng
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 697 - 707