Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model

被引:0
|
作者
Katsumata, Satoru [1 ]
Komachi, Mamoru [1 ]
机构
[1] Tokyo Metropolitan Univ, Tokyo, Japan
基金
日本学术振兴会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Studies on grammatical error correction (GEC) have reported the effectiveness of pretraining a Seq2Seq model with a large amount of pseudodata. However, this approach requires time-consuming pretraining for GEC because of the size of the pseudodata. In this study, we explore the utility of bidirectional and auto-regressive transformers (BART) as a generic pretrained encoder-decoder model for GEC. With the use of this generic pretrained model for GEC, the time-consuming pretraining can be eliminated. We find that monolingual and multilingual BART models achieve high performance in GEC, with one of the results being comparable to the current strong results in English GEC. Our implementations are publicly available at GitHub(1).
引用
收藏
页码:827 / 832
页数:6
相关论文
共 50 条
  • [21] Unsupervised Encoder-Decoder Model for Anomaly Prediction Task
    Wu, Jinmeng
    Shu, Pengcheng
    Hong, Hanyu
    Li, Xingxun
    Ma, Lei
    Zhang, Yaozong
    Zhu, Ying
    Wang, Lei
    [J]. MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 549 - 561
  • [22] A joint encoder-decoder error control framework for stereoscopic video coding
    Xiang, Xinguang
    Zhao, Debin
    Wang, Qiang
    Ma, Siwei
    Gao, Wen
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2010, 21 (08) : 975 - 985
  • [23] Using FCOS and an Encoder-Decoder Model to Detect and Recognize Visual Mathematical Equations
    Wheelwright, Angel Jo
    Ng, Yiu-Kai
    [J]. 9TH INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING, ICMIP 2024, 2024, : 44 - 51
  • [24] Storm Surge Forecast Using an Encoder-Decoder Recurrent Neural Network Model
    Wei, Zhangping
    Nguyen, Hai Cong
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (12)
  • [25] An automated detection system for colonoscopy images using a dual encoder-decoder model
    Hwang, Maxwell
    Wang, Da
    Kong, Xiang-Xing
    Wang, Zhanhuai
    Li, Jun
    Jiang, Wei-Cheng
    Hwang, Kao-Shing
    Ding, Kefeng
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2020, 84
  • [26] A study on the role of latent variables in the encoder-decoder model using image datasets
    Okamoto, Saki
    Jin'no, Kenya
    [J]. IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2023, 14 (04): : 652 - 676
  • [27] Table Structure Recognition Using CoDec Encoder-Decoder
    Pegu, Bhanupriya
    Singh, Maneet
    Agarwal, Aakash
    Mitra, Aniruddha
    Singh, Karamjit
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II, 2021, 12917 : 66 - 80
  • [28] Using LSTM encoder-decoder for rhetorical structure prediction
    de Moura, Gustavo Bennemann
    Feltrim, Valeria Delisandra
    [J]. 2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 278 - 283
  • [29] Using Convolutional Encoder-Decoder for Document Image Binarization
    Peng, Xujun
    Cao, Huaigu
    Natarajan, Prem
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 708 - 713
  • [30] Unsupervised Feature Selection using Encoder-Decoder Networks
    SharifiPour, Sasan
    Fayyazi, Hossein
    Sabokro, Mohammad
    [J]. 2020 6TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2020,