A Chinese Grammatical Error Correction Method Based on Iterative Training and Sequence Tagging

被引:0
|
作者
Kuang, Hailan [1 ]
Wu, Kewen [1 ]
Ma, Xiaolin [1 ]
Liu, Xinhua [1 ]
机构
[1] Wuhan Univ Technol, Sch Informat Engn, Hubei Key Lab Broadband Wireless Commun & Sensor, Wuhan 430070, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 09期
关键词
Chinese grammatical error correction; sequence tagging; iterative training; conditional random field; focal loss;
D O I
10.3390/app12094364
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Chinese grammatical error correction (GEC) is under continuous development and improvement, and this is a challenging task in the field of natural language processing due to the high complexity and flexibility of Chinese grammar. Nowadays, the iterative sequence tagging approach is widely applied to Chinese GEC tasks because it has a faster inference speed than sequence generation approaches. However, the training phase of the iterative sequence tagging approach uses sentences for only one round, while the inference phase is an iterative process. This makes the model focus only on the current sentence's current error correction results rather than considering the results after multiple rounds of correction. In order to address this problem of mismatch between the training and inference processes, we propose a Chinese GEC method based on iterative training and sequence tagging (CGEC-IT). First, in the iterative training phase, we dynamically generate the target tags for each round by using the final target sentences and the input sentences of the current round. The final loss is the average of each round's loss. Next, by adding conditional random fields for sequence labeling, we ensure that the model pays more attention to the overall labeling results. In addition, we use the focal loss to solve the problem of category imbalance caused by the fact that most words in text error correction do not need error correction. Furthermore, the experiments on NLPCC 2018 Task 2 show that our method outperforms prior work by up to 2% on the F0.5 score, which verifies the efficiency of iterative training on the Chinese GEC model.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] An Extended Sequence Tagging Vocabulary for Grammatical Error Correction
    Mesham, Stuart
    Bryant, Christopher
    Rei, Marek
    Yuan, Zheng
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1608 - 1619
  • [2] Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model
    Li, Si
    Zhao, Jianbo
    Shi, Guirong
    Tan, Yuanpeng
    Xu, Huifang
    Chen, Guang
    Lan, Haibo
    Lin, Zhiqing
    [J]. IEEE ACCESS, 2019, 7 : 72905 - 72913
  • [3] A Sequence to Sequence Learning for Chinese Grammatical Error Correction
    Ren, Hongkai
    Yang, Liner
    Xun, Endong
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2018, PT II, 2018, 11109 : 401 - 410
  • [4] Multi-head sequence tagging model for Grammatical Error Correction
    Al-Sabahi, Kamal
    Yang, Kang
    Liu, Wangwang
    Jiang, Guanyu
    Li, Xian
    Yang, Ming
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [5] A Chinese Grammatical Error Correction Model Based On Grammatical Generalization And Parameter Sharing
    Lin, Nankai
    Lin, Xiaotian
    Fu, Yingwen
    Jiang, Shengyi
    Wang, Lianxi
    [J]. COMPUTER JOURNAL, 2023, 67 (05): : 1628 - 1636
  • [6] Pre-Training-Based Grammatical Error Correction Model for the Written Language of Chinese Hearing Impaired Students
    Chen, Binbin
    Zhang, Jingyu
    [J]. IEEE ACCESS, 2022, 10 : 35061 - 35072
  • [7] Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction
    Li, Piji
    Shi, Shuming
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4973 - 4984
  • [8] A Hybrid System for Chinese Grammatical Error Diagnosis and Correction
    Li, Chen
    Zhou, Junpei
    Bao, Zuyi
    Liu, Hengyou
    Xu, Guangwei
    Li, Linlin
    [J]. NATURAL LANGUAGE PROCESSING TECHNIQUES FOR EDUCATIONAL APPLICATIONS, 2018, : 60 - 69
  • [9] Leveraging Adversarial Training to Facilitate Grammatical Error Correction
    Dang, Kai
    Xie, Jiaying
    Liu, Jie
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 67 - 78
  • [10] Data Weighted Training Strategies for Grammatical Error Correction
    Lichtarge, Jared
    Alberti, Chris
    Kumar, Shankar
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 634 - 646