Grammatical Error Correction with Dependency Distance

Cited by: 2
Authors
Lin, Haowen [1]
Li, JinLong [1]
Zhang, Xu [1]
Chen, Huanhuan [1]
Affiliations
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
Keywords
grammatical error correction; text processing; natural language generation; self-attention; dependency parsing
DOI
10.1145/3459637.3482348
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Grammatical Error Correction (GEC) is commonly treated as a low-resource machine translation task that translates a sentence from an ungrammatical language into a grammatical one. The state-of-the-art approach to GEC, the transformer-based neural machine translation model, takes the input sentence as a token sequence without any information about the sentence's structure, and so may be misled by unusual ungrammatical contexts. In response, to pay more attention to a given token's correct collocations than to the misleading tokens, we propose dependent self-attention, which relatively increases the attention score between correct collocations according to the dependency distance between tokens. However, because the source sentence in GEC is ungrammatical, the correct collocations can hardly be extracted by a standard dependency parser. We therefore propose a dependency parser for ungrammatical sentences to obtain the dependency distance between tokens in the ungrammatical sentence. Our method achieves competitive results on the BEA-2019 shared task, the CoNLL-2014 shared task, and the JFLEG test set.
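The abstract does not give the formula used by dependent self-attention, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes that dependency distance means the path length between two tokens in the dependency tree, and that the bias is a linear penalty subtracted from the raw attention scores before the softmax. The function names, the `alpha` weight, and the example sentence are all hypothetical and are not taken from the paper.

```python
import math
from collections import deque


def dependency_distances(heads):
    """Pairwise path lengths in a dependency tree.

    heads[i] is the index of token i's head, or -1 for the root.
    Assumption: distance = number of edges on the undirected tree path.
    """
    n = len(heads)
    adj = [[] for _ in range(n)]
    for child, head in enumerate(heads):
        if head >= 0:
            adj[child].append(head)
            adj[head].append(child)

    dist = [[math.inf] * n for _ in range(n)]
    for start in range(n):
        # Breadth-first search from each token.
        dist[start][start] = 0
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if dist[start][v] == math.inf:
                    dist[start][v] = dist[start][u] + 1
                    queue.append(v)
    return dist


def biased_attention_weights(scores, dist, alpha=1.0):
    """Row-wise softmax over (score - alpha * distance).

    Hypothetical bias form: tokens that are close in the tree keep
    relatively more attention mass than distant ones.
    """
    weights = []
    for score_row, dist_row in zip(scores, dist):
        logits = [s - alpha * d for s, d in zip(score_row, dist_row)]
        peak = max(logits)  # subtract the max for numerical stability
        exps = [math.exp(x - peak) for x in logits]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


# Hypothetical parse of "She likes red apples": "likes" is the root.
heads = [1, -1, 3, 1]
dist = dependency_distances(heads)

# Uniform raw scores isolate the effect of the distance bias.
scores = [[0.0] * 4 for _ in range(4)]
weights = biased_attention_weights(scores, dist)
```

With uniform raw scores, the weights for "She" fall off with tree distance: its head "likes" receives more attention than "apples", which in turn receives more than "red". In a real model the raw scores would come from the scaled query-key products, and the distances would come from a parser that can handle ungrammatical input, as the abstract describes.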
Pages: 1018-1027
Page count: 10