Prompt as a Knowledge Probe for Chinese Spelling Check

被引:0
|
作者
Peng, Kun [1 ,2 ]
Sun, Nannan [1 ]
Cao, Jiahao [1 ,2 ]
Liu, Rui [1 ,2 ]
Ren, Jiaqian [1 ,2 ]
Jiang, Lei [1 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
关键词
Chinese Spelling Check; Prompt-tuning; Natural language processing; Pre-training model; BERT;
D O I
10.1007/978-3-031-10989-8_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chinese Spelling Check (CSC) is a challenging task to detect and correct wrong characters in Chinese sentences. Since most Chinese spelling mistakes are caused by visual or pronunciation similarities of characters, recent researches tend to utilize external phonological and morphological resources for this task. However, their works rely heavily on hand-constructed confusion sets and multimodal data, causing high labor costs. To this end, we propose an end-to-end generative model called PromptCSC. First, we notice that the misspelling of characters causes unnatural semantic incoherence in sentences. By using the prompt template as a knowledge probe, PromptCSC detects and outputs the error probability of each character in the sentence. The error locations are then corrected using BERT's soft mask mechanism. Experimental results on the SIGHAN benchmarks show that our approach achieves excellent performance without external resources.
引用
收藏
页码:516 / 527
页数:12
相关论文
共 50 条
  • [1] DUKE: Distance Fusion and Knowledge Enhanced Framework for Chinese Spelling Check
    Liang, Jianzeng
    Huang, Wenkang
    Li, Fengyi
    Shi, Qiuhui
    [J]. 2022 EURO-ASIA CONFERENCE ON FRONTIERS OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, FCSIT, 2022, : 1 - 5
  • [2] CLSpell: Contrastive learning with phonological and visual knowledge for chinese spelling check
    Mao, Xingliang
    Shan, Youran
    Li, Fangfang
    Chen, Xiaohong
    Zhang, Shichao
    [J]. NEUROCOMPUTING, 2023, 554
  • [3] Improve Chinese Spelling Check by Reevaluation
    Wang, Shuai
    Shang, Lin
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT III, 2022, 13282 : 237 - 248
  • [4] A Hybrid Model for Chinese Spelling Check
    Zhao, Hai
    Cai, Deng
    Xin, Yang
    Wang, Yuzhu
    Jia, Zhongye
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2017, 16 (03)
  • [5] A Probabilistic Framework for Chinese Spelling Check
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Hsin-Hsi
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2015, 14 (04)
  • [6] A Hybrid Ranking Approach to Chinese Spelling Check
    Liu, Xiaodong
    Cheng, Fei
    Duh, Kevin
    Matsumoto, Yuji
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2015, 14 (04)
  • [7] Chinese Spelling Check based on Sequence Labeling
    Han, Zijia
    Lv, Chengguo
    Wang, Qiansheng
    Fu, Guohong
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 373 - 378
  • [8] Dynamic Connected Networks for Chinese Spelling Check
    Wang, Baoxin
    Che, Wanxiang
    Wu, Dayong
    Wang, Shijin
    Hu, Guoping
    Liu, Ting
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2437 - 2446
  • [9] SpellBERT: A Lightweight Pretrained Model for Chinese Spelling Check
    Ji, Tuo
    Yan, Hang
    Qiu, Xipeng
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3544 - 3551
  • [10] A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check
    Wane, Dingmin
    Song, Yan
    Li, Jing
    Han, Jialong
    Zhang, Haisong
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2517 - 2527