Combining trigram and automatic weight distribution in chinese spelling error correction

被引:12
|
作者
Li, JH [1 ]
Wang, XL [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
spelling error correction; language model; edit distance; weight distribution;
D O I
10.1007/BF02960784
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The researches on spelling correction aiming at detecting errors in texts tend to focus on context-sensitive spelling error correction, which is more difficult than traditional isolated-word error correction. A novel and efficient algorithm for the system of Chinese spelling error correction, CInsunSpell, is presented. In this system, the work of correction includes two parts: checking phase and correcting phase. At the first phase, a Trigram algorithm within one fixed-size window is designed to locate potential errors in local area. The second phase employs. a new method of automatically and dynamically distributing weights among the characters in the confusion set as well as in the Bayesian language model. The tactics used above exhibits good performances.
引用
收藏
页码:915 / 923
页数:9
相关论文
共 50 条
  • [1] Combining trigram and automatic weight distribution in Chinese spelling error correction
    Jianhua Li
    Xiaolong Wang
    [J]. Journal of Computer Science and Technology, 2002, 17 : 915 - 923
  • [2] AUTOMATIC SPELLING CORRECTION USING A TRIGRAM SIMILARITY MEASURE
    ANGELL, RC
    FREUND, GE
    WILLETT, P
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1983, 19 (04) : 255 - 261
  • [3] Research and implementation on automatic Chinese and cantonese spelling error correction
    Wu, Yan
    Tang, Yunting
    [J]. 2007 International Symposium on Computer Science & Technology, Proceedings, 2007, : 357 - 360
  • [4] Progress of combining trigram and Winnow in Thai OCR error correction
    Meknavin, S
    Kijsirikul, B
    Chotimongkol, A
    Nuttee, C
    [J]. APCCAS '98 - IEEE ASIA-PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS: MICROELECTRONICS AND INTEGRATING SYSTEMS, 1998, : 555 - 558
  • [5] THE USE OF TRIGRAM ANALYSIS FOR SPELLING ERROR-DETECTION
    ZAMORA, EM
    POLLOCK, JJ
    ZAMORA, A
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1981, 17 (06) : 305 - 316
  • [6] Global Attention Decoder for Chinese Spelling Error Correction
    Guo, Zhao
    Ni, Yuan
    Wang, Keqiang
    Zhu, Wei
    Xie, Guotong
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1419 - 1428
  • [7] AUTOMATIC SPELLING ERROR-DETECTION AND CORRECTION IN TEXTUAL DATABASES
    POLLOCK, JJ
    ZAMORA, A
    [J]. PROCEEDINGS OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1982, 19 : 236 - 238
  • [8] A method for chinese spelling error correction based on character shapes
    Lu, XL
    [J]. ICCC2004: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION VOL 1AND 2, 2004, : 1184 - 1189
  • [9] The method of chinese spelling error correction based on SLM and rules
    Zhang, YS
    Yu, SW
    Ma, MY
    [J]. ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 1568 - 1571
  • [10] Chinese Spelling Error Detection and Correction Based on Knowledge Graph
    Sun, Ximin
    Zhou, Jing
    Wang, Shuai
    Li, Huichao
    Jia, Jiangkai
    Zhu, Jiazheng
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS. DASFAA 2022 INTERNATIONAL WORKSHOPS, 2022, 13248 : 149 - 159