Using part-of-speech and word-sense disambiguation for boosting string-edit distance spelling correction

被引:0
|
作者
Ruch, P
Baud, R
Geissbühler, A
Lovis, C
Rassinoux, AM
Rivière, A
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We report on the design of a system for correcting spelling errors resulting in non-existent words. The system aims at improving edition of medical reports. Unlike traditional systems, both semantic and syntactic contexts are considered here. The system is organized along three steps. The first module is based on a context independent string-to-string edit distance calculus. The second module, based on the morpho-syntactic context attempts to rank more relevantly the data set provided by the first module, finally a third contextual module processes words with the same part-of-speech by applying some contextual word-sense disambiguation. Modules 2 and 3 are using both hand written rules and data-driven Markovian matrices. A final evaluation shows a significant improvement compared to context-free spelling correction.
引用
收藏
页码:249 / 257
页数:9
相关论文
共 3 条
  • [1] A Part-Of-Speech Lexicographic Encoding for an Evolutionary Word Sense Disambiguation Approach
    Azzini, Antonia
    Dragoni, Mauro
    Tettamanzi, Andrea G. B.
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, PT I, 2011, 6624 : 244 - 253
  • [2] PosWSD: Low-Resource Word Sense Disambiguation Model using Part Of Speech Information
    Chen, Yazhen
    Zhang, Jian
    He, Qipeng
    [J]. 2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 26 - 31
  • [3] SweetCoat-2D: Two-Dimensional Bangla Spelling Correction and Suggestion Using Levenshtein Edit Distance and String Matching Algorithm
    Hasan, Md Mahadi
    Mallick, David Dew
    Khan, Towhid
    Alam, Mustakin
    Mehedi, Md Humaion Kabir
    Rasel, Annajiat Alim
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,