Improving part of speech disambiguation rules by adding linguistic knowledge

被引:0
|
作者
Lindberg, N [1 ]
Eineborg, M
机构
[1] Royal Inst Technol, Dept Speech Mus & Hearing, Ctr Speech Technol, Stockholm, Sweden
[2] Stockholm Univ, Royal Inst Technol, Dept Comp Sci & Syst, Machine Learning Grp, S-10691 Stockholm, Sweden
来源
INDUCTIVE LOGIC PROGRAMMING | 1999年 / 1634卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper reports the ongoing work of producing a state of the art part of speech tagger for unedited Swedish text. Rules eliminating faulty tags have been induced using Progol. In previously reported experiments, almost no linguistically motivated background knowledge was used [5, 8]. Still, the result was rather promising (recall 97.7%, with a pending average ambiguity of 1.13 tags/word). Compared to the previous study, a much richer, more linguistically motivated, background knowledge has been supplied, consisting of examples of noun phrases, verb chains, auxiliary verbs, and sets of part of speech categories. The aim has been to create the background knowledge rapidly, without laborious hand-coding of linguistic knowledge. In addition to the new background knowledge, new, more expressive rule types have been induced for two part of speech categories and compared to the corresponding rules of the previous bottom-line experiment. The new rules perform considerably better, with a recall of 99.4% for the new rules, compared to 97.6% for the old rules. Precision was slightly better for the new rules.
引用
收藏
页码:186 / 197
页数:12
相关论文
共 50 条
  • [41] NL-processor and linguistic knowledge base in a speech recognition system
    Malkovsky, MG
    Subbotin, AV
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 237 - 242
  • [42] Data mining method to acquire part of speech rules in Chinese text
    Li, Xiaoli
    Shi, Zhongzhi
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2000, 37 (12): : 1409 - 1414
  • [43] Improving drivers' knowledge of road rules using digital games
    Li, Qing
    Tay, Richard
    ACCIDENT ANALYSIS AND PREVENTION, 2014, 65 : 8 - 10
  • [44] The Construction of Sentiment Lexicon Based on Context-Dependent Part-of-Speech Chunks for Semantic Disambiguation
    Yin, Fulian
    Wang, Yanyan
    Liu, Jianbo
    Lin, Lisha
    IEEE ACCESS, 2020, 8 (08): : 63359 - 63367
  • [45] Automatic information extraction from texts with inference and linguistic knowledge acquisition rules
    de Araujo, Denis A.
    Rigo, Sandro J.
    Muller, Carolina
    Chishman, Rove
    2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY - WORKSHOPS (WI-IAT), VOL 3, 2013, : 151 - 154
  • [46] ACCESSING CHILDRENS KNOWLEDGE OF SOCIOLINGUISTIC RULES FOR SPEECH-THERAPY LESSONS
    RIPICH, DN
    PANAGOS, JM
    JOURNAL OF SPEECH AND HEARING DISORDERS, 1985, 50 (04): : 335 - 346
  • [47] Rules for Improving Pharmacotherapy in Older Adult Patients: Part 1 (Rules 1-5)
    Wooten, James M.
    SOUTHERN MEDICAL JOURNAL, 2015, 108 (02) : 97 - 104
  • [48] Improving Subjectivity Detection for Spanish Texts using Subjectivity Word Sense Disambiguation based on Knowledge
    Sobrevilla Cabezudo, Marco Antonio
    La Serna Palomino, Nora
    Maguina Perez, Rolando
    2015 XLI LATIN AMERICAN COMPUTING CONFERENCE (CLEI), 2015, : 269 - 275
  • [49] Rules for Improving Pharmacotherapy in Older Adult Patients: Part 2 (Rules 6-10)
    Wooten, James M.
    SOUTHERN MEDICAL JOURNAL, 2015, 108 (03) : 145 - 150
  • [50] Improving Valence Prediction in Dimensional Speech Emotion Recognition Using Linguistic Information
    Atmaja, Bagus Tris
    Akagi, Masato
    PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020), 2020, : 166 - 171