Improving part of speech disambiguation rules by adding linguistic knowledge

被引:0
|
作者
Lindberg, N [1 ]
Eineborg, M
机构
[1] Royal Inst Technol, Dept Speech Mus & Hearing, Ctr Speech Technol, Stockholm, Sweden
[2] Stockholm Univ, Royal Inst Technol, Dept Comp Sci & Syst, Machine Learning Grp, S-10691 Stockholm, Sweden
来源
INDUCTIVE LOGIC PROGRAMMING | 1999年 / 1634卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper reports the ongoing work of producing a state of the art part of speech tagger for unedited Swedish text. Rules eliminating faulty tags have been induced using Progol. In previously reported experiments, almost no linguistically motivated background knowledge was used [5, 8]. Still, the result was rather promising (recall 97.7%, with a pending average ambiguity of 1.13 tags/word). Compared to the previous study, a much richer, more linguistically motivated, background knowledge has been supplied, consisting of examples of noun phrases, verb chains, auxiliary verbs, and sets of part of speech categories. The aim has been to create the background knowledge rapidly, without laborious hand-coding of linguistic knowledge. In addition to the new background knowledge, new, more expressive rule types have been induced for two part of speech categories and compared to the corresponding rules of the previous bottom-line experiment. The new rules perform considerably better, with a recall of 99.4% for the new rules, compared to 97.6% for the old rules. Precision was slightly better for the new rules.
引用
收藏
页码:186 / 197
页数:12
相关论文
共 50 条
  • [21] THE ACQUISITION OF LINGUISTIC KNOWLEDGE FROM VISIBLE SPEECH SPECTROGRAMS OF ORDINARY SPEECH - A PROPOSAL
    GUZY, JJ
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1982, 16 (03): : 327 - 332
  • [22] Unsupervised learning of part-of-speech guessing rules
    Mikheev, Andrei
    Natural Language Engineering, 1996, 2 (pt 2) : 111 - 136
  • [23] Ripple Down Rules for Part-of-Speech Tagging
    Dat Quoc Nguyen
    Dai Quoc Nguyen
    Son Bao Pham
    Dang Duc Pham
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT I, 2011, 6608 : 190 - 201
  • [24] POLEMICAL NOTES ON THE QUESTION OF NORMS + SPEECH-ACTS AND LINGUISTIC RULES
    JUHASZ, J
    ZEITSCHRIFT FUR GERMANISTISCHE LINGUISTIK, 1984, 12 (01): : 82 - 85
  • [25] Bidirectional bridge between neural networks and linguistic knowledge: Linguistic rule extraction and learning from linguistic rules
    Ishibuchi, H
    Nii, M
    Turksen, IB
    1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, 1998, : 1112 - 1117
  • [26] Vietnamese Part of Speech Tagging Based on Multi-category Words Disambiguation Model
    Zhao Chen
    Liu Yanchao
    Guo Jianyi
    Chen Wei
    Yan Xin
    Yu Zhengtao
    Chen Xiuqin
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 267 - 277
  • [27] PART OF SPEECH AND LINGUISTIC STRUCTURE IN TIBETAN - GERMAN - ZIMMERMANN,H
    DENWOOD, P
    BULLETIN OF THE SCHOOL OF ORIENTAL AND AFRICAN STUDIES-UNIVERSITY OF LONDON, 1982, 45 : 205 - 207
  • [28] Blog Classification: Adding Linguistic Knowledge to Improve the K-NN Algorithm
    Bayoudh, Ines
    Bechet, Nicolas
    Roche, Mathieu
    INTELLIGENT INFORMATION PROCESSING IV, 2008, : 68 - +
  • [29] Discovering the rules: Folk knowledge for improving GM partnerships
    Meerwarth, TL
    Briody, EK
    Kulkarni, DM
    HUMAN ORGANIZATION, 2005, 64 (03) : 286 - 302
  • [30] The Impact of Linguistic Complexity on Leadership in Online Q&A communities: Comparing Knowledge Shaping and Knowledge Adding
    Lu, Xuecong
    Jiang, Jinglu
    Head, Milena
    Yang, Junyi
    INFORMATION & MANAGEMENT, 2022, 59 (06)