Bayesian reinforcement for a probabilistic neural net Part-of-Speech tagger

被引:0
|
作者
Maragoudakis, M [1 ]
Ganchev, T [1 ]
Fakotakis, N [1 ]
机构
[1] Univ Patras, Intelligent Syst Grp, Patras 26500, Greece
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The present paper introduces a novel stochastic model for Part-of-Speech tagging of natural language texts. While previous statistical approaches, such as Hidden Markov Models, are based on theoretical assumptions that are not always met in natural language, we propose a methodology which incorporates fundamental elements of two distinct machine learning disciplines. We make use of Bayesian knowledge representation to provide a robust classifier, namely a Probabilistic Neural Network one, with additional context information in order to better infer on the correct Part-of-Speech label. As for training material, we make use of minimal linguistic information, i.e. only a small lexicon which contains the words that belong to non-declinable POS categories and closed-class words. Such minimal information is augmented by statistical parameters generated by Bayesian networks learning and the outcome is fed into the Probabilistic Neural Network classifier for the task of Part-of-Speech tagging. Experimental results portray satisfactory performance, in terms of 3.5%-4% error rate.
引用
收藏
页码:137 / 145
页数:9
相关论文
共 50 条
  • [21] Part-of-Speech Tagger Based on Maximum Entropy Model
    Huang Heyan
    Zhang Xiaofei
    [J]. 2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 3, 2009, : 26 - 29
  • [22] A morphology-system and part-of-speech tagger for German
    Lezius, W
    Rapp, R
    Wettler, M
    [J]. NATURAL LANGUAGE PROCESSING AND SPEECH TECHNOLOGY: RESULTS OF THE 3RD KONVENS CONFERENCE, 1996, : 369 - 378
  • [23] Adding Morphological Information to a Connectionist Part-Of-Speech Tagger
    Zamora-Martinez, Francisco
    Jose Castro-Bleda, Maria
    Espana-Boquera, Salvador
    Tortajada-Velert, Salvador
    [J]. CURRENT TOPICS IN ARTIFICIAL INTELLIGENCE, 2010, 5988 : 191 - +
  • [24] Part-of-Speech Tagger for Malay Social Media Texts
    Ariffin, Siti Noor Allia Noor
    Tiun, Sabrina
    [J]. GEMA ONLINE JOURNAL OF LANGUAGE STUDIES, 2018, 18 (04): : 124 - 142
  • [25] Building an Indonesian Rule-Based Part-of-Speech Tagger
    Rashel, Fam
    Luthfi, Andry
    Dinakaramani, Arawinda
    Manurung, Ruli
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2014), 2014, : 70 - 73
  • [26] Development of a multilingual parallel corpus and a part-of-speech tagger for Afrikaans
    Trushkina, Julia
    [J]. Intelligent Information Processing III, 2006, 228 : 453 - 462
  • [27] A Supervised Part-Of-Speech Tagger for the Greek Language of the Social Web
    Nikiforos, Maria Nefeli
    Kermanidis, Katia Lida
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3861 - 3867
  • [28] Arabic part-of-speech tagger based support vectors machines
    Yousif, Jabar Hassan
    Sembok, Tengku Mohd Tengku
    [J]. INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 2084 - +
  • [29] Choosing a Spanish Part-of-Speech tagger for a lexically sensitive task
    Escartin, Carla Parra
    Alonso, Hector Martinez
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2015, (54): : 29 - 36
  • [30] SoMeWeTa: A Part-of-Speech Tagger for German Social Media and Web Texts
    Proisl, Thomas
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 665 - 670