Part-of-speech tagging of Portuguese based on variable length Markov chains

被引:0
|
作者
Kepler, FN [1 ]
Finger, M [1 ]
机构
[1] USP, IME, Sao Paulo, Brazil
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tagging is the task of attributing to words in context in a text, their corresponding Part-of-Speech (PoS) class. In this work, we have employed Variable Length Markov Chains (VLMC) for tagging, in the hope of capturing long distance dependencies. We obtained one of the best PoS tagging of Portuguese, with a precision of 95.51%. More surprisingly, we did that with a total time of training and execution of less than 3 minutes for a corpus of almost 1 million words. However, long distance dependencies are not well captured by the VLMC tagger, and we investigate the reasons and limitations of the use of VLMCs. Future researches in statistical linguistics regarding long range dependencies should concentrate in other ways of solving this limitation.
引用
收藏
页码:248 / 251
页数:4
相关论文
共 50 条
  • [1] Comparing two Markov methods for part-of-speech tagging of Portuguese
    Kepler, Fabio N.
    Finger, Marcelo
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 482 - 491
  • [2] A Hidden Markov Model for Persian Part-of-Speech Tagging
    Okhovvat, Morteza
    Bidgoli, Behrouz Minaei
    [J]. WORLD CONFERENCE ON INFORMATION TECHNOLOGY (WCIT-2010), 2011, 3
  • [3] Corpus based part-of-speech tagging
    Lv, Chengyao
    Liu, Huihua
    Dong, Yuanxing
    Chen, Yunliang
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 647 - 654
  • [4] Part-of-speech tagging
    Martinez, Angel R.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (01): : 107 - 113
  • [5] Portuguese Part-of-Speech Tagging with Large Margin Structure Learning
    Fernandes, Eraldo R.
    Rodrigues, Irving M.
    Milidiu, Ruy L.
    [J]. 2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 25 - 30
  • [6] Named Entity Recognition Based On A Hidden Markov Model in Part-Of-Speech Tagging
    Ageishi, Ryohei
    Miura, Takao
    [J]. 2008 FIRST INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES, VOLS 1 AND 2, 2008, : 404 - 409
  • [7] Part-of-speech tagging based on hidden Markov model assuming joint independence
    Lee, SZ
    Tsujii, J
    Rim, HC
    [J]. 38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2000, : 263 - 269
  • [8] Phrase-based part-of-speech tagging
    Finch, Andrew
    Sumita, Eiichiro
    [J]. PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 215 - +
  • [9] Part-of-speech tagging for Swedish
    Prütz, K
    [J]. PARALLEL CORPORA, PARALLEL WORLDS, 2002, (43): : 201 - 206
  • [10] Portuguese Part-of-Speech Tagging Using Entropy Guided Transformation Learning
    dos Santos, Cicero Nogueira
    Milidiu, Ruy L.
    Renteria, Raul P.
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2008, 5190 : 143 - +