Portuguese Part-of-Speech Tagging with Large Margin Structure Learning

Cited by: 1
Authors
Fernandes, Eraldo R. [1 ]
Rodrigues, Irving M. [1 ]
Milidiu, Ruy L. [2 ]
Affiliations
[1] FACOM UFMS, Campo Grande, Brazil
[2] DU PUC Rio, Rio De Janeiro, Brazil
DOI
10.1109/BRACIS.2014.16
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Part-of-Speech Tagging is a fundamental task in many Natural Language Processing systems. It consists of identifying the syntactic category, i.e., the part of speech, of each word in a sentence. Although the current state-of-the-art accuracy for this task is around 97%, any improvement has an immediate impact on more complex tasks, such as Parsing, Semantic Role Labeling, and Information Extraction. Thus, the task remains worth exploring. In this paper, we introduce a part-of-speech tagger based on the Structure Learning framework that reduces the smallest known error on the Portuguese Mac-Morpho corpus by 7.8%. We also apply our tagger to a recently revised version of Mac-Morpho. On this revised version, our system's accuracy is competitive with that of a semi-supervised Neural Network trained on Mac-Morpho plus a very large unannotated corpus. Additionally, our system is simpler than previous systems and uses a very limited feature set. It employs a Large Margin training criterion to derive a structure predictor that is more robust on unseen data.
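To make the idea of large margin structure learning for tagging concrete, here is a minimal sketch in Python. It is not the authors' system: it assumes a plain (unaveraged) structured perceptron with loss-augmented Viterbi decoding and a Hamming loss, and the feature templates, tag set, toy sentences, and function names (`features`, `decode`, `train`) are all hypothetical choices made for illustration.

```python
from collections import defaultdict

START = "<s>"  # hypothetical start-of-sentence marker


def features(words, i, prev_tag, tag):
    """Hypothetical minimal feature set: word identity, 2-char suffix, tag bigram."""
    w = words[i].lower()
    return [("word", w, tag), ("suffix", w[-2:], tag), ("bigram", prev_tag, tag)]


def decode(words, tags, weights, gold=None, margin=0.0):
    """Viterbi decoding over tag sequences for a non-empty sentence.

    If `gold` is given, every position whose tag differs from the gold tag
    earns an extra `margin` (loss-augmented decoding with Hamming loss).
    """
    n = len(words)
    score = [{} for _ in range(n)]
    back = [{} for _ in range(n)]
    for i in range(n):
        prev_tags = [START] if i == 0 else tags
        for t in tags:
            cost = margin if gold is not None and gold[i] != t else 0.0
            best_p, best_s = None, float("-inf")
            for p in prev_tags:
                s = 0.0 if i == 0 else score[i - 1][p]
                s += sum(weights.get(f, 0.0) for f in features(words, i, p, t))
                if s > best_s:
                    best_p, best_s = p, s
            score[i][t] = best_s + cost
            back[i][t] = best_p
    # Follow back-pointers from the best final tag.
    path = [max(tags, key=lambda t: score[n - 1][t])]
    for i in range(n - 1, 0, -1):
        path.append(back[i][path[-1]])
    path.reverse()
    return path


def train(data, tags, epochs=20, margin=1.0):
    """Perceptron updates against the loss-augmented prediction."""
    weights = defaultdict(float)
    for _ in range(epochs):
        mistakes = 0
        for words, gold in data:
            pred = decode(words, tags, weights, gold=gold, margin=margin)
            if pred == gold:
                continue
            mistakes += 1
            for i in range(len(words)):
                gold_prev = gold[i - 1] if i else START
                pred_prev = pred[i - 1] if i else START
                for f in features(words, i, gold_prev, gold[i]):
                    weights[f] += 1.0  # reward features of the gold sequence
                for f in features(words, i, pred_prev, pred[i]):
                    weights[f] -= 1.0  # penalize features of the wrong sequence
        if mistakes == 0:  # gold wins by the required margin on every sentence
            break
    return weights


# Toy, hand-made Portuguese examples with a tiny hypothetical tag set.
TAGS = ["ART", "N", "V"]
DATA = [
    (["O", "gato", "dorme"], ["ART", "N", "V"]),
    (["A", "menina", "canta"], ["ART", "N", "V"]),
    (["Gatos", "comem", "peixe"], ["N", "V", "N"]),
]

weights = train(DATA, TAGS)
print(decode(["A", "menina", "dorme"], TAGS, weights))
```

The only difference from an ordinary structured perceptron is the `margin` term in `decode`: during training, wrong tags are given a head start, so the gold sequence must outscore each alternative by at least the number of positions in which they differ. This is one simple way to realize a large margin criterion; the paper's actual training procedure and feature set may differ.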
Pages: 25-30
Page count: 6
Related papers
  • [1] Portuguese Part-of-Speech Tagging Using Entropy Guided Transformation Learning
    dos Santos, Cicero Nogueira
    Milidiu, Ruy L.
    Renteria, Raul P.
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2008, 5190 : 143 - +
  • [2] Part-of-speech tagging
    Martinez, Angel R.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (01): 107 - 113
  • [3] Part-of-Speech Tagging Using Multiview Learning
    Lim, Kyungtae
    Park, Jungyeul
    [J]. IEEE ACCESS, 2020, 8 : 195184 - 195196
  • [4] Comparing two Markov methods for part-of-speech tagging of Portuguese
    Kepler, Fabio N.
    Finger, Marcelo
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 482 - 491
  • [5] Part-of-speech tagging for Swedish
    Prütz, K
    [J]. PARALLEL CORPORA, PARALLEL WORLDS, 2002, (43): 201 - 206
  • [6] Improving Part-of-Speech Tagging by Meta-learning
    Kobylinski, Lukasz
    Wasiluk, Michal
    Wojdyga, Grzegorz
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 144 - 152
  • [7] Reducing Confusion in Active Learning for Part-Of-Speech Tagging
    Chaudhary, Aditi
    Anastasopoulos, Antonios
    Sheikh, Zaid
    Neubig, Graham
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 1 - 16
  • [8] Revision learning and its application to part-of-speech tagging
    Nakagawa, T
    Kudo, T
    Matsumoto, Y
    [J]. 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002: 497 - 504
  • [9] Deep Learning Model for Tamil Part-of-Speech Tagging
    Visuwalingam, Hemakasiny
    Sakuntharaj, Ratnasingam
    Alawatugoda, Janaka
    Ragel, Roshan
    [J]. COMPUTER JOURNAL, 2024, 67 (08): 2633 - 2642
  • [10] Part-of-speech tagging of Portuguese based on variable length Markov chains
    Kepler, FN
    Finger, M
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2006, 3960 : 248 - 251