Transformation-based part-of-speech tagging for Serbian language

被引:0
|
作者
Delic, Vlado [1 ]
Secujski, Milan [1 ]
Kupusinac, Aleksandar [1 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Trg Dositeja Obradov 6, Novi Sad 21000, Serbia
关键词
natural language processing; POS tagging; transformation-based learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning techniques based on transformation rules have proven to be a viable alternative to stochastic tagging, achieving similar accuracy while having many advantages such as simplicity and better portability to other languages. However, data sparsity remains one of the greatest obstacles to tagging languages with complex morphology. Research in POS tagging for Serbian language described in this paper has resulted in several original ideas for improving tagging accuracy and overcoming problems related to data sparsity for highly inflected languages. The POS tagger for Serbian described in this paper achieves an error rate of 10.0% when trained on a previously annotated text corpus containing 190,000 words, which is comparable with results reported for some other languages with a similar level of inflection.
引用
收藏
页码:98 / +
页数:2
相关论文
共 50 条
  • [2] A robust transformation-based learning approach using ripple down rules for part-of-speech tagging
    Dat Quoc Nguyen
    Dai Quoc Nguyen
    Dang Duc Pham
    Son Bao Pham
    [J]. AI COMMUNICATIONS, 2016, 29 (03) : 409 - 422
  • [3] Part-of-Speech Tagging for Azerbaijani Language
    Mammadov, Samir
    Rustamov, Samir
    Mustafali, Ali
    Sadigov, Ziyaddin
    Mollayev, Rasim
    Mammadov, Zamir
    [J]. 2018 IEEE 12TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2018, : 40 - 45
  • [4] Kadazan Part of Speech Tagging using Transformation-Based Approach
    Alex, Marylyn
    Zakaria, Lailatul Qadri
    [J]. 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI 2013), 2013, 11 : 621 - 627
  • [5] Part-of-Speech (POS) Tagging for the Nyishi Language
    Siram, Joyir
    Sambyo, Koj
    Sarkar, Achyuth
    [J]. ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY AND COMPUTING, AICTC 2021, 2022, 392 : 191 - 199
  • [6] Corpus based part-of-speech tagging
    Lv, Chengyao
    Liu, Huihua
    Dong, Yuanxing
    Chen, Yunliang
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 647 - 654
  • [7] Part-of-speech tagging
    Martinez, Angel R.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (01): : 107 - 113
  • [8] Use of a genetic algorithm in Brill's transformation-based part-of-speech tagger
    Wilson, Garnett
    Heywood, Malcolm
    [J]. GECCO 2005: Genetic and Evolutionary Computation Conference, Vols 1 and 2, 2005, : 2067 - 2073
  • [9] Phrase-based part-of-speech tagging
    Finch, Andrew
    Sumita, Eiichiro
    [J]. PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 215 - +
  • [10] Part-of-speech tagging for Swedish
    Prütz, K
    [J]. PARALLEL CORPORA, PARALLEL WORLDS, 2002, (43): : 201 - 206