Part-of-Speech Tagging for Azerbaijani Language

被引:0
|
作者
Mammadov, Samir [1 ]
Rustamov, Samir [2 ]
Mustafali, Ali [2 ]
Sadigov, Ziyaddin [2 ]
Mollayev, Rasim [2 ]
Mammadov, Zamir [3 ]
机构
[1] ADA Univ, ASAN Serv, Sch Informat Technol & Engn, Baku, Azerbaijan
[2] ADA Univ, Sch Informat Technol & Engn, Baku, Azerbaijan
[3] Azerbaijan State Pedag Univ, Dept Math, Baku, Azerbaijan
关键词
pos tagging; nip; hmm tagger; Azerbaijani pus tagger; Azerbaijani stemmer;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The paper describes the process of implementing a HMM PoS tagger for Azerbaijani language to tag given text based on the tagged corpus. Different methodologies for part-of speech tagging have been studied, and after analysis of these methodologies, Hidden Markov Model has been chosen for implementation. For Azerbaijani language, the paper demonstrates the steps taken to build a stemmer as an essential part of PoS tagger. A thorough examination of possible word groups and exceptions has been conducted and most of such cases have been successfully handled. As of now, a tagged corpus of large enough size does not exist for Azerbaijani language and it hinders the testing process of HMM tagger. For this reason, a small corpus has been created for testing. However, as HMM shows remarkable performance when run on English corpus, it is expected that it will produce decent results for Azerbaijani language too.
引用
收藏
页码:40 / 45
页数:6
相关论文
共 50 条
  • [1] Part-of-Speech (POS) Tagging for the Nyishi Language
    Siram, Joyir
    Sambyo, Koj
    Sarkar, Achyuth
    [J]. ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY AND COMPUTING, AICTC 2021, 2022, 392 : 191 - 199
  • [2] Part-of-speech tagging
    Martinez, Angel R.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (01): : 107 - 113
  • [3] Part-of-speech tagging for Swedish
    Prütz, K
    [J]. PARALLEL CORPORA, PARALLEL WORLDS, 2002, (43): : 201 - 206
  • [4] Transformation-based part-of-speech tagging for Serbian language
    Delic, Vlado
    Secujski, Milan
    Kupusinac, Aleksandar
    [J]. PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, MAN-MACHINE SYSTEMS AND CYBERNETICS (CIMMACS '09), 2009, : 98 - +
  • [5] A Deep Learning Approach for Part-of-Speech Tagging in Nepali Language
    Prabha, Greeshma
    Jyothsna, P., V
    Shahina, K. K.
    Premjith, B.
    Soman, K. P.
    [J]. 2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1132 - 1136
  • [6] Standards for automatic part-of-speech tagging
    Minnaja, DC
    [J]. 15TH INTERNATIONAL CONGRESS ON CYBERNETICS, PROCEEDINGS, 1999, : 745 - 750
  • [7] Part-of-speech tagging without training
    Bressan, S
    Indradjaja, LS
    [J]. INTELLIGENCE IN COMMUNICATION SYSTEMS, 2004, 3283 : 112 - 119
  • [8] Part-of-Speech Tagging by Latent Analogy
    Bellegarda, Jerome R.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (06) : 985 - 993
  • [9] Corpus based part-of-speech tagging
    Lv, Chengyao
    Liu, Huihua
    Dong, Yuanxing
    Chen, Yunliang
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 647 - 654
  • [10] Part-of-speech tagging and partial parsing
    Abney, S
    [J]. CORPUS-BASED METHODS IN LANGUAGE AND SPEECH PROCESSING, 1997, 2 : 118 - 136