Enhancing HMM-based POS tagger for Mizo language

Cited by: 0
Authors
Nunsanga, Morrel V. L. [1]
Pakray, Partha [2]
Devi, Toijam Sonalika [1]
Singh, L. Lolit Kr [3]
Affiliations
[1] Mizoram Univ, Dept Informat Technol, Mizoram 796004, India
[2] NIT Silchar, Dept CSE, Silchar, Assam, India
[3] Mizoram Univ, Dept ECE, Mizoram, India
Keywords
Hybrid POS tagger; rule-based POS tagger; N-gram tagger; Mizo POS tagger; Hidden Markov Model
DOI
10.3233/JIFS-224220
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Part-of-speech (POS) tagging is the process of assigning each word its appropriate part of speech. A tagger performs well only when it is backed by a substantial amount of well-organized data (corpora) and by significant research on the target language. Mizo is an under-resourced language that needs more research attention in computational linguistics. The limited availability of corpora and relevant literature makes assigning POS labels to Mizo text more difficult. This paper explores two methods for improving the Hidden Markov Model (HMM)-based POS tagger for the Mizo language. The proposed taggers are compared with the baseline HMM tagger and the N-gram taggers on a purpose-built Mizo corpus of 72,077 manually tagged tokens. The experimental results show that both proposed taggers improve on the HMM-based Mizo POS tagger, achieving 81.52% and 84.29% accuracy, respectively. A detailed performance analysis of the proposed hybrid tagger yields a weighted-average precision, recall, and F1-score of 83.09%, 77.88%, and 79.64%, respectively.
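For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of a generic bigram HMM tagger with Viterbi decoding. It is not the authors' implementation and does not reproduce either of the proposed enhancements. The function names (`train_hmm`, `log_prob`, `viterbi`), the add-one smoothing, the three-tag tagset, and the toy English sentences are all assumptions made for illustration; no Mizo data is used.

```python
import math
from collections import Counter, defaultdict

def train_hmm(tagged_sents):
    """Count tag transitions and word emissions from (word, tag) sentences."""
    trans = defaultdict(Counter)  # trans[prev_tag][tag]
    emit = defaultdict(Counter)   # emit[tag][word]
    vocab = set()
    for sent in tagged_sents:
        prev = "<s>"              # sentence-start pseudo-tag
        for word, tag in sent:
            trans[prev][tag] += 1
            emit[tag][word] += 1
            vocab.add(word)
            prev = tag
    return trans, emit, vocab

def log_prob(counter, key, n_outcomes):
    """Add-one smoothed log probability (the smoothing scheme is an assumption)."""
    return math.log((counter[key] + 1) / (sum(counter.values()) + n_outcomes))

def viterbi(words, trans, emit, vocab):
    """Return the most probable tag sequence for a non-empty list of words."""
    tags = list(emit)
    n_tags, n_words = len(tags), len(vocab) + 1  # +1 reserves mass for unseen words
    # Each column maps tag -> (best log score ending in that tag, backpointer).
    best = [{t: (log_prob(trans["<s>"], t, n_tags)
                 + log_prob(emit[t], words[0], n_words), None) for t in tags}]
    for w in words[1:]:
        col = {}
        for t in tags:
            score, prev = max(
                (best[-1][p][0] + log_prob(trans[p], t, n_tags), p) for p in tags
            )
            col[t] = (score + log_prob(emit[t], w, n_words), prev)
        best.append(col)
    # Follow backpointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for col in reversed(best[1:]):
        tag = col[tag][1]
        path.append(tag)
    return path[::-1]

# Toy training data (illustrative only, not from the Mizo corpus).
toy_corpus = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("a", "DET"), ("dog", "NOUN"), ("sleeps", "VERB")],
]
trans, emit, vocab = train_hmm(toy_corpus)
print(viterbi(["a", "cat", "barks"], trans, emit, vocab))
# ['DET', 'NOUN', 'VERB']
```

Working in log space avoids numerical underflow on longer sentences, and the smoothing keeps unseen words and unseen tag transitions from receiving zero probability, which matters most when the training corpus is small.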
Pages: 11725-11736
Number of pages: 12