Enhancing HMM-based POS tagger for Mizo language

被引:0
|
作者
Nunsanga, Morrel V. L. [1 ]
Pakray, Partha [2 ]
Devi, Toijam Sonalika [1 ]
Singh, L. Lolit Kr [3 ]
机构
[1] Mizoram Univ, Dept Informat Technol, Mizoram 796004, India
[2] NIT Silchar, Dept CSE, Silchar, Assam, India
[3] Mizoram Univ, Dept ECE, Mizoram, India
关键词
Hybrid POS tagger; rule-based POS tagger; N-gram tagger; Mizo POS tagger; Hidden Markov Model;
D O I
10.3233/JIFS-224220
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The process of associating words with their relevant parts of speech is known as part-of-speech (POS) tagging. It takes a substantial amount of well-organized data or corpora and significant target language research to obtain good performance for a tagger. Mizo is a language that needs more research attention in computational linguistics due to its under-resourced nature. The limited availability of corpora and relevant literature adds complexity to the task of assigning POS labels to Mizo text. This paper explores two methods to potentially improve the Hidden Markov Model (HMM)-based POS tagger for the Mizo language. The proposed taggers are compared with the baseline HMM tagger and the N-gram taggers on the designed Mizo corpus, which consists of 72,077 manually tagged tokens. The experimental results proved that the two proposed taggers enhanced the HMM-based Mizo POS tagger, achieving 81.52% and 84.29% accuracy, respectively. Moreover, a comprehensive analysis of the performance of the suggested hybrid tagger was conducted, yielding a weighted average precision, recall, and F1-score of 83.09%, 77.88%, and 79.64% respectively.
引用
收藏
页码:11725 / 11736
页数:12
相关论文
共 50 条
  • [41] Enhancing the accuracy of HMM-based conserved pathway prediction using global correspondence scores
    Qian, Xiaoning
    Sahraeian, Sayed Mohammad Ebrahim
    Yoon, Byung-Jun
    BMC BIOINFORMATICS, 2011, 12
  • [42] HMM-Based Vietnamese Speech Synthesis
    Trinh Quoc Son
    2015 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2015, : 349 - 353
  • [43] HMM-based monitoring of packet channels
    Rossi, PS
    Palmieri, F
    Iannello, G
    HIGH SPEED NETWORKS AND MULTIMEDIA COMMUNICATIONS, PROCEEDINGS, 2004, 3079 : 144 - 154
  • [44] Czech HMM-Based Speech Synthesis
    Hanzlicek, Zdenek
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 291 - 298
  • [45] Robustness of HMM-based Speech Synthesis
    Yamagishi, Junichi
    Ling, Zhenhua
    King, Simon
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 581 - 584
  • [46] An HMM-based speech recognition IC
    Han, W
    Hon, KW
    Chan, CF
    Lee, T
    Choy, CS
    Pun, KP
    Ching, PC
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 744 - 747
  • [47] Comparison of different POS tagging techniques (n-gram, HMM and Brill's tagger) for Bangla
    Hasan, Fahim Muhammad
    UzZaman, Naushad
    Khan, Murnit
    ADVANCES AND INNOVATIONS IN SYSTEMS, COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2007, : 121 - 126
  • [48] HMM-based multiresolution image segmentation
    Lu, JL
    Carin, L
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3357 - 3360
  • [49] EVALUATION OF HMM-BASED LAUGHTER SYNTHESIS
    Urbain, Jerome
    Cakmak, Huseyin
    Dutoit, Thierry
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7835 - 7839
  • [50] A design for HMM-based SAR ATR
    Kottke, DP
    Fiore, PD
    Brown, KL
    Fwu, JK
    ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY V, 1998, 3370 : 541 - 551