An Adaptive Harmony Search Part-of-Speech tagger for Square Hmong Corpus

Cited by: 1
Authors
Kang, Di-Wen [1]
Ye, Shao-Qiang [2, 4]
Ahmad, Sharifah Zarith Rahmah Syed [2 ]
Mo, Li-Ping [3 ]
Qin, Feng [2 ]
Zhou, Pan [1 ]
Affiliations
[1] Jishou Univ, Sch Commun & Elect Engn, Jishou 416000, Peoples R China
[2] Univ Teknol Malaysia, Fac Comp, Skudai 80310, Johor, Malaysia
[3] Jishou Univ, Coll Comp Sci & Engn, Jishou, Hunan, Peoples R China
[4] Hunan Appl Technol Univ, Coll Informat & Engn, Changde 415000, Hunan, Peoples R China
Keywords
Harmony Search Algorithm; Low-resource language; Optimization; Part-of-Speech tagging; Unknown words; ALGORITHM
DOI
10.21123/bsj.2024.9694
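Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [Natural Sciences, General]
Discipline classification codes
07; 0710; 09
Abstract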
Data-driven models perform poorly on part-of-speech tagging for the square Hmong language, for which only a low-resource corpus is available. This paper designs a weight evaluation function to reduce the influence of unknown words and proposes an improved harmony search algorithm that uses roulette and local evaluation strategies to handle square Hmong part-of-speech tagging. Experiments show that the average accuracy of the proposed model is 6% and 8% higher than that of the HMM and BiLSTM-CRF models, respectively. The average F1 score of the proposed model is likewise 6% and 3% higher than that of the HMM and BiLSTM-CRF models, respectively.
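The abstract names harmony search with a roulette strategy but gives no algorithmic detail, so the following is only a minimal sketch of a generic discrete harmony search applied to tag sequences, not the authors' method. The tag set, the `EMIT` and `TRANS` score tables, the flat `UNKNOWN_WEIGHT`, and all parameter values are hypothetical placeholders, and the paper's weight evaluation function and local evaluation strategy are not reproduced here.

```python
import random

TAGS = ["N", "V", "D"]  # hypothetical toy tag set

# Hypothetical toy scores for illustration only; not taken from the paper.
EMIT = {
    ("the", "D"): 0.9, ("dog", "N"): 0.8, ("dog", "V"): 0.1,
    ("barks", "V"): 0.7, ("barks", "N"): 0.2,
}
TRANS = {("D", "N"): 0.8, ("N", "V"): 0.7, ("D", "V"): 0.1, ("N", "N"): 0.2}
UNKNOWN_WEIGHT = 0.05  # assumed flat score for unseen (word, tag) or tag pairs


def score(words, tags):
    """Sum of emission and transition scores for one candidate tagging."""
    total = 0.0
    for i, (word, tag) in enumerate(zip(words, tags)):
        total += EMIT.get((word, tag), UNKNOWN_WEIGHT)
        if i > 0:
            total += TRANS.get((tags[i - 1], tag), UNKNOWN_WEIGHT)
    return total


def roulette(memory, fitness, rng):
    """Pick one stored harmony with probability proportional to its fitness."""
    return rng.choices(memory, weights=fitness, k=1)[0]


def harmony_search(words, hms=6, hmcr=0.9, par=0.2, iters=200, seed=0):
    rng = random.Random(seed)
    # Harmony memory: hms random tag sequences and their scores.
    memory = [[rng.choice(TAGS) for _ in words] for _ in range(hms)]
    fitness = [score(words, h) for h in memory]
    history = [max(fitness)]
    for _ in range(iters):
        new = []
        for i in range(len(words)):
            if rng.random() < hmcr:
                # Memory consideration via roulette-wheel selection.
                tag = roulette(memory, fitness, rng)[i]
                if rng.random() < par:
                    # Discrete "pitch adjustment": resample the tag.
                    tag = rng.choice(TAGS)
            else:
                tag = rng.choice(TAGS)  # random consideration
            new.append(tag)
        new_fit = score(words, new)
        worst = min(range(hms), key=fitness.__getitem__)
        if new_fit > fitness[worst]:
            # Replace the worst stored harmony when the new one is better.
            memory[worst], fitness[worst] = new, new_fit
        history.append(max(fitness))
    best = max(range(hms), key=fitness.__getitem__)
    return memory[best], fitness[best], history


if __name__ == "__main__":
    tags, fit, _ = harmony_search(["the", "dog", "barks"])
    print(tags, round(fit, 3))
```

Because a new harmony only ever replaces the worst member of the memory, the best score in `history` never decreases. Unseen words fall back to `UNKNOWN_WEIGHT`, which is a crude stand-in for whatever unknown-word weighting the paper actually defines.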
Pages: 622-632
Page count: 11