Comparative Study of Vietnamese Part-of-Speech Tagging Tools

被引:0
|
作者
Luyl-Da Quach [1 ]
Dat Do Thanh [1 ]
Duc Chung Tran [2 ]
Hassan, Mohd Fadzil [3 ]
机构
[1] FPT Univ, Software Engn Dept, Can Tho, Vietnam
[2] FPT Univ, Comp Fundamental Dept, Hanoi, Vietnam
[3] Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar, Perak, Malaysia
关键词
Vietnamese language processing; part-of-speech tagging; Vietnamese; natural language processing;
D O I
10.1109/icsgrc49013.2020.9232564
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vietnamese part-of-speech tagging is one of the most fundamental practices in Vietnamese language processing. Unfortunately, no attempt has been made to empirically compare different Vietnamese part-of-speech tagging software. Therefore, in this paper, the authors experiment upon several Vietnamese part-of-speech tagging software such as VnTagger, RDRPOSTagger (Java Version), JvnTextPro, VNCoreNLP in terms of accuracy, consistency and computational time. In addition, the brief descriptions of the models are discussed in detail. The results help researchers comprehend the models' strengths and weaknesses. The tools are tested on 4 different data sets of number of sentences and different word types such as date, number, special characters, connected characters, double words, compound words, proper names, etc ... The results show that the accuracy of the JvnTextPro tool is high and stable with an accuracy of 80.08 to 97.84%, and the RDPRPOSTagger tool has faster processing time and relatively good accuracy from 88.41 to 96.84%.
引用
收藏
页码:197 / 202
页数:6
相关论文
共 50 条
  • [1] Dual Decomposition for Vietnamese Part-of-Speech Tagging
    Bach, Ngo Xuan
    Hiraishi, Kunihiko
    Le Minh, Nguyen
    Shimazu, Akira
    [J]. 17TH INTERNATIONAL CONFERENCE IN KNOWLEDGE BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS - KES2013, 2013, 22 : 123 - 131
  • [2] A Comparative Study on Different Techniques for Thai Part-of-Speech Tagging
    Pailai, Jaruwat
    Kongkachandra, Rachada
    Supnithi, Thepchai
    Boonkwan, Prachya
    [J]. 2013 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2013,
  • [3] Part-of-speech tagging
    Martinez, Angel R.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (01): : 107 - 113
  • [4] A Comparative Study on the Effectiveness of Part-of-Speech Tagging Techniques on Bug Reports
    Tian, Yuan
    Lo, David
    [J]. 2015 22ND INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), 2015, : 570 - 574
  • [5] Part-of-speech tagging for Swedish
    Prütz, K
    [J]. PARALLEL CORPORA, PARALLEL WORLDS, 2002, (43): : 201 - 206
  • [6] Part-of-Speech Induction for Vietnamese
    Phuong Le-Hong
    Thi Minh Huyen Nguyen
    [J]. KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2013), VOL 2, 2014, 245 : 261 - 272
  • [7] Standards for automatic part-of-speech tagging
    Minnaja, DC
    [J]. 15TH INTERNATIONAL CONGRESS ON CYBERNETICS, PROCEEDINGS, 1999, : 745 - 750
  • [8] A CONNECTIONIST APPROACH TO PART-OF-SPEECH TAGGING
    Zamora-Martinez, F.
    Castro-Bleda, M. J.
    Espana-Boquera, S.
    Tortajada, Salvador
    Aibar, P.
    [J]. IJCCI 2009: PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2009, : 421 - +
  • [9] The Application of CRFs in Part-of-Speech Tagging
    Zhang Xiaofei
    Huang Heyan
    Zhang Liang
    [J]. 2009 INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS, VOL 2, PROCEEDINGS, 2009, : 347 - +
  • [10] Part-of-Speech Tagging by Latent Analogy
    Bellegarda, Jerome R.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (06) : 985 - 993