POS Tagging for Arabic Text Using Bee Colony Algorithm

被引:3
|
作者
Alhasan, Ahmad [1 ]
Al-Taani, Ahmad T. [1 ]
机构
[1] Yarmouk Univ, Dept Comp Sci, Irbid, Jordan
来源
关键词
Text Summarization; POS Tagging; Question Answering; Bee Colony Algorithm; Meta-heuristics Optimization Algorithms;
D O I
10.1016/j.procs.2018.10.471
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Part-of-Speech (POS) Tagging is the process of automatically determining the proper grammatical tag or syntactic category of a word depending on a its context. POS Tagging is an essential step in most Natural Language Processing (NLP) applications such as text summarization, question answering, information extraction and information retrieval. In this study, we propose an efficient tagging approach for the Arabic language using Bee Colony Optimization algorithm. The problem is represented as a graph and a novel technique is proposed to assign scores to possible tags of a sentence, then the bees find the best solution path. The proposed approach is evaluated using KALIMAT corpus which consists of 18M words. Experimental results showed that the proposed approach achieved 98.2% of accuracy compared to 98%, 97.4% and 94.6% for Hybrid, Hidden Markov Model and Rule-Based methods respectively. Furthermore, the proposed approach determined all the tags presented in the corpus while the mentioned approaches can identify only three tags. (C) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:158 / 165
页数:8
相关论文
共 50 条
  • [1] Arabic POS Tagging
    Mohamed, Emad
    Kuebler, Sandra
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [2] Utilizing Artificial Bee Colony Algorithm as Feature Selection Method in Arabic Text Classification
    Hijazi, Musab Mustafa
    Zeki, Akram
    Ismail, Amelia
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 536 - 547
  • [3] POS-tagging arabic texts: A novel approach based on ant colony
    Ben Othmane, Chiraz Zribi
    Ben Fraj, Feriel
    Limam, Ichraf
    [J]. NATURAL LANGUAGE ENGINEERING, 2017, 23 (03) : 419 - 439
  • [4] Improving Arabic Tokenization and POS Tagging Using Morphological Analyzer
    Nawar, Michael N.
    [J]. ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, AMLTA 2014, 2014, 488 : 46 - 53
  • [5] Text Document Summarization Using POS tagging for Kannada Text Documents
    Jayashree, R.
    Anami, Basavaraj S.
    Poornima, B. K.
    [J]. 2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 423 - 426
  • [6] Arabic Text Classification Using Hybrid Feature Selection Method Using Chi-Square Binary Artificial Bee Colony Algorithm
    Hijazi, Musab
    Zeki, Akram
    Ismail, Amelia
    [J]. INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2021, 16 (01): : 213 - 228
  • [7] Developing a tagset for automated POS tagging in Arabic
    Centre for Computational Intelligence , School of Computing, De Montfort University, The Gateway, Leicester, United Kingdom
    [J]. WSEAS Trans. Comput., 2006, 11 (2787-2792):
  • [8] PoS Tagging for Classical Chinese Text
    Chiu, Tin-shing
    Lu, Qin
    Xu, Jian
    Xiong, Dan
    Lo, Fengju
    [J]. CHINESE LEXICAL SEMANTICS (CLSW 2015), 2015, 9332 : 448 - 456
  • [9] A BERT Based Approach for Arabic POS Tagging
    Saidi, Rakia
    Jarray, Fethi
    Mansour, Mahmud
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 311 - 321
  • [10] Joint POS Tagging and Text Normalization for Informal Text
    Li, Chen
    Liu, Yang
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1263 - 1269