An Improved Arabic Light Stemmer

被引:0
|
作者
Elrajubi, Osama Mohamed [1 ]
机构
[1] Misurata Univ, Dept Commun & Networks, Fac Informat Technol, Misurata, Libya
关键词
Arabic stemming; Arabic light stemmer; suffixes and prefixes stripping; Arabic retrieval;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
According to the desired level of analyzing words, Arabic stemming algorithms can be classified into stem-based (light stemming algorithms), and root-based algorithms. Light stemming algorithms only remove prefixes and suffixes from the words, while root-based algorithms remove prefixes, suffixes and infixes. There are several light stemmers for Arabic (Light1, Light2, Light3, Light8, and Light10), For retrieval information Light10 stemmer is out-performed the other light stemmers. In this paper, Arabic stemming algorithms are studied. And, literature review of Arabic stemmers is discussed. In addition, a new Arabic light stemmer was proposed and Implemented. The main step of the light stemmer is removing the prefixes and suffixes of the words. And because this step causes changing of the meaning of some words, many other steps are designed and implemented in the proposed stemmer. The proposed stemmer and Light10 stemmer were tested on the same Arabic data which is developed in this work. The accuracy rate of Light10 stemmer was 66%, while the accuracy rate of the proposed stemmer was 88.25 %. The reasons for incorrect stemming of the proposed stemmer are mentioned.
引用
收藏
页码:33 / 38
页数:6
相关论文
共 50 条
  • [1] ARABIC LIGHT STEMMER (ARS)
    Al-Omari, Asma
    Abuata, Belal
    [J]. JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2014, 9 (06): : 702 - 716
  • [2] Conditional Arabic Light Stemmer: CondLight
    Al-Lahham, Yaser
    Matarneh, Khawlah
    Hassan, Mohammad
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (3A) : 559 - 564
  • [3] A novel robust Arabic light stemmer
    Abainia, Kheireddine
    Ouamour, Siham
    Sayoud, Halim
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2017, 29 (03) : 557 - 573
  • [4] Arabic Light Stemming: A Comparative Study between P-Stemmer, Khoja Stemmer, and Light10 Stemmer
    Kanan, Tarek
    Sadaqa, Odai
    Almhirat, Ashraf
    Kanan, Emran
    [J]. 2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2019, : 511 - 515
  • [5] Arabic light-based stemming: a comparative study among ligh10 stemmer, P-stemmer, and Conditional light stemmer
    Hussien, Sabria Mohammed
    Aburagheef, Hazim J.
    [J]. PROCEEDING OF 2021 2ND INFORMATION TECHNOLOGY TO ENHANCE E-LEARNING AND OTHER APPLICATION (IT-ELA 2021), 2021, : 131 - 135
  • [6] A New Enhanced Arabic Light Stemmer for IR in Medical Documents
    Al-Khatib, Ra'ed M.
    Zerrouki, Taha
    Abu Shquier, Mohammed M.
    Balla, Amar
    Al-Khateeb, Asef
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (01): : 1255 - 1269
  • [7] Arabic light-based stemmer using new rules
    Alshalabi, Hamood
    Tiun, Sabrina
    Omar, Nazlia
    AL-Aswadi, Fatima N.
    Alezabi, Kamal Ali
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (09) : 6635 - 6642
  • [8] P-Stemmer or NLTK Stemmer for Arabic Text Classification?
    Elbes, Mohammed
    Aldajah, Amal
    Sadaqa, Odai
    [J]. 2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2019, : 516 - 520
  • [9] Arabic Stemmer Based Big Data
    Madani, Youness
    Erritali, Mohammed
    Bengourram, Jamaa
    [J]. JOURNAL OF ELECTRONIC COMMERCE IN ORGANIZATIONS, 2018, 16 (01) : 17 - 28
  • [10] Impact of Stemmer on Arabic Text Retrieval
    Atwan, Jaffar
    Mohd, Masnizah
    Kanaan, Ghassan
    Bsoul, Qusay
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2014, 2014, 8870 : 314 - 326