A New Enhanced Arabic Light Stemmer for IR in Medical Documents

被引:2
|
作者
Al-Khatib, Ra'ed M. [1 ]
Zerrouki, Taha [2 ]
Abu Shquier, Mohammed M. [3 ]
Balla, Amar [4 ]
Al-Khateeb, Asef [5 ]
机构
[1] Yarmouk Univ, Dept Comp Sci, Irbid 21163, Jordan
[2] Bouira Univ, Fac Sci & Appl Sci, Bouira, Algeria
[3] Jerash Univ, Fac Comp Sci & Informat Technol, Jerash, Jordan
[4] Ecole Natl Super Informat ESI, Algiers, Algeria
[5] Imam Mohammad Ibn Saud Islamic Univ IMSIU, Dept Comp Sci, Coll Sharia & Islamic Studies Al Ahsaa, Riyadh, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 68卷 / 01期
关键词
Machine learning; information retrieval systems; medical documents; stemming algorithms; arabic light stemmer; natural language processing;
D O I
10.32604/cmc.2021.016155
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a new enhanced Arabic stemming algorithm for solving the information retrieval problem, especially in medical documents. Our proposed algorithm is a light stemming algorithm for extracting stems and roots from the input data. One of the main challenges facing the light stemming algorithm is cutting off the input word, to extract the initial segments. When initiating the light stemmer with strong initial segments, the final extracting stems and roots will be more accurate. Therefore, a new enhanced segmentation based on deploying the Direct Acyclic Graph (DAG) model is utilized. In addition to extracting the powerful initial segments, the main two procedures (i.e., stems and roots extraction), should be also reinforced with more efficient operators to improve the final outputs. To validate the proposed enhanced stemmer, four data sets are used. The achieved stems and roots resulted from our proposed light stemmer are compared with the results obtained from five other well-known Arabic light stemmers using the same data sets. This evaluation process proved that the proposed enhanced stemmer outperformed other comparative stemmers.
引用
收藏
页码:1255 / 1269
页数:15
相关论文
共 50 条
  • [1] ARABIC LIGHT STEMMER (ARS)
    Al-Omari, Asma
    Abuata, Belal
    [J]. JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2014, 9 (06): : 702 - 716
  • [2] An Improved Arabic Light Stemmer
    Elrajubi, Osama Mohamed
    [J]. 2013 INTERNATIONAL CONFERENCE ON RESEARCH AND INNOVATION IN INFORMATION SYSTEMS (ICRIIS), 2013, : 33 - 38
  • [3] Conditional Arabic Light Stemmer: CondLight
    Al-Lahham, Yaser
    Matarneh, Khawlah
    Hassan, Mohammad
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (3A) : 559 - 564
  • [4] A novel robust Arabic light stemmer
    Abainia, Kheireddine
    Ouamour, Siham
    Sayoud, Halim
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2017, 29 (03) : 557 - 573
  • [5] Arabic light-based stemmer using new rules
    Alshalabi, Hamood
    Tiun, Sabrina
    Omar, Nazlia
    AL-Aswadi, Fatima N.
    Alezabi, Kamal Ali
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (09) : 6635 - 6642
  • [6] Arabic Light Stemming: A Comparative Study between P-Stemmer, Khoja Stemmer, and Light10 Stemmer
    Kanan, Tarek
    Sadaqa, Odai
    Almhirat, Ashraf
    Kanan, Emran
    [J]. 2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2019, : 511 - 515
  • [7] Arabic light-based stemming: a comparative study among ligh10 stemmer, P-stemmer, and Conditional light stemmer
    Hussien, Sabria Mohammed
    Aburagheef, Hazim J.
    [J]. PROCEEDING OF 2021 2ND INFORMATION TECHNOLOGY TO ENHANCE E-LEARNING AND OTHER APPLICATION (IT-ELA 2021), 2021, : 131 - 135
  • [8] Tashaphyne0.4: a new arabic light stemmer based on rhyzome modeling approach
    Al-Khatib, Ra'ed M.
    Zerrouki, Taha
    Abu Shquier, Mohammed M.
    Balla, Amar
    [J]. INFORMATION RETRIEVAL JOURNAL, 2023, 26 (1-2):
  • [9] Tashaphyne0.4: a new arabic light stemmer based on rhyzome modeling approach
    Ra’ed M. Al-Khatib
    Taha Zerrouki
    Mohammed M. Abu Shquier
    Amar Balla
    [J]. Information Retrieval Journal, 2023, 26
  • [10] BPR algorithm: New broken plural rules for an Arabic stemmer
    Alshalabi, Hamood
    Tiun, Sabrina
    Omar, Nazlia
    Anaam, Elham abdulwahab
    Saif, Yazid
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2022, 23 (03) : 363 - 371