ARABIC LIGHT STEMMER (ARS)

被引:0
|
作者
Al-Omari, Asma [1 ]
Abuata, Belal [1 ]
机构
[1] Yarmouk Univ, Fac Informat Technol & Comp Sci, CIS Dept, Irbid 21163, Jordan
来源
关键词
Arabic stemming; Light Arabic stemming; Rule based stemming; Stemming errors;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Stemming is a main step used to process textual data. It is usually used in several types of applications such as: text mining, information retrieval (IR), and natural language processing (NLP). A major task in stemming is to standardize words; which can be achieved by reducing each word to its base (root or stem). Arabic stemming is not an easy task. Unlike other languages, Arabic language is a highly inflected language, since it uses many inflectional forms. Researchers are divided on the benefit of using stemming in fields of IR, NLP... etc., since in Arabic the morphological variants of a certain word are not always semantically related. The aim of this paper is to design and implement a new Arabic light stemmer (ARS) which is not based on Arabic root patterns. Instead, it depends on well defined mathematical rules and several relations between letters. A series of tests were conducted on ARS stemmer to compare its effectiveness with the effectiveness of two other Arabic stemmers. Test shows clearly the effectiveness superiority of ARS compared to effectiveness of these two Arabic stemmers.
引用
收藏
页码:702 / 716
页数:15
相关论文
共 50 条
  • [1] An Improved Arabic Light Stemmer
    Elrajubi, Osama Mohamed
    [J]. 2013 INTERNATIONAL CONFERENCE ON RESEARCH AND INNOVATION IN INFORMATION SYSTEMS (ICRIIS), 2013, : 33 - 38
  • [2] Conditional Arabic Light Stemmer: CondLight
    Al-Lahham, Yaser
    Matarneh, Khawlah
    Hassan, Mohammad
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (3A) : 559 - 564
  • [3] A novel robust Arabic light stemmer
    Abainia, Kheireddine
    Ouamour, Siham
    Sayoud, Halim
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2017, 29 (03) : 557 - 573
  • [4] Arabic Light Stemming: A Comparative Study between P-Stemmer, Khoja Stemmer, and Light10 Stemmer
    Kanan, Tarek
    Sadaqa, Odai
    Almhirat, Ashraf
    Kanan, Emran
    [J]. 2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2019, : 511 - 515
  • [5] Arabic light-based stemming: a comparative study among ligh10 stemmer, P-stemmer, and Conditional light stemmer
    Hussien, Sabria Mohammed
    Aburagheef, Hazim J.
    [J]. PROCEEDING OF 2021 2ND INFORMATION TECHNOLOGY TO ENHANCE E-LEARNING AND OTHER APPLICATION (IT-ELA 2021), 2021, : 131 - 135
  • [6] A New Enhanced Arabic Light Stemmer for IR in Medical Documents
    Al-Khatib, Ra'ed M.
    Zerrouki, Taha
    Abu Shquier, Mohammed M.
    Balla, Amar
    Al-Khateeb, Asef
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (01): : 1255 - 1269
  • [7] Arabic light-based stemmer using new rules
    Alshalabi, Hamood
    Tiun, Sabrina
    Omar, Nazlia
    AL-Aswadi, Fatima N.
    Alezabi, Kamal Ali
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (09) : 6635 - 6642
  • [8] P-Stemmer or NLTK Stemmer for Arabic Text Classification?
    Elbes, Mohammed
    Aldajah, Amal
    Sadaqa, Odai
    [J]. 2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2019, : 516 - 520
  • [9] Arabic Stemmer Based Big Data
    Madani, Youness
    Erritali, Mohammed
    Bengourram, Jamaa
    [J]. JOURNAL OF ELECTRONIC COMMERCE IN ORGANIZATIONS, 2018, 16 (01) : 17 - 28
  • [10] Impact of Stemmer on Arabic Text Retrieval
    Atwan, Jaffar
    Mohd, Masnizah
    Kanaan, Ghassan
    Bsoul, Qusay
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2014, 2014, 8870 : 314 - 326