On the Use of Arabic Stemmers to Increase the Recall of Information Retrieval Systems

被引:0
|
作者
Nasra, Ihab [1 ]
Maree, Mohammed [2 ]
机构
[1] Arab Amer Univ, Dept Comp Sci, Jenin, Palestine
[2] Arab Amer Univ, Dept Informat Technol, Jenin, Palestine
关键词
Information Retrieval; Arabic Stemming; Morphological Analysis; Natural Language Processing; Rule-Based Stemmers;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building robust information revival systems demands employing efficient natural language processing and morphological analysis techniques. These techniques are commonly exploited to find syntactic and semantic matches between users' queries and their corresponding documents. Word stemming is one those techniques that has been widely employed in Information Retrieval systems, namely to increase their recall. A lot of research work has been conducted to evaluate English stemming techniques. However, a little attention has been given to Arabic stemmers. In this research work, we present a comprehensive review of state-of-the-art Arabic stemming techniques and compare between them according to a variety of criteria. In addition, we classify existing Arabic stemmers into four categories: Root-based, Affix Removal, Rule-based, and Context-based techniques. We review seven of the most commonly used Arabic stemming algorithms that fall under these categories, and provide a comparative analysis and evaluation between them according to the goal, input, employed approach, and output of each technique. We conclude this study by proposing our idea of building a hybrid Arabic stemming approach that combines multiple stemmers and exploits a new set of rules to better stem Arabic words.
引用
收藏
页码:2462 / 2468
页数:7
相关论文
共 50 条
  • [1] Comparative Analysis of Nine Arabic Stemmers on Microblog Information Retrieval
    Almazrua, Amal
    Almazrua, Manal
    Alkhalifa, Hend
    2020 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2020), 2020, : 60 - 65
  • [2] The Use of Arabic WordNet in Arabic Information Retrieval
    Abbache, Ahmed
    Barigou, Fatiha
    Belkredim, Fatma Zohra
    Belalem, Ghalem
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2014, 4 (03) : 54 - 65
  • [3] Comparative Study of Various Persian Stemmers in the Field of Information Retrieval
    Moghadam, Fatemeh Momenipour
    Keyvanpour, MohammadReza
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2015, 11 (03): : 450 - 464
  • [4] Recall-Oriented Evaluation for Information Retrieval Systems
    Audeh, Bissan
    Beaune, Philippe
    Beigbeder, Michel
    MULTIDISCIPLINARY INFORMATION RETRIEVAL, 2013, 8201 : 29 - 32
  • [5] Events extraction and classification for arabic information retrieval systems
    Abuleil, S
    Evens, M
    ICTAI 2004: 16TH IEEE INTERNATIONALCONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, : 769 - 770
  • [6] Arabic information retrieval
    Abu El-Khair, Ibrahim
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2007, 41 : 505 - 533
  • [7] Arabic Information Retrieval
    Darwish, Kareem
    Magdy, Walid
    FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2013, 7 (04): : I - 342
  • [8] On the Use of Fuzzy Information Retrieval for Gauging Similarity of Arabic Documents
    Alzahrani, Salha Mohammed
    Salim, Naomie
    2009 SECOND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES (ICADIWT 2009), 2009, : 539 - +
  • [9] An intelligent use of stemmer and morphology analysis for Arabic information retrieval
    Alnaied, Ali
    Elbendak, Mosa
    Bulbul, Abdullah
    EGYPTIAN INFORMATICS JOURNAL, 2020, 21 (04) : 209 - 217
  • [10] The Impact of Online Indexing in Improving Arabic Information Retrieval Systems
    Dilekh, Tahar
    Benharzallah, Saber
    Behloul, Ali
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2018, 42 (04): : 607 - 616