An intelligent use of stemmer and morphology analysis for Arabic information retrieval

Cited by: 11
Authors
Alnaied, Ali [1]
Elbendak, Mosa [2]
Bulbul, Abdullah [3]
Affiliations
[1] Ankara Yildirim Beyazit Univ, Dept Elect & Comp Engn, Ankara, Turkey
[2] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne, Tyne & Wear, England
[3] Ankara Yildirim Beyazit Univ, Dept Comp Engn, Ankara, Turkey
Keywords
Natural language processing; Arabic morphological analysis; Information retrieval systems; Arabic stemmer;
DOI
10.1016/j.eij.2020.02.004
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Arabic information retrieval has gained significant attention due to the increasing use of Arabic text on the web and social media networks. This paper presents a new approach to Arabic stemming, called Arabic Morphology Information Retrieval (AMIR), which generates or extracts stems by applying a set of rules about the relationships among Arabic letters to find the root or stem of each word; these stems serve as indexing terms for text search in Arabic retrieval systems. To demonstrate the usefulness of the proposed algorithm, we highlight the benefits of the proposed rules for different Arabic information retrieval systems. Finally, we evaluate the AMIR system by comparing its performance with LUCENE, FARASA, and a no-stemmer baseline in terms of mean average precision. The results show that AMIR achieves a mean average precision of 0.34%, while LUCENE, FARASA, and the no-stemmer baseline give 0.27%, 0.28% and 0.21, respectively. This demonstrates that AMIR improves Arabic stemming and increases retrieval effectiveness, and that it is robust to any type of stem. © 2020 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Artificial Intelligence, Cairo University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
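The abstract compares systems by mean average precision (MAP). As a rough illustration of that metric only, here is a minimal Python sketch of the standard definition: the average, over queries, of the mean precision at each rank where a relevant document appears. The function names, document identifiers, and toy data are hypothetical and are not taken from the paper; the paper's own evaluation setup may differ.

```python
from typing import Dict, List, Set


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average precision for one query.

    ranked   -- document ids in the order the system returned them
    relevant -- ids judged relevant for this query
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            # Precision at this rank, counted only at relevant positions.
            precision_sum += hits / rank
    # Relevant documents that were never retrieved contribute zero.
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Dict[str, List[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Mean of per-query average precision over all judged queries."""
    if not qrels:
        return 0.0
    total = sum(
        average_precision(runs.get(query, []), rel)
        for query, rel in qrels.items()
    )
    return total / len(qrels)


# Hypothetical toy data: two queries with made-up document ids.
toy_runs = {"q1": ["d1", "d2", "d3"], "q2": ["d4", "d5"]}
toy_qrels = {"q1": {"d1", "d3"}, "q2": {"d5"}}
toy_map = mean_average_precision(toy_runs, toy_qrels)
```

With this definition, a stemmer that conflates more word forms correctly tends to move relevant documents up the ranking, which raises the per-query average precision and hence the MAP.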
Pages: 209-217
Number of pages: 9
Related papers
50 in total
  • [31] Arabic Word Sense Disambiguation for Information Retrieval
    Abderrahim, Mohammed Alaeddine
    Abderrahim, Mohammed El-Amine
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (04)
  • [32] Modern information retrieval in Arabic - catering to standard and colloquial Arabic users
    Azmi, Aqil M.
    Aljafari, Eman A.
    [J]. JOURNAL OF INFORMATION SCIENCE, 2015, 41 (04) : 506 - 517
  • [33] Enhanced Arabic information retrieval system based on Arabic text classification
    Ghwanmeh, Sameh
    Kanaan, Ghassan
    Al-Shalabi, Riyad
    Ababneh, Ahmad
    [J]. 2007 INNOVATIONS IN INFORMATION TECHNOLOGIES, VOLS 1 AND 2, 2007, : 527 - +
  • [34] Intensive use of correspondence analysis for information retrieval
    Morin, A
    [J]. ITI 2004: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2004, : 255 - 258
  • [35] PLIS: Proposed Language Independent Stemmer for Information Retrieval Systems Using Dynamic Programming
    Kasthuri, M.
    Kumar, S. Britto Ramesh
    Khaddaj, Souheil
    [J]. 2017 2ND WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT), 2017, : 132 - 135
  • [36] IRIA: The information retrieval intelligent assistant
    Francis, AG
    Devaney, M
    Ram, A
    [J]. IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 275 - 280
  • [37] Cognitive approaches to intelligent information retrieval
    Quintana, Y
    [J]. 1997 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS I AND II: ENGINEERING INNOVATION: VOYAGE OF DISCOVERY, 1997, : 261 - 264
  • [38] Toward intelligent music information retrieval
    Li, Tao
    Ogihara, Mitsunori
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2006, 8 (03) : 564 - 574
  • [39] Intelligent decision making in information retrieval
    Phillips-Wren, GE
    Forgionne, GA
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2004, 3213 : 103 - 109
  • [40] Knowledge engineering for intelligent information retrieval
    Drexel, G
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2001, 2004 : 495 - 504