An intelligent use of stemmer and morphology analysis for Arabic information retrieval

被引:11
|
作者
Alnaied, Ali [1 ]
Elbendak, Mosa [2 ]
Bulbul, Abdullah [3 ]
机构
[1] Ankara Yildirim Beyazit Univ, Dept Elect & Comp Engn, Ankara, Turkey
[2] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne, Tyne & Wear, England
[3] Ankara Yildirim Beyazit Univ, Dept Comp Engn, Ankara, Turkey
关键词
Natural language processing; Arabic morphological analysis; Information retrieval systems; Arabic stemmer;
D O I
10.1016/j.eij.2020.02.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Arabic Information Retrieval has gained significant attention due to an increasing usage of Arabic text on the web and social media networks. This paper discusses a new approach for Arabic stem, called Arabic Morphology Information Retrieval (AMIR), to generate/extract stems by applying a set of rules regarding the relationship among Arabic letters to find the root/stem of the respective words used as indexing terms for the text search in Arabic retrieval systems. To demonstrate the usefulness of the proposed algorithm, we highlight the benefits of the proposed rules for different Arabic information retrieval systems. Finally, we have evaluated AMIR system by comparing its performance with LUCENE, FARASA, and no-stemmer counterpart system in terms of mean average precisions. The results obtained demonstrate that AMIR has achieved a mean average precision of 0.34% while LUCENE, FARASA and no stemmer giving 0.27%, 0.28% and 0.21, respectively. This demonstrates that AMIR is able to improve Arabic stemmer and increases retrieval as well as being strong against any type of stem. (C) 2020 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Artificial Intelligence, Cairo University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:209 / 217
页数:9
相关论文
共 50 条
  • [1] Arabic Stemmer for Search Engines Information Retrieval
    Khalid, Ahmed
    Hussain, Zakir
    Baig, Mirza Anwarullah
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (01) : 407 - 411
  • [2] Building an automatic stemmer to enhance arabic information retrieval systems
    Alsamara, K
    Abuleil, S
    Abu-Salem, H
    Hammo, B
    [J]. International Conference on Computing, Communications and Control Technologies, Vol 5, Proceedings, 2004, : 270 - 274
  • [3] Impact of Stemmer on Arabic Text Retrieval
    Atwan, Jaffar
    Mohd, Masnizah
    Kanaan, Ghassan
    Bsoul, Qusay
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2014, 2014, 8870 : 314 - 326
  • [4] Impact of stemmer on arabic text retrieval
    [J]. Atwan, Jaffar (jaffaratwan@gmail.com), 1600, Springer Verlag (8870):
  • [5] A rule-based extensible stemmer for information retrieval with application to Arabic
    Harmanani, HM
    Keirouz, WT
    Raheel, S
    [J]. Proceedings of the Eighth IASTED International Conference on Artificial Intelligence and Soft Computing, 2004, : 35 - 40
  • [6] The Use of Arabic WordNet in Arabic Information Retrieval
    Abbache, Ahmed
    Barigou, Fatiha
    Belkredim, Fatma Zohra
    Belalem, Ghalem
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2014, 4 (03) : 54 - 65
  • [7] Arabic information retrieval
    Abu El-Khair, Ibrahim
    [J]. ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2007, 41 : 505 - 533
  • [8] Arabic Information Retrieval
    Darwish, Kareem
    Magdy, Walid
    [J]. FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2013, 7 (04): : I - 342
  • [9] Stemmer Impact on Quranic Mobile Information Retrieval Performance
    Aljaloud, Huda Omar
    Dahab, Mohammed
    Kamal, Mahmoud
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (12) : 135 - 139
  • [10] Arabic Information Retrieval Using Semantic Analysis of Documents
    Al-Maghasbeh, Mohammad Khaled A.
    Bin Hamzah, Mohd Pouzi
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (05): : 53 - 58