A rule-based extensible stemmer for information retrieval with application to Arabic

Authors
Harmanani, HM [1]
Keirouz, WT [1]
Raheel, S [1]
Affiliations
[1] Lebanese Amer Univ, Dept Comp Sci, Byblos 1401 2010, Lebanon
Keywords
natural language processing; information retrieval
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a new and extensible method for information retrieval and content analysis in natural languages (NL). The proposed method is stem-based; stems are extracted based on a set of language-dependent rules that are interpreted by a rule engine. The rule engine allows the system to be adapted to any natural language by modifying the NL semantic rules and grammar. The system has been fully tested on Arabic, and partially on English, Hebrew and Persian. We validate our approach using a database-based prototype.
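The abstract does not give the rule format or the engine's design, so the following is only a minimal sketch of the general idea: a language-independent engine that strips affixes according to a rule table supplied per language. The `Rule` class, the `stem` function, and both rule tables are hypothetical names and data chosen for illustration; they are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """One hypothetical affix-stripping rule (not the paper's rule format)."""
    kind: str      # "prefix" or "suffix"
    affix: str     # non-empty affix to strip
    min_stem: int  # minimum length the remaining stem must keep


def stem(word, rules):
    """Apply the first usable prefix rule, then the first usable suffix rule."""
    for kind in ("prefix", "suffix"):
        for rule in rules:
            if rule.kind != kind:
                continue
            if kind == "prefix" and word.startswith(rule.affix):
                candidate = word[len(rule.affix):]
            elif kind == "suffix" and word.endswith(rule.affix):
                candidate = word[:-len(rule.affix)]
            else:
                continue
            # Skip rules that would leave too short a stem.
            if len(candidate) >= rule.min_stem:
                word = candidate
                break
    return word


# Illustrative rule tables only; a real system would need far richer rules.
ENGLISH_RULES = [
    Rule("prefix", "un", 3),
    Rule("suffix", "ing", 3),
    Rule("suffix", "ed", 3),
    Rule("suffix", "s", 3),
]

ARABIC_RULES = [
    Rule("prefix", "ال", 2),  # definite article
    Rule("suffix", "ات", 2),  # feminine plural ending
]

print(stem("unlocked", ENGLISH_RULES))
print(stem("الكتاب", ARABIC_RULES))
```

In this sketch, switching language means passing a different rule list to the same `stem` function, which mirrors the extensibility claim in the abstract without asserting how the authors implemented it.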
Pages: 35-40
Page count: 6