Simple Stemming Rules for Arabic Language

Cited by: 2
Authors
Soori, Hussein [1]
Platos, Jan [1]
Snasel, Vaclav [1]
Affiliations
[1] VSB Tech Univ Ostrava, Dept Comp Sci, FEECS, Ostrava 70800, Czech Republic
Keywords
DOI
10.1007/978-3-642-31603-6_9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Processing of the Arabic language is increasingly important because the number of computer and Internet users in the Arab world is growing tremendously. Stemming is a very important problem in information retrieval, knowledge mining and language processing. Arabic has a very complex morphology, and stemming rules must deal with many properties specific to Arabic. This paper describes very simple rules for stemming Arabic words. Two of these rules are universal, i.e. they are applicable to any word category, and there is one rule for each of the four categories: nouns, verbs, adverbs and adjectives. The rules were more successful for adverbs. For nouns, verbs and adjectives, some errors occurred, especially in suffix processing.
Pages: 99 / +
Page count: 3