Simple Stemming Rules for Arabic Language

被引:2
|
作者
Soori, Hussein [1 ]
Platos, Jan [1 ]
Snasel, Vaclav [1 ]
机构
[1] VSB Tech Univ Ostrava, Dept Comp Sci, FEECS, Ostrava 70800, Czech Republic
关键词
D O I
10.1007/978-3-642-31603-6_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Processing of Arabic language is eminent for the fact that currently the number of computer and Internet users in the Arab word is growing tremendously. The problem of stemming is very important in information retrieval, knowledge mining and language processing. Arabic has very complex morphology and stemming rules that must deal with many specific properties of Arabic. This paper describes very simple rules for stemming of Arabic words. Two of these rules are universal, i.e. they are applicable to any word category, and one rule for each of the four categories: nouns, verbs, adverbs and adjectives. The rules were more successful in case of adverbs. As for nouns, verbs and adjectives, some errors occurred especially in case of suffix processing.
引用
收藏
页码:99 / +
页数:3
相关论文
共 50 条
  • [41] Stemming Versus Light Stemming for Measuring the Simitilarity between Arabic Words with Latent Semantic Analysis Model
    Froud, Hanane
    Lachkar, Abdelmonaime
    Ouatik, Said Alaoui
    [J]. 2012 COLLOQUIUM ON INFORMATION SCIENCE AND TECHNOLOGY (CIST'12), 2012, : 69 - 73
  • [42] A Simple Present and Past Sentences Machine Translation from Arabic Language (AL) to English language
    Hmeidi, Ismail
    Al-Aiad, Ahmad
    Al-Momani, Sama
    Ibnian, Mohammad
    [J]. 2016 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2016,
  • [43] Automatic Stemming of Words for Punjabi Language
    Gupta, Vishal
    [J]. ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 73 - 84
  • [44] Indexing and stemming approaches for the Czech language
    Dolamic, Ljiljana
    Savoy, Jacques
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2009, 45 (06) : 714 - 720
  • [45] A word stemming algorithm for the Spanish language
    Honrado, A
    Leon, R
    O'Donnel, R
    Sinclair, D
    [J]. SPIRE 2000: SEVENTH INTERNATIONAL SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL - PROCEEDINGS, 2000, : 139 - 145
  • [46] AN ACCURACY-ENHANCED STEMMING ALGORITHM FOR ARABIC INFORMATION RETRIEVAL
    Bessou, Sadik
    Touahria, Mohamed
    [J]. NEURAL NETWORK WORLD, 2014, 24 (02) : 117 - 128
  • [47] Automatic stemming for indexing of an agglutinative language
    Cho, S
    Han, SS
    [J]. ADVANCES IN INFORMATION SYSTEMS, 2002, 2457 : 154 - 165
  • [48] AUTOMATED ARABIC ESSAY SCORING BASED ON HYBRID STEMMING WITH WORDNET
    Alobed, Mohammad
    Altrad, Abdallah M. M.
    Abu Bakar, Zainab Binti
    Zamin, Norshuhani
    [J]. MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2021, : 55 - 67
  • [49] The Use of Stemming in the Arabic Text and Its Impact on the Accuracy of Classification
    Atwan, Jaffar
    Wedyan, Mohammad
    Bsoul, Qusay
    Hammadeen, Ahmad
    Alturki, Ryan
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [50] The Arabic language
    Wright, O
    [J]. BULLETIN OF THE SCHOOL OF ORIENTAL AND AFRICAN STUDIES-UNIVERSITY OF LONDON, 2002, 65 : 491 - 492