A distance-based classifier for arabic text categorization

被引:0
|
作者
Duwairi, RM [1 ]
机构
[1] Jordan Univ Sci & Technol, Dept Informat & Comp Sci, Irbid, Jordan
关键词
text categorization; machine learning; k-NN classifier; Arabic language;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A distance-based classifier for Arabic text categorization was proposed The classifier, in its learning phase, scans the set of training documents once to extract features of categories that capture inherent category-specific properties; while in its testing phase the classifier uses category-specific features to categorize unclassified documents. Stemming was used to reduce the dimensionality of feature vectors. The accuracy of the classifier was tested by carrying out several categorization tasks on an in-house collected Arabic corpus. The results show that the proposed classifier is very accurate and robust.
引用
收藏
页码:187 / 192
页数:6
相关论文
共 50 条
  • [1] Arabic text categorization based on arabic wikipedia
    [J]. Yahya, A. (yahya@birzeit.edu), 1600, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (13):
  • [2] DDC: distance-based decision classifier
    Javad Hamidzadeh
    Reza Monsefi
    Hadi Sadoghi Yazdi
    [J]. Neural Computing and Applications, 2012, 21 : 1697 - 1707
  • [3] DDC: distance-based decision classifier
    Hamidzadeh, Javad
    Monsefi, Reza
    Yazdi, Hadi Sadoghi
    [J]. NEURAL COMPUTING & APPLICATIONS, 2012, 21 (07): : 1697 - 1707
  • [4] Quantum variational distance-based centroid classifier
    de Oliveira, Nicolas M.
    Park, Daniel K.
    Araujo, Israel F.
    da Silva, Adenilton J.
    [J]. NEUROCOMPUTING, 2024, 576
  • [5] Rank Distance Aggregation as a Fixed Classifier Combining Rule for Text Categorization
    Dinu, Liviu P.
    Rusu, Andrei
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2010, 6008 : 638 - 647
  • [6] Distance-Based Ensemble Online Classifier with Kernel Clustering
    Jedrzejowicz, Joanna
    Jedrzejowicz, Piotr
    [J]. INTELLIGENT DECISION TECHNOLOGIES, 2015, 39 : 279 - 289
  • [7] Implementing a distance-based classifier with a quantum interference circuit
    Schuld, M.
    Fingerhuth, M.
    Petruccione, F.
    [J]. EPL, 2017, 119 (06)
  • [8] Projected-prototype based classifier for text categorization
    Zhang, Jianfei
    Chen, Lifei
    Guo, Gongde
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 49 : 179 - 189
  • [9] A Hybrid Distance-Based and Naive Bayes Online Classifier
    Jedrzejowicz, Joanna
    Jedrzejowicz, Piotr
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT II, 2015, 9330 : 213 - 222
  • [10] A generalized cluster centroid based classifier for text categorization
    Pang, Guansong
    Jiang, Shengyi
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (02) : 576 - 586