Performance of KNN and SVM classifiers on full word Arabic articles

被引:75
|
作者
Hmeidi, Ismail [1 ]
Hawashin, Bilal [1 ]
El-Qawasmeh, Eyas [1 ]
机构
[1] Jordan Univ Sci & Technol, Fac Comp & Informat Technol, Irbid 22110, Jordan
关键词
Arabic text categorization; full word features; tf.idf weighting; CHI statistics; KNN; SVM;
D O I
10.1016/j.aei.2007.12.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper reports a comparative study of two machine learning methods on Arabic text categorization. Based on a collection of news articles as a training set, and another set of news articles as a testing set, we evaluated K nearest neighbor (KNN) algorithm, and support vector machines (SVM) algorithm. We used the full word features and considered the tf.idf as the weighting method for feature selection, and CHI statistics as a ranking metric. Experiments showed that both methods were of superior performance on the test corpus while SVM showed a better micro average F1 and prediction time. (C) 2007 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页码:106 / 111
页数:6
相关论文
共 50 条
  • [41] Performance Analysis of Mammogram CAD System using SVM and KNN Classifier
    Saraswathi, D.
    Srinivasan, E.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2017), 2017, : 866 - 870
  • [42] Word-Based Arabic Handwritten Recognition Using SVM Classifier with a Reject Option
    El Qacimy, Bouchra
    Kerroum, Mounir Ait
    Hammouch, Ahmed
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 64 - 68
  • [43] Naive Bayes and SVM classifiers for classifying Databank Accession Number sentences from online biomedical articles
    Kim, Jongwoo
    Le, Daniel X.
    Thoma, George R.
    DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534
  • [44] Classification of yoga, meditation, combined yoga–meditation EEG signals using L-SVM, KNN, and MLP classifiers
    A. Rajalakshmi
    S. S. Sridhar
    Soft Computing, 2024, 28 : 4607 - 4619
  • [45] Review and performance comparison of SVM- and ELM-based classifiers
    Chorowski, Jan
    Wang, Jian
    Zurada, Jacek M.
    NEUROCOMPUTING, 2014, 128 : 507 - 516
  • [46] Classification of yoga, meditation, combined yoga-meditation EEG signals using L-SVM, KNN, and MLP classifiers
    Rajalakshmi, A.
    Sridhar, S. S.
    SOFT COMPUTING, 2024, 28 (05) : 4607 - 4619
  • [47] Advancing prostate cancer detection: a comparative analysis of PCLDA-SVM and PCLDA-KNN classifiers for enhanced diagnostic accuracy
    Dubey, Priya
    Kumar, Surendra
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [48] Advancing prostate cancer detection: a comparative analysis of PCLDA-SVM and PCLDA-KNN classifiers for enhanced diagnostic accuracy
    Priya Dubey
    Surendra Kumar
    Scientific Reports, 13
  • [49] Performance of authorship attribution classifiers with short texts: application of religious Arabic fatwas
    Al-Sarem, Mohammed
    Emara, Abdel-Hamid
    Wahab, Ahmed Abdel
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2020, 12 (03) : 350 - 364
  • [50] Comparison of Data Reduction Techniques Based on the Performance of SVM-type Classifiers
    Georgescu, Ramona
    Berger, Christian R.
    Willett, Peter
    Azam, Mohammad
    Ghoshal, Sudipto
    2010 IEEE AEROSPACE CONFERENCE PROCEEDINGS, 2010,