Performance of KNN and SVM classifiers on full word Arabic articles

Cited by: 75
Authors
Hmeidi, Ismail [1]
Hawashin, Bilal [1]
El-Qawasmeh, Eyas [1]
Affiliations
[1] Jordan Univ Sci & Technol, Fac Comp & Informat Technol, Irbid 22110, Jordan
Keywords
Arabic text categorization; full word features; tf.idf weighting; CHI statistics; KNN; SVM
DOI
10.1016/j.aei.2007.12.001
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper reports a comparative study of two machine learning methods for Arabic text categorization. Using one collection of news articles as a training set and another collection of news articles as a test set, we evaluated the K nearest neighbor (KNN) algorithm and the support vector machine (SVM) algorithm. We used full word features, with tf.idf as the weighting method for feature selection and CHI statistics as a ranking metric. Experiments showed that both methods achieved superior performance on the test corpus, while SVM showed a better micro-averaged F1 and prediction time. (C) 2007 Elsevier Ltd. All rights reserved.
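The pipeline named in the abstract (CHI ranking of full-word features, tf.idf weighting, a KNN classifier, micro-averaged F1) can be illustrated with a minimal pure-Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy English corpus stands in for Arabic news articles, and the feature count, the neighbour count `k=3`, the cosine similarity, and the `log(N/df)` form of idf are hypothetical choices. The SVM side of the comparison is omitted to keep the example self-contained.

```python
import math
from collections import Counter

# Hypothetical toy corpus of (tokens, label) pairs; not the paper's data.
train = [
    ("match goal team win".split(), "sports"),
    ("team player goal score".split(), "sports"),
    ("bank market stock price".split(), "economy"),
    ("market price trade bank".split(), "economy"),
]
test = [
    ("goal team player".split(), "sports"),
    ("stock market trade".split(), "economy"),
]

def chi_square(term, label, docs):
    """CHI statistic of a term for one class, from the 2x2 contingency table."""
    n = len(docs)
    a = sum(1 for toks, y in docs if term in toks and y == label)
    b = sum(1 for toks, y in docs if term in toks and y != label)
    c = sum(1 for toks, y in docs if term not in toks and y == label)
    d = n - a - b - c
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    return n * (a * d - c * b) ** 2 / denom if denom else 0.0

def select_features(docs, k):
    """Rank terms by their maximum CHI score over classes and keep the top k."""
    labels = {y for _, y in docs}
    vocab = {t for toks, _ in docs for t in toks}
    score = {t: max(chi_square(t, y, docs) for y in labels) for t in vocab}
    return sorted(vocab, key=lambda t: (-score[t], t))[:k]

def make_idf(docs, features):
    """idf = log(N / df), computed on the training documents only."""
    df = Counter(t for toks, _ in docs for t in set(toks))
    return {t: math.log(len(docs) / df[t]) for t in features}

def vectorize(tokens, features, idf):
    """tf.idf vector: raw term frequency times idf, one entry per feature."""
    tf = Counter(tokens)
    return [tf[t] * idf[t] for t in features]

def cosine(u, v):
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0

def knn_predict(vec, train_vecs, train_labels, k):
    """Majority vote among the k training vectors most similar to vec."""
    ranked = sorted(zip((cosine(vec, tv) for tv in train_vecs), train_labels),
                    key=lambda pair: pair[0], reverse=True)
    return Counter(label for _, label in ranked[:k]).most_common(1)[0][0]

def micro_f1(gold, pred):
    """Pool TP, FP, FN over all classes, then compute a single F1."""
    tp = fp = fn = 0
    for c in set(gold) | set(pred):
        tp += sum(1 for g, p in zip(gold, pred) if g == c and p == c)
        fp += sum(1 for g, p in zip(gold, pred) if g != c and p == c)
        fn += sum(1 for g, p in zip(gold, pred) if g == c and p != c)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0

features = select_features(train, k=6)
idf = make_idf(train, features)
train_vecs = [vectorize(toks, features, idf) for toks, _ in train]
train_labels = [y for _, y in train]
predictions = [knn_predict(vectorize(toks, features, idf), train_vecs, train_labels, k=3)
               for toks, _ in test]
score = micro_f1([y for _, y in test], predictions)
print(features, predictions, score)
```

Because each document carries exactly one label in this sketch, micro-averaged F1 coincides with plain accuracy; the pooled form is kept because it also generalises to multi-label settings.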
Pages: 106-111
Page count: 6