Performance of KNN and SVM classifiers on full word Arabic articles

被引:75
|
作者
Hmeidi, Ismail [1 ]
Hawashin, Bilal [1 ]
El-Qawasmeh, Eyas [1 ]
机构
[1] Jordan Univ Sci & Technol, Fac Comp & Informat Technol, Irbid 22110, Jordan
关键词
Arabic text categorization; full word features; tf.idf weighting; CHI statistics; KNN; SVM;
D O I
10.1016/j.aei.2007.12.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper reports a comparative study of two machine learning methods on Arabic text categorization. Based on a collection of news articles as a training set, and another set of news articles as a testing set, we evaluated K nearest neighbor (KNN) algorithm, and support vector machines (SVM) algorithm. We used the full word features and considered the tf.idf as the weighting method for feature selection, and CHI statistics as a ranking metric. Experiments showed that both methods were of superior performance on the test corpus while SVM showed a better micro average F1 and prediction time. (C) 2007 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页码:106 / 111
页数:6
相关论文
共 50 条
  • [31] Classifiers Selection and features extraction / selection for Arabic handwritten word recognition
    Nabiha, Azizi
    Mokhtar, Sellami
    International Review on Computers and Software, 2009, 4 (02) : 212 - 219
  • [32] Combination of Multiple Classifiers for Off -Line Handwritten Arabic Word Recognition
    Zaghdoudi, Rachid
    Seridi, Hamid
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (05) : 713 - 720
  • [33] Assessment Of Driving Stress Through SVM And KNN Classifiers On Multi-Domain Physiological Data
    Fruet, Damiano
    Bara, Chiara
    Pernice, Riccardo
    Faes, Luca
    Nollo, Giandomenico
    2022 IEEE 21ST MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (IEEE MELECON 2022), 2022, : 920 - 925
  • [34] Gene-Expression-Based Cancer Classification Through feature selection with KNN and SVM Classifiers
    Bouazza, Sara Haddou
    Hamdi, Nezha
    Zeroual, Abdelouhab
    Auhmani, Khalid
    2015 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2015,
  • [35] Evalutation of performance of KNN, MLP and RBF classifiers in emotion detection problem
    Polat, Goekhan
    Altun, Halis
    2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3, 2007, : 459 - +
  • [36] A Novel Word Based Arabic Handwritten Recognition System Using SVM Classifier
    Khalifa, Mahmoud
    Yang BingRu
    ADVANCED RESEARCH ON ELECTRONIC COMMERCE, WEB APPLICATION, AND COMMUNICATION, PT 1, 2011, 143 : 163 - 171
  • [37] On the Performance of Ensemble-Based Classifiers for Arabic Speech Recognition
    Absa, Ahmed H. Abo
    Deriche, Mohamed
    2017 4TH IEEE INTERNATIONAL CONFERENCE ON ENGINEERING TECHNOLOGIES AND APPLIED SCIENCES (ICETAS), 2017,
  • [38] COLLABORATIVE COMBINATION OF NEURON-LINGUISTIC CLASSIFIERS FOR LARGE ARABIC WORD VOCABULARY RECOGNITION
    Echi, Afef Kacem
    Ben Cheikh, Imen
    Belaid, Abdel
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2014, 28 (01)
  • [39] Arabic literal amount sub-word recognition using multiple features and classifiers
    Ahmad, Irfan
    Awaida, Sameh
    Mahmoud, Sabri A.
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2020, 6 (02) : 103 - 123
  • [40] Performance of SVM and Bayesian classifiers on the systematic review classification task
    Matwin, Stan
    Kouznetsov, Alexandre
    Inkpen, Diana
    Frunza, Oana
    O'Blenis, Peter
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (01) : 104 - 105