Efficient English text classification using selected Machine Learning Techniques

被引:79
|
作者
Luo, Xiaoyu [1 ]
机构
[1] Hunan Univ Technol & Business, 569 Yuelu Rd, Changsha 411104, Hunan, Peoples R China
关键词
Text classification; English language; Machine Learning; Text mining; Support Vector Machines; NETWORKS; SVM;
D O I
10.1016/j.aej.2021.02.009
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Text classification (TC) is an approach used for the classification of any kind of documents for the target category or out. In this paper, we implemented the Support Vector Machines (SVM) model in classifying English text and documents. Here we did two analytical experiments to check the selected classifiers using English documents. Experimental results performed on a set of 1033 text document present that the Rocchio classifier provides the best performance results when the size of the feature set is small while SVM outperforms the other classifiers. From the experimental analysis, we observed that the classification rate exceeds 90% when using more than 4000 features. (C) 2021 THE AUTHOR. Published by Elsevier BV on behalf of Faculty of Engineering, Alexandria University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).
引用
收藏
页码:3401 / 3409
页数:9
相关论文
共 50 条
  • [1] An exploration on text classification using machine learning techniques
    Athanasios, Tzimourtas
    Spyros, Bakalakos
    Panagiota, Tselenti
    Athanasios, Voulodimos
    [J]. 25TH PAN-HELLENIC CONFERENCE ON INFORMATICS WITH INTERNATIONAL PARTICIPATION (PCI2021), 2021, : 247 - 249
  • [2] Text Classification Using Lifelong Machine Learning
    Arif, Muhammad Hassan
    Jin, Xin
    Li, Jianxin
    Iqbal, Muhammad
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 394 - 404
  • [3] Survey on supervised machine learning techniques for automatic text classification
    Ammar Ismael Kadhim
    [J]. Artificial Intelligence Review, 2019, 52 : 273 - 292
  • [4] Survey on supervised machine learning techniques for automatic text classification
    Kadhim, Ammar Ismael
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2019, 52 (01) : 273 - 292
  • [5] Emotion Classification of Thai Text based Using Term weighting and Machine Learning Techniques
    Chirawichitchai, Nivet
    [J]. 2014 11TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2014, : 91 - 96
  • [6] Automated Text Classification of Maintenance Data of Higher Education Buildings Using Text Mining and Machine Learning Techniques
    Hong, Sungil
    Kim, Junghyun
    Yang, Eunhwa
    [J]. JOURNAL OF ARCHITECTURAL ENGINEERING, 2022, 28 (01)
  • [7] Text Classification of British English and American English Using Support Vector Machine
    Utomo, Muhammad Romi Ario
    Sibaroni, Yuliant
    [J]. 2019 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2019, : 576 - 581
  • [8] Frog classification using machine learning techniques
    Huang, Chenn-Jung
    Yang, Yi-Ju
    Yang, Dian-Xiu
    Chen, You-Jia
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3737 - 3743
  • [9] Diabetes Classification Using Machine Learning Techniques
    Phongying, Methaporn
    Hiriote, Sasiprapa
    [J]. COMPUTATION, 2023, 11 (05)
  • [10] Academic Registration Text Classification Using Machine Learning
    Alhawas, Mohammed S.
    Almurayziq, Tariq S.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (01): : 93 - 96