Feature Selection for Text Classification Using Machine Learning Approaches

被引:13
|
作者
Thirumoorthy, K. [1 ]
Muneeswaran, K. [1 ]
机构
[1] Mepco Schlenk Engn Coll, Dept Comp Sci & Engn, Sivakasi, India
来源
关键词
Feature selection; Text classification; Filter-based approach;
D O I
10.1007/s40009-021-01043-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the present scenario, millions of internet users are contributing a huge amount of data in the form of unstructured text documents. In text classification, the high dimensional feature space, noise and irrelevant information of unstructured text documents are reducing the accuracy of text classifier. The feature selection scheme is adopted to address the high dimensional feature space problem of text classification. In this proposed research, a feature selection method based on the term frequency distribution measure is deployed. We have used the Naive Bayes and SVM classifiers with two benchmark datasets (WebKB and BBC). The experimental outcome confirms that the proposed feature selection method has a better classification accuracy when compared with other feature selection techniques.
引用
下载
收藏
页码:51 / 56
页数:6
相关论文
共 50 条
  • [1] Feature Selection for Text Classification Using Machine Learning Approaches
    K. Thirumoorthy
    K. Muneeswaran
    National Academy Science Letters, 2022, 45 : 51 - 56
  • [2] Feature selection and machine learning algorithms for uyghur text sentiment classification
    Turhuntay, Raxida
    Slamu, Wushour
    Dawut, Abdusalam
    Hamdulla, Askar
    Turhun, Erxat
    Boletin Tecnico/Technical Bulletin, 2017, 55 (13): : 56 - 66
  • [3] Machine learning and neural network approaches to feature selection and extraction for classification
    Russell, I
    Markov, Z
    Carse, B
    Pipe, AG
    Holder, LB
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2005, 19 (02) : 129 - 132
  • [4] Connected Devices Classification using Feature Selection with Machine Learning
    Fagroud, Fatima Zahra
    Toumi, Hicham
    Lahmar, El Habib Ben
    Achtaich, Khadija
    El Filali, Sanaa
    Baddi, Youssef
    IAENG International Journal of Computer Science, 2022, 49 (02)
  • [5] Optimizing Feature Selection for Solar Park Classification: Approaches with OBIA and Machine Learning
    Ladisa, Claudio
    Capolupo, Alessandra
    Tarantino, Eufemia
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2024 WORKSHOPS, PT V, 2024, 14819 : 286 - 301
  • [6] Feature selection and classification of protein protein complexes based on their binding affinities using machine learning approaches
    Yugandhar, K.
    Gromiha, M. Michael
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2014, 82 (09) : 2088 - 2096
  • [7] Design of a Predictor Model for Feature Selection using Machine Learning Approaches
    Pradeep, P.
    Kamalakannan, J.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (05) : 2359 - 2373
  • [8] Text Learning and Hierarchical Feature Selection in Webpage Classification
    Peng, Xiaogang
    Ming, Zhong
    Wang, Haitao
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 452 - 459
  • [9] Gearbox faults feature selection and severity classification using machine learning
    Zuber, Ninoslav
    Bajric, Rusmir
    EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2020, 22 (04): : 748 - 756
  • [10] Data Classification Using Feature Selection And kNN Machine Learning Approach
    Begum, Shemim
    Chakraborty, Debasis
    Sarkar, Ram
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 811 - 814