Feature Selection For Text Classification Using Genetic Algorithms

被引:0
|
作者
Bidi, Noria [1 ]
Elberrichi, Zakaria [2 ]
机构
[1] Univ Mustapha Stambouli, Mascara, Algeria
[2] Univ Djilali Liabes, EEDIS Lab, Sidi Bel Abbes, Algeria
关键词
Genetic algorithms; feature selection; Naive Bayes; K-Nearest Neighbors; Support Vectors Machines; Text Classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In text classification, feature selection is essential to improve the classification effectiveness. This paper provides an empirical study of a feature selection method based on genetic algorithms for different text representation methods. This feature selection algorithm can accomplish two goals: in one hand is the search of a feature subset such that the performance of classifier is best; in other hands is find a feature subset with the smallest dimensionality which achieves higher accuracy in classification. To evaluate the performance of this approach, three from the best classifiers have been selected: Naive Bayes (NB), Nearest Neighbors (KNN) and Support Vector Machines (SVMs). Our objective is to determine whether the genetic algorithms based feature selection will improve the performances in text classification with smaller size using F-measure. Experimentations were carried out on two benchmark document collections 20Newsgroups, and Reuters-21578. And the results were very interesting.
引用
收藏
页码:806 / 810
页数:5
相关论文
共 50 条
  • [1] Gender Classification Based on Feature Selection Using Genetic Algorithms
    Liu, Zhiming
    Bebis, George
    Veropoulos, Konstantinos
    [J]. PROCEEDINGS OF THE 12TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS , PTS 1-3: NEW ASPECTS OF COMPUTERS, 2008, : 187 - 193
  • [2] Genetic algorithms for clustering, feature selection and classification
    Tseng, LY
    Yang, SB
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 1612 - 1616
  • [3] Feature Selection Using Genetic Algorithms for Premature Ventricular Contraction Classification
    Kaya, Yasin
    Pehlivan, Huseyin
    [J]. 2015 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2015, : 1229 - 1232
  • [4] Information-theoretic feature selection algorithms for text classification
    Novovicová, J
    Malík, A
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 3272 - 3277
  • [5] Classification of brain tumours Using Genetic Algorithms as a Feature Selection Method (GAFS)
    Gwalani, Harsha
    Mittal, Namita
    Vidyarthi, Ankit
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [6] Comparison of classification algorithms using feature selection
    Juarez-Lopez, Alexander
    Hernandez-Torruco, Jose
    Hernandez-Ocana, Betania
    Chavez-Bosquez, Oscar
    [J]. 2021 MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE (ENC 2021), 2021,
  • [7] A Comprehensive Study of Eleven Feature Selection Algorithms and their Impact on Text Classification
    Vora, Suchi
    Yang, Hui
    [J]. 2017 COMPUTING CONFERENCE, 2017, : 440 - 449
  • [8] Feature Selection in Text Classification
    Sahin, Durmus Ozkan
    Ates, Nurullah
    Kilic, Erdal
    [J]. 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1777 - 1780
  • [9] Feature Selection by Using Heuristic Methods for Text Classification
    Sel, Ilhami
    Yeroglu, Celalettin
    Hanbay, Davut
    [J]. 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,
  • [10] Feature Selection for Text Classification Using Mutual Information
    Sel, Ilhami
    Karci, Ali
    Hanbay, Davut
    [J]. 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,