A novel feature selection approach based on clustering algorithm

被引:6
|
作者
Moslehi, Fateme [1 ]
Haeri, Abdorrahman [2 ]
机构
[1] Iran Univ Sci & Technol, Informat Technol Engn, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Ind Engn, Tehran, Iran
关键词
Data mining; clustering; K-means algorithm; feature selection; FEATURE SUBSET-SELECTION; GRAVITATIONAL SEARCH ALGORITHM; PARTICLE SWARM OPTIMIZATION; MUTUAL INFORMATION; CLASSIFICATION; HYBRID; REDUCTION;
D O I
10.1080/00949655.2020.1822358
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Clustering is one of the main methods of data mining. K-means algorithm is one of the most common clustering algorithms due to its efficiency and ease of use. In many data mining issues, the dataset contains a large number of fields and, therefore, the identification of the effective fields is an important issue. Appling the proposed algorithm, the important variables of the dataset would be identified. In the proposed method, the dataset is clustered in several stages and in each step the characteristics of the created clusters are examined and the features that transform the structure of clusters are introduced as effective features of the dataset. The proposed method was examined on 4 datasets and the results of this method were compared with other similar work and demonstrated that using this algorithm would eliminate redundant and unrelated features of the dataset and improve classification accuracy.
引用
收藏
页码:581 / 604
页数:24
相关论文
共 50 条
  • [1] FSSOM: One novel SOM clustering algorithm based on feature selection
    Liu, Ming
    Liu, Yuan-Chao
    Wang, Xiao-Long
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 429 - 435
  • [2] A Clustering Based Genetic Algorithm for Feature Selection
    Rostami, Mehrdad
    Moradi, Parham
    [J]. 2014 6TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2014, : 112 - 116
  • [3] A fuzzy clustering based algorithm for feature selection
    Sun, HJ
    Wang, SR
    Mei, Z
    [J]. 2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1993 - 1998
  • [4] A feature selection Bayesian approach for a clustering genetic algorithm
    Hruschka, ER
    Hruschka, ER
    Ebecken, NFF
    [J]. DATA MINING IV, 2004, 7 : 181 - 192
  • [5] Interaction-based clustering algorithm for feature selection: a multivariate filter approach
    Ahmad Esfandiari
    Hamid Khaloozadeh
    Faezeh Farivar
    [J]. International Journal of Machine Learning and Cybernetics, 2023, 14 : 1769 - 1782
  • [6] Interaction-based clustering algorithm for feature selection: a multivariate filter approach
    Esfandiari, Ahmad
    Khaloozadeh, Hamid
    Farivar, Faezeh
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (05) : 1769 - 1782
  • [7] Balanced Spectral Clustering Algorithm Based on Feature Selection
    Luo, Qimin
    Lu, Guangquan
    Wen, Guoqiu
    Su, Zidong
    Liu, Xingyi
    Wei, Jian
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT II, 2022, 13088 : 356 - 367
  • [8] A Novel Intuitionistic Fuzzy Clustering Algorithm Based on Feature Selection for Multiple Object Tracking
    Li, Liang-qun
    Wang, Xiao-li
    Liu, Zong-xiang
    Xie, Wei-xin
    [J]. INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2019, 21 (05) : 1613 - 1628
  • [9] A Novel Intuitionistic Fuzzy Clustering Algorithm Based on Feature Selection for Multiple Object Tracking
    Liang-qun Li
    Xiao-li Wang
    Zong-xiang Liu
    Wei-xin Xie
    [J]. International Journal of Fuzzy Systems, 2019, 21 : 1613 - 1628
  • [10] Improved clustering approach based on fuzzy feature selection
    Wu, Naijun
    Li, Xiuyun
    Yang, Jie
    Liu, Peng
    [J]. 2007 INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1-3, 2007, : 479 - +