A novel feature selection approach based on clustering algorithm

被引:6
|
作者
Moslehi, Fateme [1 ]
Haeri, Abdorrahman [2 ]
机构
[1] Iran Univ Sci & Technol, Informat Technol Engn, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Ind Engn, Tehran, Iran
关键词
Data mining; clustering; K-means algorithm; feature selection; FEATURE SUBSET-SELECTION; GRAVITATIONAL SEARCH ALGORITHM; PARTICLE SWARM OPTIMIZATION; MUTUAL INFORMATION; CLASSIFICATION; HYBRID; REDUCTION;
D O I
10.1080/00949655.2020.1822358
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Clustering is one of the main methods of data mining. K-means algorithm is one of the most common clustering algorithms due to its efficiency and ease of use. In many data mining issues, the dataset contains a large number of fields and, therefore, the identification of the effective fields is an important issue. Appling the proposed algorithm, the important variables of the dataset would be identified. In the proposed method, the dataset is clustered in several stages and in each step the characteristics of the created clusters are examined and the features that transform the structure of clusters are introduced as effective features of the dataset. The proposed method was examined on 4 datasets and the results of this method were compared with other similar work and demonstrated that using this algorithm would eliminate redundant and unrelated features of the dataset and improve classification accuracy.
引用
收藏
页码:581 / 604
页数:24
相关论文
共 50 条
  • [41] A Novel Approach for Feature Selection
    Swapna, Ch. Swetha
    Kumar, V. Vijaya
    Murthy, J. V. R.
    [J]. INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, 2015, 339 : 877 - 885
  • [42] Feature selection based on partition clustering
    Liu, Shuang
    Zhao, Qiang
    Wu, Xiang
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2014, 18 (02) : 135 - 142
  • [43] Unsupervised Feature Selection Technique Based on Genetic Algorithm for Improving the Text Clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [44] A novel feature selection algorithm based on LVQ hypothesis margin
    Hu, Yaomin
    Liu, Weiming
    [J]. NEURAL COMPUTING & APPLICATIONS, 2014, 24 (06): : 1431 - 1439
  • [45] A human body physiological feature selection algorithm based on filtering and improved clustering
    Chen, Bo
    Yu, Jie
    Gao, Xiu-e
    Zheng, Qing-Guo
    [J]. PLOS ONE, 2018, 13 (10):
  • [46] Introducing clustering based population in Binary Gravitational Search Algorithm for Feature Selection
    Guha, Ritam
    Ghosh, Manosij
    Chakrabarti, Akash
    Sarkar, Ram
    Mirjalili, Seyedali
    [J]. APPLIED SOFT COMPUTING, 2020, 93
  • [47] An Approach to Analyzing Subjective Text Based on Feature Selection Algorithm
    Tian Weixin
    Zheng Sheng
    [J]. 2011 AASRI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INDUSTRY APPLICATION (AASRI-AIIA 2011), VOL 2, 2011, : 266 - 270
  • [48] A new ensemble feature selection approach based on genetic algorithm
    Hongzhi Wang
    Chengquan He
    Zhuping Li
    [J]. Soft Computing, 2020, 24 : 15811 - 15820
  • [49] A novel community detection based genetic algorithm for feature selection
    Mehrdad Rostami
    Kamal Berahmand
    Saman Forouzandeh
    [J]. Journal of Big Data, 8
  • [50] A novel feature selection algorithm based on LVQ hypothesis margin
    Yaomin Hu
    Weiming Liu
    [J]. Neural Computing and Applications, 2014, 24 : 1431 - 1439