Feature selection for clustering problems:: a hybrid algorithm that iterates between k-means and a Bayesian filter

被引:0
|
作者
Hruschka, ER [1 ]
Hruschka, ER [1 ]
Covoes, TF [1 ]
Ebecken, NFF [1 ]
机构
[1] Univ Catolica Santos, Santos, Brazil
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are two fundamentally different approaches for feature selection: wrapper and filter. It is also possible to combine them, obtaining hybrid approaches. This paper describes a hybrid method for selecting relevant features in clustering problems. The proposed approach is based on the combination of the widely known k-means algorithm and a Bayesian filter, which is based on the Markov Blanket concept. Since the number of clusters and the subset of relevant features are usually inter-related, we propose a method that iterates between clustering (assuming that the number of clusters is not known a priori) and filtering. Experiments in a number of datasets show that the proposed approach allows selecting features that provide good partitions.
引用
收藏
页码:405 / 410
页数:6
相关论文
共 50 条
  • [1] Feature Selection Algorithm Based on K-means Clustering
    Tang, Xue
    Dong, Min
    Bi, Sheng
    Pei, Maofeng
    Cao, Dan
    Xie, Cheche
    Chi, Sunhuang
    [J]. 2017 IEEE 7TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2017, : 1522 - 1527
  • [2] Unsupervised Bayesian feature selection based on k-means clustering
    Yan, Liu
    Yan, Peng
    [J]. IC-BNMT 2007: PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON BROADBAND NETWORK & MULTIMEDIA TECHNOLOGY, 2007, : 352 - 356
  • [3] Feature selection for k-means clustering stability: theoretical analysis and an algorithm
    Dimitrios Mavroeidis
    Elena Marchiori
    [J]. Data Mining and Knowledge Discovery, 2014, 28 : 918 - 960
  • [4] Feature selection for k-means clustering stability: theoretical analysis and an algorithm
    Mavroeidis, Dimitrios
    Marchiori, Elena
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (04) : 918 - 960
  • [5] Deterministic Feature Selection for k-Means Clustering
    Boutsidis, Christos
    Magdon-Ismail, Malik
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2013, 59 (09) : 6099 - 6110
  • [6] K-means Clustering with Feature Selection for Stream Data
    Wang, Xiao-dong
    Chen, Rung-Ching
    Yan, Fei
    Hendry
    [J]. 2018 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2018), 2018, : 453 - 456
  • [7] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    [J]. 2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
  • [8] Improved K-means clustering algorithm based on feature selection and removal on target point
    Yang, Hua-Hui
    Meng, Chen
    Wang, Cheng
    Yao, Yun-Zhi
    [J]. Kongzhi yu Juece/Control and Decision, 2019, 34 (06): : 1219 - 1226
  • [9] A Cheap Feature Selection Approach for the K-Means Algorithm
    Capo, Marco
    Perez, Aritz
    Lozano, Jose A.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2195 - 2208
  • [10] Selection of K in K-means clustering
    Pham, DT
    Dimov, SS
    Nguyen, CD
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2005, 219 (01) : 103 - 119