Feature selection for clustering problems: a hybrid algorithm that iterates between k-means and a Bayesian filter

Cited by: 0
Authors
Hruschka, ER [1 ]
Hruschka, ER [1 ]
Covoes, TF [1 ]
Ebecken, NFF [1 ]
Affiliations
[1] Univ Catolica Santos, Santos, Brazil
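Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract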
There are two fundamentally different approaches to feature selection: wrapper and filter. They can also be combined, yielding hybrid approaches. This paper describes a hybrid method for selecting relevant features in clustering problems. The proposed approach combines the widely known k-means algorithm with a Bayesian filter based on the Markov blanket concept. Since the number of clusters and the subset of relevant features are usually interrelated, we propose a method that iterates between clustering (assuming that the number of clusters is not known a priori) and filtering. Experiments on a number of datasets show that the proposed approach selects features that provide good partitions.
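The alternation between clustering and filtering described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the Bayesian filter in detail, so a simple variance-ratio filter (eta squared) stands in for the Markov blanket filter here, and the number of clusters is chosen with a centroid-based silhouette score, which is also an assumption. All function names, the threshold, and the toy data are hypothetical.

```python
import math
import random

def sqdist(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, k, iters=100, seed=0):
    """Plain Lloyd's k-means; returns (labels, centers)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    labels = []
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: sqdist(p, centers[c])) for p in points]
        updated = []
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            # An empty cluster keeps its previous center.
            updated.append([sum(col) / len(members) for col in zip(*members)]
                           if members else centers[c])
        if updated == centers:
            break
        centers = updated
    return labels, centers

def simplified_silhouette(points, labels, centers):
    """Centroid-based silhouette (assumed criterion for picking k)."""
    total = 0.0
    for p, l in zip(points, labels):
        a = math.sqrt(sqdist(p, centers[l]))
        b = min(math.sqrt(sqdist(p, c)) for j, c in enumerate(centers) if j != l)
        total += (b - a) / max(a, b) if max(a, b) > 0 else 0.0
    return total / len(points)

def variance_ratio(values, labels):
    """Share of a feature's variance explained by the clusters (eta squared).
    Stand-in for the Bayesian filter, NOT a Markov blanket computation."""
    mean = sum(values) / len(values)
    ss_total = sum((v - mean) ** 2 for v in values)
    if ss_total == 0:
        return 0.0
    ss_between = 0.0
    for c in set(labels):
        group = [v for v, l in zip(values, labels) if l == c]
        ss_between += len(group) * (sum(group) / len(group) - mean) ** 2
    return ss_between / ss_total

def hybrid_select(data, k_max=4, threshold=0.5, max_rounds=10):
    """Alternate clustering and filtering until the feature subset is stable."""
    selected = list(range(len(data[0])))
    best_k, best_labels = None, None
    for _ in range(max_rounds):
        view = [[row[f] for f in selected] for row in data]
        # Clustering step: k is not known a priori, so try several values.
        best_score = -2.0
        for k in range(2, k_max + 1):
            labels, centers = kmeans(view, k)
            score = simplified_silhouette(view, labels, centers)
            if score > best_score:
                best_score, best_k, best_labels = score, k, labels
        # Filtering step: keep features that the current partition explains well.
        kept = [f for f in selected
                if variance_ratio([row[f] for row in data], best_labels) >= threshold]
        if not kept or kept == selected:
            break
        selected = kept
    return selected, best_k, best_labels

# Toy data: features 0 and 1 separate two groups, feature 2 is pure noise.
rng = random.Random(42)
data = [[rng.gauss(m, 0.5), rng.gauss(m, 0.5), rng.uniform(0, 1)]
        for m in (0.0, 10.0) for _ in range(30)]
selected, k, labels = hybrid_select(data)
print(selected, k)
```

On this toy dataset the loop should discard the noise feature after the first round and stop once the selected subset no longer changes. Removing a feature changes the distances k-means sees, which is why the number of clusters is re-estimated in every round rather than fixed up front.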
Pages: 405-410
Number of pages: 6