A Local SVD Framework for Stable Feature Selection for Clustering

被引:0
|
作者
Alelyani, Salem [1 ]
Liu, Huan [2 ]
机构
[1] King Khalid Univ, Coll Comp Sci, Abha, Saudi Arabia
[2] Arizona State Univ, Ira A Fulton Sch Engn, Tempe, AZ USA
关键词
Feature Selection; Singular Value Decomposition; Clustering; Stability;
D O I
10.1109/IRI.2015.47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection for clustering is a challenging problem due to the absence of class labels. Existing approaches can select a feature subset to maintain clustering performance while reducing dimensionality. However, we are faced with two problems: (1) there could be many sets of features that seem equally good, and (2) these features are sensitive to small data perturbation, or the selection instability problem. In this work, we investigate the stability problem in feature selection for clustering. To the best of our knowledge, this is the first work that aims to improve the stability of feature selection algorithms for clustering. The importance comes from the fact that stable selection provides consistent meaning for clusters. In this paper, we first formally define the problem and propose a Local Singular Value Decomposition (LSVD) framework for stable and accurate feature selection. Empirical results on various data sets show that the proposed framework can significantly improve selection stability whilst maintaining the clustering performance comparing to the baseline methods. An additional advantage of this approach is that the selected features preserve the physical meaning of the original features, a desirable property for subsequent data analysis.
引用
收藏
页码:236 / 243
页数:8
相关论文
共 50 条
  • [1] A Framework for Feature Selection in Clustering
    Witten, Daniela M.
    Tibshirani, Robert
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (490) : 713 - 726
  • [2] Local Feature Selection in Text Clustering
    Ribeiro, Marcelo N.
    Neto, Manoel J. R.
    Prudencio, Ricardo B. C.
    [J]. ADVANCES IN NEURO-INFORMATION PROCESSING, PT II, 2009, 5507 : 45 - +
  • [3] A Local Feature Selection Approach for Clustering
    Gui, Bing
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS (ISKE 2011), 2011, 122 : 55 - 62
  • [4] Feature Selection for Local Learning Based Clustering
    Zeng, Hong
    Cheung, Yiu-ming
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 414 - 425
  • [5] A Feature Selection Framework Based on Supervised Data Clustering
    Liu, Hongzhi
    Fu, Bin
    Jiang, Zhengshen
    Wu, Zhonghai
    Hsu, D. Frank
    [J]. 2016 IEEE 15TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2016, : 316 - 321
  • [6] A CNN BASED FRAMEWORK FOR STABLE IMAGE FEATURE SELECTION
    Han, Chaoyi
    Tao, Xiaoming
    Duan, Yiping
    Liu, Xijia
    Lu, Jianhua
    [J]. 2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1402 - 1406
  • [7] Group sparse feature selection on local learning based clustering
    Wu, Yue
    Wang, Can
    Bu, Jiajun
    Chen, Chun
    [J]. NEUROCOMPUTING, 2016, 171 : 1118 - 1130
  • [8] Flexible Subspace Clustering: A Joint Feature Selection and K-Means Clustering Framework
    Long, Zhong-Zhen
    Xu, Guoxia
    Du, Jiao
    Zhu, Hu
    Yan, Taiyu
    Yu, Yu-Feng
    [J]. BIG DATA RESEARCH, 2021, 23
  • [9] Feature selection for clustering
    Dash, M
    Liu, H
    [J]. KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 110 - 121
  • [10] HMOSHSSA: a novel framework for solving simultaneous clustering and feature selection problems
    Kumar, Vijay
    Kumari, Rajani
    Kumar, Sandeep
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (35) : 82149 - 82175