A Local SVD Framework for Stable Feature Selection for Clustering

被引:0
|
作者
Alelyani, Salem [1 ]
Liu, Huan [2 ]
机构
[1] King Khalid Univ, Coll Comp Sci, Abha, Saudi Arabia
[2] Arizona State Univ, Ira A Fulton Sch Engn, Tempe, AZ USA
关键词
Feature Selection; Singular Value Decomposition; Clustering; Stability;
D O I
10.1109/IRI.2015.47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection for clustering is a challenging problem due to the absence of class labels. Existing approaches can select a feature subset to maintain clustering performance while reducing dimensionality. However, we are faced with two problems: (1) there could be many sets of features that seem equally good, and (2) these features are sensitive to small data perturbation, or the selection instability problem. In this work, we investigate the stability problem in feature selection for clustering. To the best of our knowledge, this is the first work that aims to improve the stability of feature selection algorithms for clustering. The importance comes from the fact that stable selection provides consistent meaning for clusters. In this paper, we first formally define the problem and propose a Local Singular Value Decomposition (LSVD) framework for stable and accurate feature selection. Empirical results on various data sets show that the proposed framework can significantly improve selection stability whilst maintaining the clustering performance comparing to the baseline methods. An additional advantage of this approach is that the selected features preserve the physical meaning of the original features, a desirable property for subsequent data analysis.
引用
收藏
页码:236 / 243
页数:8
相关论文
共 50 条
  • [41] Feature selection for clustering - A filter solution
    Dash, M
    Choi, K
    Scheuermann, P
    Liu, H
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 115 - 122
  • [42] SVD based feature selection and sample classification of proteomic data
    D'Addabbo, Annarita
    Papale, Massimo
    Di Paolo, Salvatore
    Magaldi, Simona
    Colella, Roberto
    d'Onofrio, Valentina
    Di Palma, Annamaria
    Ranieri, Elena
    Gesualdo, Loreto
    Ancona, Nicola
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2008, 5179 : 556 - +
  • [43] Feature selection via fuzzy clustering
    Sun, Hao-Jun
    Sun, Mei
    Mei, Zhen
    [J]. PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1400 - +
  • [44] Feature Selection and Semisupervised Fuzzy Clustering
    Kong, Yi-qing
    Wang, Shi-tong
    [J]. FUZZY INFORMATION AND ENGINEERING, 2009, 1 (02) : 179 - 190
  • [45] Greedy Feature Selection for Subspace Clustering
    Dyer, Eva L.
    Sankaranarayanan, Aswin C.
    Baraniuk, Richard G.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 2487 - 2517
  • [46] Clustering-based feature selection
    School of Informatics, Guangdong University of Foreign Studies, Guangzhou 510006, China
    [J]. Tien Tzu Hsueh Pao, 2008, SUPPL. (157-160):
  • [47] A filter feature selection method for clustering
    Jouve, PE
    Nicoloyannis, N
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, 3488 : 583 - 593
  • [48] A survey on feature selection approaches for clustering
    Emrah Hancer
    Bing Xue
    Mengjie Zhang
    [J]. Artificial Intelligence Review, 2020, 53 : 4519 - 4545
  • [49] FEATURE-SELECTION BY INTERACTIVE CLUSTERING
    WISMATH, SK
    SOONG, HP
    AKL, SG
    [J]. PATTERN RECOGNITION, 1981, 14 (1-6) : 75 - 80
  • [50] Unsupervised feature selection for balanced clustering
    Zhou, Peng
    Chen, Jiangyong
    Fan, Mingyu
    Du, Liang
    Shen, Yi-Dong
    Li, Xuejun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 193