Spectral Clustering Based Unsupervised Feature Selection Algorithms

Authors
Xie, Juan-Ying [1]
Ding, Li-Juan [1,3]
Wang, Ming-Zhao [2]
Affiliations
[1] School of Computer Science, Shaanxi Normal University, Xi'an 710062, China
[2] College of Life Sciences, Shaanxi Normal University, Xi'an 710062, China
[3] College of Information Engineering, Engineering University of PAP, Xi'an 710086, China
Source
Ruan Jian Xue Bao/Journal of Software | 2020, Vol. 31, No. 04
Keywords
Feature Selection
DOI
10.13328/j.cnki.jos.005927
Abstract
Gene expression data usually comprise a small number of samples, each described by tens of thousands of genes, and many of those genes are unrelated to the disease under study. The primary task in analyzing such data is therefore to detect the key genes. Common feature selection algorithms depend on class labels, but labels are very difficult to obtain. To overcome this challenge, especially for gene expression data, an unsupervised feature selection approach named FSSC (feature selection by spectral clustering) is proposed. FSSC groups all features into clusters with a spectral clustering algorithm, so that similar features fall into the same cluster. Feature discernibility and feature independence are defined, and the importance of a feature is defined as the product of its discernibility and its independence. A representative feature is then selected from each cluster to construct the feature subset. Depending on the spectral clustering algorithm used in FSSC, three unsupervised feature selection algorithms are developed: FSSC-SD (FSSC based on standard deviation), FSSC-MD (FSSC based on mean distance), and FSSC-ST (FSSC based on self-tuning). SVM (support vector machine) and KNN (K-nearest neighbors) classifiers are used in the experiments to test the performance of the selected feature subsets. Experimental results on 10 gene expression datasets show that FSSC-SD, FSSC-MD, and FSSC-ST can select features with strong power to classify samples. © Copyright 2020, Institute of Software, the Chinese Academy of Sciences. All rights reserved.
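The selection step described in the abstract can be illustrated with a minimal sketch. It covers only the last stage: given a cluster label for each feature plus per-feature discernibility and independence scores, it computes importance as their product and keeps the most important feature of each cluster. The abstract does not give the formulas for discernibility or independence, nor the details of the spectral clustering step, so all three are treated here as precomputed inputs. The function name `select_representatives` and the toy scores are hypothetical, and picking the single highest-importance feature per cluster is an assumption about what "representative" means.

```python
def select_representatives(cluster_labels, discernibility, independence):
    """Return one feature index per cluster, sorted ascending.

    Assumption: the representative of a cluster is the feature with the
    largest importance, where importance = discernibility * independence.
    All three arguments are sequences with one entry per feature.
    """
    if not (len(cluster_labels) == len(discernibility) == len(independence)):
        raise ValueError("all inputs must have one entry per feature")

    best = {}  # cluster label -> (importance, feature index)
    for idx, label in enumerate(cluster_labels):
        importance = discernibility[idx] * independence[idx]
        # Strict '>' keeps the earliest feature when importances tie.
        if label not in best or importance > best[label][0]:
            best[label] = (importance, idx)

    return sorted(idx for _, idx in best.values())


# Toy input: six features already grouped into three clusters.
# The scores below are made up purely for illustration.
cluster_labels = [0, 0, 1, 1, 1, 2]
discernibility = [0.9, 0.4, 0.2, 0.8, 0.5, 0.3]
independence = [0.5, 0.9, 0.7, 0.6, 0.9, 1.0]

selected = select_representatives(cluster_labels, discernibility, independence)
print(selected)
```

In this toy input, feature 3 is chosen for cluster 1 even though feature 4 has the higher independence, because the product of the two scores decides. A feature with high discernibility but low independence can lose to a more balanced one in the same way.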
Pages: 1009-1024