Distance based feature selection for clustering microarray data

被引:0
|
作者
Dash, Manoranjan [1 ]
Gopalkrishnan, Vivekanand [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
关键词
feature selection; clustering; distance function; microarray data;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In microarray data, clustering is the fundamental task for separating genes into biologically functional groups or for classifying tissues and phenotypes. Recently, with innovative gene expression microarray data technologies, thousands of expression levels of genes (features) can be measured simultaneously in a single experiment. The large number of genes with a lot of noise causes high complexity for cluster analysis. This challenge has raised the demand for feature selection - an effective dimensionality reduction technique that removes noisy features. In this paper we propose a novel filter method for feature selection. The suggested method, called ClosestFS, is based on a distance measure. For each feature, the distance is evaluated by computing its impact on the histogram for the whole data. Our experimental results show that the quality of clustering results (evaluated by several widely used measures) of K-means algorithm using ClosestFS as the pre-processing step is significantly better than that of the pure K-means.
引用
收藏
页码:512 / 519
页数:8
相关论文
共 50 条
  • [1] Spectral Clustering and Feature Selection for Microarray Data
    Garcia-Garcia, Dario
    Santos-Rodriguez, Raul
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 425 - 428
  • [2] Graph-based unsupervised feature selection and multiview clustering for microarray data
    Swarnkar, Tripti
    Mitra, Pabitra
    [J]. JOURNAL OF BIOSCIENCES, 2015, 40 (04) : 755 - 767
  • [3] Graph-based unsupervised feature selection and multiview clustering for microarray data
    Tripti Swarnkar
    Pabitra Mitra
    [J]. Journal of Biosciences, 2015, 40 : 755 - 767
  • [4] A Clustering Based Feature Selection Method Using Feature Information Distance for Text Data
    Chao, Shilong
    Cai, Jie
    Yang, Sheng
    Wang, Shulin
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT I, 2016, 9771 : 122 - 132
  • [5] Clustering-based hybrid feature selection approach for high dimensional microarray data
    Babu, Samson Anosh P.
    Annavarapu, Chandra Sekhara Rao
    Dara, Suresh
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 213
  • [6] Robust microarray data feature selection using a correntropy based distance metric learning approach
    Vahabzadeh, Venus
    Moattar, Mohammad Hossein
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 161
  • [7] Graph Based Unsupervised Feature Selection for Microarray Data
    Swarnkar, Tripti
    Mitra, Pabitra
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [8] A Clustering Approach for Feature Selection in Microarray Data Classification Using Random forest
    Aydadenta, Husna
    Adiwijaya
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2018, 14 (05): : 1167 - 1175
  • [9] Feature Genes Selection of Adult ALL Microarray Data with Affinity Propagation Clustering
    Chuang, Chen-Chia
    Li, Yan-Cheng
    Jeng, Jin-Tsong
    Chang, Chih-Kai
    Wang, Zhi-Qian
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2015, : 230 - 231
  • [10] FEATURE DISCRETIZATION AND SELECTION IN MICROARRAY DATA
    Ferreira, Artur
    Figueiredo, Mario
    [J]. KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 465 - 469