A fuzzy approach to clustering and selecting features for classification of gene expression data

被引:0
|
作者
Chitsaz, Elham [1 ]
Taheri, Mohammad [1 ]
Katebi, Seraj D.
机构
[1] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
关键词
bioinformatics; feature selection; fuzzy logic; clustering; mutual information;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Classification assigns a discrete value named label to each sample in a dataset with respect to its feature values. In this research, we aim to consider some datasets which contain a few samples whereas a huge amount of features are provided for each sample. Most of biological datasets such as micro-arrays has this property. A fundamental contribution of this article is a major extension of pervious works for crisp data clustering. The new approach is based on fuzzy feature clustering which is utilized to select the best features (genes). The proposed method has two advantages over the crisp method. Firstly, it leads to more stability and faster convergence; secondly, it improves the accuracy of the classifier using the selected features. Moreover, in this paper a novel method has been proposed for the discretization of continuous data using the Fisher criterion. In addition, a new method for initialization of cluster centers is suggested. The proposed method has achieved a considerable improvement compared with the crisp version. The leukemia dataset has been used to illustrate the effectiveness of the method.
引用
收藏
页码:1650 / 1655
页数:6
相关论文
共 50 条
  • [21] Fuzzy clustering algorithm of kernel for gene expression data analysis
    Chen, Zhiru
    Hong, Wenxue
    Wang, Changwu
    ICIC Express Letters, 2009, 3 (04): : 1435 - 1440
  • [22] Selecting informative genes from leukemia gene expression data using a hybrid approach for cancer classification
    Mohamad, Mohd Saberi
    Deris, Safaai
    Hashim, Siti Zaiton Mohd
    3RD KUALA LUMPUR INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING 2006, 2007, 15 : 528 - +
  • [23] HykGene: a hybrid approach for selecting marker genes for phenotype classification using microarray gene expression data
    Wang, YH
    Makedon, FS
    Ford, JC
    Pearlman, J
    BIOINFORMATICS, 2005, 21 (08) : 1530 - 1537
  • [24] Multitask fuzzy Bregman co-clustering approach for clustering data with multisource features
    Sokhandan, Alireza
    Adibi, Peyman
    Sajadi, Mohammadreza
    NEUROCOMPUTING, 2017, 247 : 102 - 114
  • [25] Rotation Clustering: A Consensus Clustering Approach to Cluster Gene Expression Data
    Galdi, Paola
    Serra, Angela
    Tagliaferri, Roberto
    FUZZY LOGIC AND SOFT COMPUTING APPLICATIONS, WILF 2016, 2017, 10147 : 229 - 238
  • [26] Attribute clustering for grouping, selection, and classification of gene expression data
    Au, WH
    Chan, KCC
    Wong, AKC
    Wang, Y
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2005, 2 (02) : 83 - 101
  • [27] Ensemble classification for gene expression data based on parallel clustering
    Meng, Jun
    Jiang, Dingling
    Zhang, Jing
    Luan, Yushi
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 20 (03) : 213 - 229
  • [28] Fuzzy clustering in classification using weighted features
    Bandeira, LPC
    Sousa, JMC
    Kaymak, U
    FUZZY SETS AND SYSTEMS - IFSA 2003, PROCEEDINGS, 2003, 2715 : 560 - 567
  • [29] A pattern matching approach for clustering gene expression data
    Das, Rosy
    Kalita, Jugal
    Bhattacharyya, Dhruba K.
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2011, 3 (02) : 130 - 149
  • [30] An approach for clustering gene expression data with error information
    Brian Tjaden
    BMC Bioinformatics, 7