A fuzzy approach to clustering and selecting features for classification of gene expression data

被引:0
|
作者
Chitsaz, Elham [1 ]
Taheri, Mohammad [1 ]
Katebi, Seraj D.
机构
[1] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
关键词
bioinformatics; feature selection; fuzzy logic; clustering; mutual information;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Classification assigns a discrete value named label to each sample in a dataset with respect to its feature values. In this research, we aim to consider some datasets which contain a few samples whereas a huge amount of features are provided for each sample. Most of biological datasets such as micro-arrays has this property. A fundamental contribution of this article is a major extension of pervious works for crisp data clustering. The new approach is based on fuzzy feature clustering which is utilized to select the best features (genes). The proposed method has two advantages over the crisp method. Firstly, it leads to more stability and faster convergence; secondly, it improves the accuracy of the classifier using the selected features. Moreover, in this paper a novel method has been proposed for the discretization of continuous data using the Fisher criterion. In addition, a new method for initialization of cluster centers is suggested. The proposed method has achieved a considerable improvement compared with the crisp version. The leukemia dataset has been used to illustrate the effectiveness of the method.
引用
收藏
页码:1650 / 1655
页数:6
相关论文
共 50 条
  • [31] A Novel Soft Clustering Approach for Gene Expression Data
    Kavitha, E.
    Tamilarasan, R.
    Baladhandapani, Arunadevi
    Kannan, M. K. Jayanthi
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 43 (03): : 871 - 886
  • [32] An approach for clustering gene expression data with error information
    Tjaden, B
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [33] Selecting gene features for unsupervised analysis of single-cell gene expression data
    Sheng, Jie
    Li, Wei Vivian
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [34] Incorporating gene ontology into fuzzy relational clustering of microarray gene expression data
    Paul, Animesh Kumar
    Shill, Pintu Chandra
    BIOSYSTEMS, 2018, 163 : 1 - 10
  • [35] A fuzzy clustering approach for transients classification
    Zio, E
    Baraldi, P
    APPLIED COMPUTATIONAL INTELLIGENCE, 2004, : 573 - 578
  • [36] A fuzzy approach for analyzing outliers in gene expression data
    Yousri, Noha A.
    Kamel, Mohamed S.
    Ismail, Mohamed A.
    BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 207 - +
  • [37] A fuzzy logic approach to analyzing gene expression data
    Woolf, PJ
    Wang, YX
    PHYSIOLOGICAL GENOMICS, 2000, 3 (01) : 9 - 15
  • [38] Interval-valued fuzzy set approach to fuzzy co-clustering for data classification
    Van Nha Pham
    Long Thanh Ngo
    Pedrycz, Witold
    KNOWLEDGE-BASED SYSTEMS, 2016, 107 : 1 - 13
  • [39] Impact of missing data imputation methods on gene expression clustering and classification
    de Souto, Marcilio C. P.
    Jaskowiak, Pablo A.
    Costa, Ivan G.
    BMC BIOINFORMATICS, 2015, 16
  • [40] Impact of missing data imputation methods on gene expression clustering and classification
    Marcilio CP de Souto
    Pablo A Jaskowiak
    Ivan G Costa
    BMC Bioinformatics, 16