A fuzzy approach to clustering and selecting features for classification of gene expression data

Times cited: 0
Authors
Chitsaz, Elham [1]
Taheri, Mohammad [1]
Katebi, Seraj D.
Affiliations
[1] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
Keywords
bioinformatics; feature selection; fuzzy logic; clustering; mutual information;
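DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract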
Classification assigns a discrete value, called a label, to each sample in a dataset according to its feature values. In this research, we consider datasets that contain only a few samples but a very large number of features per sample. Most biological datasets, such as microarrays, have this property. A fundamental contribution of this article is a major extension of previous work on crisp data clustering. The new approach is based on fuzzy feature clustering, which is used to select the best features (genes). The proposed method has two advantages over the crisp method: first, it is more stable and converges faster; second, it improves the accuracy of the classifier that uses the selected features. Moreover, a novel method is proposed for discretizing continuous data using the Fisher criterion, and a new method for initializing cluster centers is suggested. The proposed method achieves a considerable improvement over the crisp version. The leukemia dataset is used to illustrate the effectiveness of the method.
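The abstract contrasts fuzzy feature clustering with its crisp counterpart without giving the update rules. As a rough illustration only, the sketch below uses the standard fuzzy c-means membership formula to show how one gene can belong to several clusters to different degrees, whereas a crisp method assigns it to a single cluster. The function names, the fuzzifier value m = 2, and the example distances are assumptions made for this sketch; the paper's actual distance measure (possibly based on mutual information, given the keywords) and its update rules are not stated in the abstract.

```python
def fuzzy_memberships(distances, m=2.0):
    """Standard fuzzy c-means memberships for one item (here, one gene).

    distances: distances from the item to each cluster center.
    m: fuzzifier (> 1); m = 2 is a common default, assumed here.
    """
    # An item lying exactly on one or more centers belongs fully to them.
    zeros = [i for i, d in enumerate(distances) if d == 0]
    if zeros:
        return [1.0 / len(zeros) if i in zeros else 0.0
                for i in range(len(distances))]
    # u_i is proportional to d_i ** (-2 / (m - 1)), normalized to sum to 1.
    exponent = 2.0 / (m - 1.0)
    weights = [d ** -exponent for d in distances]
    total = sum(weights)
    return [w / total for w in weights]


def crisp_assignment(distances):
    """Crisp counterpart: index of the single nearest center."""
    return min(range(len(distances)), key=lambda i: distances[i])


# Hypothetical distances from one gene to three cluster centers.
d = [1.0, 2.0, 4.0]
u = fuzzy_memberships(d)
print("fuzzy memberships:", [round(x, 4) for x in u])
print("crisp cluster index:", crisp_assignment(d))
```

In this sketch the memberships sum to 1 and decrease with distance, so a gene near the boundary between two clusters contributes to both rather than being forced into one.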
Pages: 1650 - 1655
Number of pages: 6