Semi-supervised clustering for gene-expression data in multiobjective optimization framework

被引:0
|
作者
Abhay Kumar Alok
Sriparna Saha
Asif Ekbal
机构
[1] Indian Institute of Technology,Computer Science Engineering
关键词
Gene expression data clustering; Semi-supervised classification; Multiobjective optimization; Cluster validity index;
D O I
暂无
中图分类号
学科分类号
摘要
Studying the patterns hidden in gene expression data helps to understand the functionality of genes. But due to the large volume of genes and the complexity of biological networks it is difficult to study the resulting mass of data which often consists of millions of measurements. In order to reveal natural structures and to identify interesting patterns from the given gene expression data set, clustering techniques are applied. Semi-supervised classification is a new direction of machine learning. It requires huge unlabeled data and a few labeled data. Semi-supervised classification in general performs better than unsupervised classification. But to the best of our knowledge there are no works for solving gene expression data clustering problem using semi-supervised classification techniques. In the current paper we have made an attempt to solve the gene expression data clustering problem using a multiobjective optimization based semi-supervised classification technique with the aim to attain good quality partitions by using few labeled data. In order to generate the labeled data, initially Fuzzy C-means clustering technique is applied. In order to automatically determine the partitioning, multiple cluster centers corresponding to a cluster are encoded in the form of a string. In order to compute the quality of the obtained partitioning, values of five objective functions are computed. The effectiveness of this proposed semi-supervised clustering technique is demonstrated on five publicly available benchmark gene expression data sets. Comparison results with the existing techniques for gene expression data clustering prove that the proposed method is the most effective one. Statistical and biological significance tests have also been carried out.
引用
收藏
页码:421 / 439
页数:18
相关论文
共 50 条
  • [1] Semi-supervised clustering for gene-expression data in multiobjective optimization framework
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (02) : 421 - 439
  • [2] Gene-Expression Data Semi-Supervised Clustering in Multi-Objective Optimization Framework
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 1081 - 1086
  • [3] Simultaneous Feature Selection and Semi-supervised Clustering for Gene-Expression Data
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    Kanekar, Neha
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [4] Semi-Supervised Clustering Using Multiobjective Optimization
    Saha, Sriparna
    Ekbal, Asif
    Alok, Abhay Kumar
    [J]. 2012 12TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2012, : 360 - 365
  • [5] On semi-supervised clustering via multiobjective optimization
    Handl, Julia
    Knowles, Joshua
    [J]. GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2006, : 1465 - +
  • [6] Semi-supervised consensus clustering for gene expression data analysis
    Wang, Yunli
    Pan, Youlian
    [J]. BIODATA MINING, 2014, 7
  • [7] Simultaneous Feature Selection and Unsupervised Clustering for Gene-Expression Data in Multiobjective Optimization Framework
    Alok, Abhay Kumar
    Kanekar, Neha
    Saha, Sriparna
    Ekbal, Asif
    [J]. 2014 9TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2014, : 691 - 696
  • [8] Semi-supervised consensus clustering for gene expression data analysis
    Yunli Wang
    Youlian Pan
    [J]. BioData Mining, 7
  • [9] Feature selection and semi-supervised clustering using multiobjective optimization
    Saha, Sriparna
    Ekbal, Asif
    Alok, Abhay Kumar
    Spandana, Rachamadugu
    [J]. SPRINGERPLUS, 2014, 3
  • [10] Feature Selection and Semi-supervised Clustering Using Multiobjective Optimization
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    [J]. 2014 INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE ISCMI 2014, 2014, : 126 - 129