Gene-Expression Data Semi-Supervised Clustering in Multi-Objective Optimization Framework

被引:0
|
作者
Alok, Abhay Kumar [1 ]
Saha, Sriparna [1 ]
Ekbal, Asif [1 ]
机构
[1] Indian Inst Technol, Comp Sci Engn, Patna 800013, Bihar, India
关键词
Multiobjective optimization; Sym-index; FCM index; I-index; ARI-index; XB-index; Semi-supervised clustering; AMOSA; Fuzzy C-means; Silhouette-index; ALGORITHM;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Studying the patterns hidden in gene expression data helps to understand the functionality of genes. But due to the large collection of genes and the complicated biological networks it is hard to study the generated large volume of data which often contains millions of measurements. In general clustering techniques are used to determine natural structures and capture exciting patterns from the given data as a first step of studying the gene expression data. In this paper the problem of gene expression data clustering is formulated as a semi-supervised classification problem. So here semi-supervised clustering is modelled as multi-objective optimization problems. Here five objective functions are used and simultaneously optimized by AMOSA. Among the five objective functions, first four objective functions quantify some unsupervised properties like total symmetry, compactness and separability present in the clusters and last one captures the supervised information. In order to generate the supervised information, Fuzzy C-means algorithm is invoked on the data sets. Based on the highest membership values of data points with respect to different clusters, labeled information are extracted. In each case only 10% class labeled information of data points are randomly selected which act as supervised information in case of semi-supervised clustering. The effectiveness of this proposed semi-supervised clustering technique is demonstrated on three publicly available benchmark gene expression data sets. Results are compared with existing techniques for gene expression data clustering.
引用
收藏
页码:1081 / 1086
页数:6
相关论文
共 50 条
  • [1] Semi-supervised clustering for gene-expression data in multiobjective optimization framework
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (02) : 421 - 439
  • [2] Semi-supervised clustering for gene-expression data in multiobjective optimization framework
    Abhay Kumar Alok
    Sriparna Saha
    Asif Ekbal
    [J]. International Journal of Machine Learning and Cybernetics, 2017, 8 : 421 - 439
  • [3] Multi-objective semi-supervised clustering algorithm based on constraint set optimization for gene expression data
    Zhao, Minghui
    Li, Dan
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6570 - 6575
  • [4] A new semi-supervised clustering technique using multi-objective optimization
    Abhay Kumar Alok
    Sriparna Saha
    Asif Ekbal
    [J]. Applied Intelligence, 2015, 43 : 633 - 661
  • [5] A new semi-supervised clustering technique using multi-objective optimization
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    [J]. APPLIED INTELLIGENCE, 2015, 43 (03) : 633 - 661
  • [6] Simultaneous Feature Selection and Semi-supervised Clustering for Gene-Expression Data
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    Kanekar, Neha
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [7] Multi-objective Semi-supervised clustering for finding predictive clusters
    Ghasemi, Zahra
    Khorshidi, Hadi Akbarzadeh
    Aickelin, Uwe
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 195
  • [8] MULTI-OBJECTIVE OPTIMIZATION FOR SEMI-SUPERVISED DISCRIMINATIVE LANGUAGE MODELING
    Kobayashi, Akio
    Oku, Takahiro
    Imai, Toru
    Nakagawa, Seiichi
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4997 - 5000
  • [9] Multi-objective semi-supervised clustering of tissue samples for cancer diagnosis
    Sriparna Saha
    Kuldeep Kaushik
    Abhay Kumar Alok
    Sudipta Acharya
    [J]. Soft Computing, 2016, 20 : 3381 - 3392
  • [10] Multi-objective semi-supervised clustering of tissue samples for cancer diagnosis
    Saha, Sriparna
    Kaushik, Kuldeep
    Alok, Abhay Kumar
    Acharya, Sudipta
    [J]. SOFT COMPUTING, 2016, 20 (09) : 3381 - 3392