A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO

被引:30
|
作者
Mandal, Monalisa [1 ]
Mukhopadhyay, Anirban [1 ]
机构
[1] Univ Kalyani, Dept Comp Sci & Engn, Kalyani 741235, W Bengal, India
来源
PLOS ONE | 2014年 / 9卷 / 03期
关键词
PARTICLE SWARM OPTIMIZATION; FEATURE-SELECTION; CANCER; CLASSIFICATION;
D O I
10.1371/journal.pone.0090949
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The purpose of feature selection is to identify the relevant and non-redundant features from a dataset. In this article, the feature selection problem is organized as a graph-theoretic problem where a feature-dissimilarity graph is shaped from the data matrix. The nodes represent features and the edges represent their dissimilarity. Both nodes and edges are given weight according to the feature's relevance and dissimilarity among the features, respectively. The problem of finding relevant and non-redundant features is then mapped into densest subgraph finding problem. We have proposed a multiobjective particle swarm optimization (PSO)-based algorithm that optimizes average node-weight and average edge-weight of the candidate subgraph simultaneously. The proposed algorithm is applied for identifying relevant and non-redundant disease-related genes from microarray gene expression data. The performance of the proposed method is compared with that of several other existing feature selection techniques on different real-life microarray gene expression datasets.
引用
收藏
页数:13
相关论文
共 7 条
  • [1] A novel PSO-based graph-theoretic approach for identifying most relevant and non-redundant gene markers from gene expression data
    Mandal, Monalisa
    Mukhopadhyay, Anirban
    [J]. INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2015, 30 (03) : 175 - 192
  • [2] Identifying Most Relevant Non-redundant Gene Markers from Gene Expression Data Using PSO-based Graph -Theoretic Approach
    Mandal, Monalisa
    Mukhopadhyay, Anirban
    [J]. 2012 2ND IEEE INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2012, : 374 - 379
  • [3] Identifying Non-Redundant Gene Markers from Microarray Data: A Multiobjective Variable Length PSO-Based Approach
    Mukhopadhyay, Anirban
    Mandal, Monalisa
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (06) : 1170 - 1183
  • [4] A graph-theoretic technique for classification of normal and tumor tissues using gene expression microarray data
    Kim, Saejoon
    [J]. 2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 4621 - 4624
  • [5] Clustering gene expression data using a graph-theoretic approach: an application of minimum spanning trees
    Xu, Y
    Olman, V
    Xu, D
    [J]. BIOINFORMATICS, 2002, 18 (04) : 536 - 545
  • [6] Fuzzy Rule-based Classifier for Microarray Gene Expression Data by using a Multiobjective PSO-based Approach
    Mandal, Monalisa
    Mukhopadhyay, Anirban
    Maulik, Ujjwal
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ - IEEE 2013), 2013,
  • [7] MapReduce Algorithms for Inferring Gene Regulatory Networks from Time-Series Microarray Data Using an Information-Theoretic Approach
    Abduallah, Yasser
    Turki, Turki
    Byron, Kevin
    Du, Zongxuan
    Cervantes-Cervantes, Miguel
    Wang, Jason T. L.
    [J]. BIOMED RESEARCH INTERNATIONAL, 2017, 2017