Pipelining the ranking techniques for microarray data classification: A case study

被引:16
|
作者
Dash, Rasmita [1 ]
Misra, Bijan Bihari [2 ]
机构
[1] Siksha O Anusandhan Univ, Inst Tech Educ & Res, Dept Comp Sc & Informat Technol, Bhubaneswar 751030, Odisha, India
[2] Silicon Inst Technol, Dept Comp Sc & Engn, Bhubaneswar 751024, Odisha, India
关键词
Microarray data; Feature selection; Feature ranking technique; Classification; Statistical test; DIFFERENTIALLY EXPRESSED GENES; FEATURE-SELECTION; PREDICTION; CANCER; ROBUST; OPTIMIZATION; REGRESSION; PROFILES;
D O I
10.1016/j.asoc.2016.07.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identification of relevant genes from microarray data is an apparent need in many applications. For such identification different ranking techniques with different evaluation criterion are used, which usually assign different ranks to the same gene. As a result, different techniques identify different gene subsets, which may not be the set of significant genes. To overcome such problems, in this study pipelining the ranking techniques is suggested. In each stage of pipeline, few of the lower ranked features are eliminated and at the end a relatively good subset of feature is preserved. However, the order in which the ranking techniques are used in the pipeline is important to ensure that the significant genes are preserved in the final subset. For this experimental study, twenty four unique pipeline models are generated out of four gene ranking strategies. These pipelines are tested with seven different microarray databases to find the suitable pipeline for such task. Further the gene subset obtained is tested with four classifiers and four performance metrics are evaluated. No single pipeline dominates other pipelines in performance; therefore a grading system is applied to the results of these pipelines to find out a consistent model. The finding of grading system that a pipeline model is significant is also established by Nemenyi post-hoc hypothetical test. Performance of this pipeline model is compared with four ranking techniques, though its performance is not superior always but majority of time it yields better results and can be suggested as a consistent model. However it requires more computational time in comparison to single ranking techniques. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:298 / 316
页数:19
相关论文
共 50 条
  • [31] MicroCBR: A case-based reasoning architecture for the classification of microarray data
    De Paz, Juan F.
    Bajo, Javier
    Vera, Vicente
    Corchado, Juan M.
    APPLIED SOFT COMPUTING, 2011, 11 (08) : 4496 - 4507
  • [32] Analysis of Microarray Gene Expression Data Using Various Feature Selection and Classification Techniques
    Singh, W. Jai
    Kavitha, R. K.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (11): : 105 - 108
  • [33] Fusion of Dimensionality Reduction Methods: a Case Study in Microarray Classification
    Deegalla, Sampath
    Bostrom, Henrik
    FUSION: 2009 12TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOLS 1-4, 2009, : 460 - +
  • [34] A benchmarking study of classification techniques for behavioral data
    Sofie De Cnudde
    David Martens
    Theodoros Evgeniou
    Foster Provost
    International Journal of Data Science and Analytics, 2020, 9 : 131 - 173
  • [35] Comparative Study of Classification Techniques for Weather Data
    Panjwani, Shweta
    Kumar, S. Naresh
    Ahuja, Laxmi
    ADVANCES IN COMPUTING AND DATA SCIENCES, ICACDS 2016, 2017, 721 : 572 - 576
  • [36] A benchmarking study of classification techniques for behavioral data
    De Cnudde, Sofie
    Martens, David
    Evgeniou, Theodoros
    Provost, Foster
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 9 (02) : 131 - 173
  • [37] Incorporating feature ranking and evolutionary methods for the classification of high-dimensional DNA microarray gene expression data
    Abedini, Mani
    Kirley, Michael
    Chiong, Raymond
    AUSTRALASIAN MEDICAL JOURNAL, 2013, 6 (05): : 272 - 279
  • [38] Multidimensional Visualization Techniques for Microarray Data
    Cvek, Urska
    Trutschl, Marjan
    Kilgore, Phillip C.
    Stone, Randolph, II
    Clifford, John L.
    15TH INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION (IV 2011), 2011, : 241 - 246
  • [39] Data mining techniques for microarray datasets
    Liu, L
    Yang, J
    Tung, AKH
    ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 1149 - 1149
  • [40] Stable classification with applications to microarray data
    Li, CS
    Cheng, C
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 47 (03) : 599 - 609