Pipelining the ranking techniques for microarray data classification: A case study

被引:16
|
作者
Dash, Rasmita [1 ]
Misra, Bijan Bihari [2 ]
机构
[1] Siksha O Anusandhan Univ, Inst Tech Educ & Res, Dept Comp Sc & Informat Technol, Bhubaneswar 751030, Odisha, India
[2] Silicon Inst Technol, Dept Comp Sc & Engn, Bhubaneswar 751024, Odisha, India
关键词
Microarray data; Feature selection; Feature ranking technique; Classification; Statistical test; DIFFERENTIALLY EXPRESSED GENES; FEATURE-SELECTION; PREDICTION; CANCER; ROBUST; OPTIMIZATION; REGRESSION; PROFILES;
D O I
10.1016/j.asoc.2016.07.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identification of relevant genes from microarray data is an apparent need in many applications. For such identification different ranking techniques with different evaluation criterion are used, which usually assign different ranks to the same gene. As a result, different techniques identify different gene subsets, which may not be the set of significant genes. To overcome such problems, in this study pipelining the ranking techniques is suggested. In each stage of pipeline, few of the lower ranked features are eliminated and at the end a relatively good subset of feature is preserved. However, the order in which the ranking techniques are used in the pipeline is important to ensure that the significant genes are preserved in the final subset. For this experimental study, twenty four unique pipeline models are generated out of four gene ranking strategies. These pipelines are tested with seven different microarray databases to find the suitable pipeline for such task. Further the gene subset obtained is tested with four classifiers and four performance metrics are evaluated. No single pipeline dominates other pipelines in performance; therefore a grading system is applied to the results of these pipelines to find out a consistent model. The finding of grading system that a pipeline model is significant is also established by Nemenyi post-hoc hypothetical test. Performance of this pipeline model is compared with four ranking techniques, though its performance is not superior always but majority of time it yields better results and can be suggested as a consistent model. However it requires more computational time in comparison to single ranking techniques. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:298 / 316
页数:19
相关论文
共 50 条
  • [1] Tumor classification ranking from microarray data
    Hewett, Rattikorn
    Kijsanayothin, Phongphun
    BMC GENOMICS, 2008, 9 (Suppl 2)
  • [2] A two stage grading approach for feature selection and classification of microarray data using Pareto based feature ranking techniques: A case study
    Dash, Rasmita
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2020, 32 (02) : 232 - 247
  • [3] Tumor classification ranking from microarray data
    Rattikorn Hewett
    Phongphun Kijsanayothin
    BMC Genomics, 9
  • [4] On the classification techniques in data mining for microarray data classification
    Aydadenta, Husna
    Adiwijaya
    INTERNATIONAL CONFERENCE ON DATA AND INFORMATION SCIENCE (ICODIS), 2018, 971
  • [5] A hybrid approach to feature ranking for microarray data classification
    Popovic, Dusan
    Sifrim, Alejandro
    Moschopoulos, Charalampos
    Moreau, Yves
    De Moor, Bart
    Communications in Computer and Information Science, 2013, 384 : 241 - 248
  • [6] A Hybrid Approach to Feature Ranking for Microarray Data Classification
    Popovic, Dusan
    Sifrim, Alejandro
    Moschopoulos, Charalampos
    Moreau, Yves
    De Moor, Bart
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, PT II, 2013, 384 : 241 - 248
  • [7] Hybrid Classification Techniques for Microarray Data
    B. Jaison
    A. Chilambuchelvan
    K. A. Mohamed Junaid
    National Academy Science Letters, 2015, 38 : 415 - 419
  • [8] Hybrid Classification Techniques for Microarray Data
    Jaison, B.
    Chilambuchelvan, A.
    Junaid, K. A. Mohamed
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2015, 38 (05): : 415 - 419
  • [9] Performance analysis of clustering techniques over microarray data: A case study
    Dash, Rasmita
    Misra, Bijan Bihari
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 493 : 162 - 176
  • [10] A Ranking Approach for Probe Selection and Classification of Microarray Data with Artificial Neural Networks
    Chagas Faria, Alexandre Wagner
    Da Silva, Alisson Marques
    Rodrigues, Thiago de Souza
    Costa, Marcelo Azevedo
    Braga, Antonio Padua
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2015, 22 (10) : 953 - 961