GECC: Gene Expression Based Ensemble Classification of Colon Samples

被引:25
|
作者
Rathore, Saima [1 ,2 ]
Hussain, Mutawarra [1 ]
Khan, Asifullah [1 ]
机构
[1] Pakistan Inst Engn & Appl Sci, Dept Comp & Informat Sci, Islamabad, Pakistan
[2] Univ Azad Jammu & Kashmir, Dept Comp Sci & Informat Technol, Muzaffarabad, Pakistan
关键词
Colon cancer; ensemble classification; gene expressions; PCA; mRMR; F-Score; chi-square; FEATURE-SELECTION; CANCER; PREDICTION; MACHINE; PROFILES; TUMOR; SVM; INFORMATION; PATTERNS;
D O I
10.1109/TCBB.2014.2344655
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene expression deviates from its normal composition in case a patient has cancer. This variation can be used as an effective tool to find cancer. In this study, we propose a novel gene expressions based colon classification scheme (GECC) that exploits the variations in gene expressions for classifying colon gene samples into normal and malignant classes. Novelty of GECC is in two complementary ways. First, to cater overwhelmingly larger size of gene based data sets, various feature extraction strategies, like, chi-square, F-Score, principal component analysis (PCA) and minimum redundancy and maximum relevancy (mRMR) have been employed, which select discriminative genes amongst a set of genes. Second, a majority voting based ensemble of support vector machine (SVM) has been proposed to classify the given gene based samples. Previously, individual SVM models have been used for colon classification, however, their performance is limited. In this research study, we propose an SVM-ensemble based new approach for gene based classification of colon, wherein the individual SVM models are constructed through the learning of different SVM kernels, like, linear, polynomial, radial basis function (RBF), and sigmoid. The predicted results of individual models are combined through majority voting. In this way, the combined decision space becomes more discriminative. The proposed technique has been tested on four colon, and several other binary-class gene expression data sets, and improved performance has been achieved compared to previously reported gene based colon cancer detection techniques. The computational time required for the training and testing of 208 x 5,851 data set has been 591.01 and 0.019 s, respectively.
引用
收藏
页码:1131 / 1145
页数:15
相关论文
共 50 条
  • [41] Gene boosting for cancer classification based on gene expression profiles
    Hong, Jin-Hyuk
    Cho, Sung-Bae
    [J]. PATTERN RECOGNITION, 2009, 42 (09) : 1761 - 1767
  • [42] Ensemble Learning with Extended Newton Support Vector Machines for Enhancing Gene Expression Classification
    Huu-Hoa Nguyen
    Nguyen-Khang Pham
    [J]. SN Computer Science, 5 (5)
  • [43] A combinational feature selection and ensemble neural network method for classification of gene expression data
    Bing Liu
    Qinghua Cui
    Tianzi Jiang
    Songde Ma
    [J]. BMC Bioinformatics, 5
  • [44] A combinational feature selection and ensemble neural network method for classification of gene expression data
    Liu, B
    Cui, QH
    Jiang, TZ
    Ma, SD
    [J]. BMC BIOINFORMATICS, 2004, 5 (1)
  • [45] Molecular classification of colorectal cancer using the gene expression profile of tumor samples
    Rashid, Mamoon
    Vishwakarma, Ramesh K.
    Deeb, Ahmad M.
    Hussein, Mohamed A.
    Aziz, Mohammad A.
    [J]. EXPERIMENTAL BIOLOGY AND MEDICINE, 2019, 244 (12) : 1005 - 1016
  • [46] A Gene Selection Approach based on Clustering for Classification Tasks in Colon Cancer
    Castellanos-Garzon, Jose A.
    Ramos, Juan
    [J]. ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2015, 4 (03): : 1 - 10
  • [47] Inflammation, adenoma and cancer:: Objective classification of colon biopsy specimens with gene expression signature
    Galamb, Orsolya
    Gyoerffy, Balazs
    Sipos, Ferenc
    Spisaka, Sandor
    Nemetha, Anna Maria
    Mihellera, Pal
    Tulassay, Zsolt
    Dinya, Elek
    Molnar, Bela
    [J]. DISEASE MARKERS, 2008, 25 (01) : 1 - 16
  • [48] Gene Expression Classification of Colon Cancer into Molecular Subtypes: Characterization, Validation, and Prognostic Value
    Marisa, Laetitia
    de Reynies, Aureline
    Duval, Alex
    Selves, Janick
    Gaub, Marie Pierre
    Vescovo, Laure
    Etienne-Grimaldi, Marie-Christine
    Schiappa, Renaud
    Guenot, Dominique
    Ayadi, Mira
    Kirzin, Sylvain
    Chazal, Maurice
    Flejou, Jean-Francois
    Benchimol, Daniel
    Berger, Anne
    Lagarde, Arnaud
    Pencreach, Erwan
    Piard, Francois
    Elias, Dominique
    Parc, Yann
    Olschwang, Sylviane
    Milano, Gerard
    Laurent-Puig, Pierre
    Boige, Valerie
    [J]. PLOS MEDICINE, 2013, 10 (05)
  • [49] Feature (gene) selection in gene expression-based tumor classification
    Xiong, MM
    Li, WJ
    Zhao, JY
    Jin, L
    Boerwinkle, E
    [J]. MOLECULAR GENETICS AND METABOLISM, 2001, 73 (03) : 239 - 247
  • [50] Classification of topological domains based on gene expression and regulation
    Zhao, Jingjing
    Shi, Hongbo
    Ahituv, Nadav
    [J]. GENOME, 2013, 56 (07) : 415 - 423