GECC: Gene Expression Based Ensemble Classification of Colon Samples

Cited by: 25
Authors
Rathore, Saima [1, 2]
Hussain, Mutawarra [1]
Khan, Asifullah [1]
Affiliations
[1] Pakistan Inst Engn & Appl Sci, Dept Comp & Informat Sci, Islamabad, Pakistan
[2] Univ Azad Jammu & Kashmir, Dept Comp Sci & Informat Technol, Muzaffarabad, Pakistan
Keywords
Colon cancer; ensemble classification; gene expressions; PCA; mRMR; F-Score; chi-square; FEATURE-SELECTION; CANCER; PREDICTION; MACHINE; PROFILES; TUMOR; SVM; INFORMATION; PATTERNS;
DOI
10.1109/TCBB.2014.2344655
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Gene expression deviates from its normal pattern when a patient has cancer, and this variation can serve as an effective tool for detecting cancer. In this study, we propose a novel gene expression based colon classification scheme (GECC) that exploits variations in gene expression to classify colon gene samples into normal and malignant classes. The novelty of GECC is twofold. First, to cope with the overwhelmingly large size of gene based data sets, various feature extraction strategies, such as chi-square, F-Score, principal component analysis (PCA), and minimum redundancy and maximum relevancy (mRMR), have been employed to select discriminative genes from among the full set of genes. Second, a majority voting based ensemble of support vector machines (SVMs) has been proposed to classify the given gene based samples. Individual SVM models have previously been used for colon classification; however, their performance is limited. We therefore propose a new SVM-ensemble based approach for gene based classification of colon samples, wherein the individual SVM models are constructed by learning with different SVM kernels: linear, polynomial, radial basis function (RBF), and sigmoid. The predictions of the individual models are combined through majority voting, which makes the combined decision space more discriminative. The proposed technique has been tested on four colon data sets and several other binary-class gene expression data sets, and it achieves improved performance compared with previously reported gene based colon cancer detection techniques. Training and testing on the 208 x 5,851 data set required 591.01 s and 0.019 s of computation time, respectively.
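The two stages described in the abstract, gene ranking followed by majority voting over per-kernel predictions, can be illustrated with a small sketch. This is not the authors' implementation. It assumes the commonly used Chen and Lin form of the F-score, and it assumes a fixed tie-breaking label for the case where four voters split two against two, which the abstract does not specify. The function names `f_score`, `top_genes`, and `majority_vote` are hypothetical, and the votes stand in for the outputs of already trained linear, polynomial, RBF, and sigmoid SVMs.

```python
from collections import Counter
from statistics import mean, variance


def f_score(values, labels, positive=1):
    """F-score of one gene (assumed Chen & Lin definition, two classes)."""
    pos = [v for v, y in zip(values, labels) if y == positive]
    neg = [v for v, y in zip(values, labels) if y != positive]
    overall = mean(values)
    # Between-class separation of the two class means from the overall mean.
    between = (mean(pos) - overall) ** 2 + (mean(neg) - overall) ** 2
    # Within-class spread, using sample variances (n - 1 denominator).
    within = variance(pos) + variance(neg)
    return between / within if within > 0 else 0.0


def top_genes(samples, labels, k):
    """Return indices of the k genes with the highest F-score."""
    n_genes = len(samples[0])
    scores = [f_score([row[g] for row in samples], labels)
              for g in range(n_genes)]
    return sorted(range(n_genes), key=lambda g: scores[g], reverse=True)[:k]


def majority_vote(votes, tie_label=1):
    """Combine per-kernel predictions; tie_label is an assumed tie rule."""
    counts = Counter(votes).most_common()
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return tie_label
    return counts[0][0]


# Toy data: 4 samples x 2 genes; label 1 = malignant, 0 = normal.
samples = [[1.0, 5.0], [1.2, 3.0], [3.0, 4.8], [3.2, 3.1]]
labels = [0, 0, 1, 1]
selected = top_genes(samples, labels, k=1)

# Hypothetical predictions of the four kernel-specific SVMs for one sample.
decision = majority_vote([1, 1, 0, 1])
```

In the toy data, gene 0 separates the two classes while gene 1 does not, so the ranking keeps gene 0. With an even number of voters, the tie rule matters in practice, so it is exposed here as a parameter.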
Pages: 1131 - 1145
Page count: 15
Related papers
50 in total
  • [11] A novel ensemble classification of gene expression profile based on bagging and neighborhood rough set
    Chen, Tao
    Hong, Zenglin
    Zhao, Hui
    [J]. Journal of Computational Information Systems, 2015, 11 (08) : 2747 - 2754
  • [12] Ensemble sparse classification of colon cancer
    Rathore, Saima
    Iftikhar, Muhammad Aksam
    Hassan, Mehdi
    [J]. PROCEEDINGS OF 14TH INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY - FIT 2016, 2016 : 235 - 240
  • [13] Cancer Classification from Gene Expression Data by NPPC Ensemble
    Ghorai, Santanu
    Mukherjee, Anirban
    Sengupta, Sanghamitra
    Dutta, Pranab K.
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) : 659 - 671
  • [15] Ensemble methods of rank-based trees for single sample classification with gene expression profiles
    Lu, Min
    Yin, Ruijie
    Chen, X. Steven
    [J]. JOURNAL OF TRANSLATIONAL MEDICINE, 2024, 22 (01)
  • [16] Ensemble classification of colon biopsy images based on information rich hybrid features
    Rathore, Saima
    Hussain, Mutawarra
    Iftikhar, Muhammad Aksam
    Jalil, Abdul
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 47 : 76 - 92
  • [17] Gene Expression Data Classification Using Artificial Neural Network Ensembles Based on Samples Filtering
    Chen, Wutao
    Lu, Huijuan
    Wang, Mingyi
    Fang, Cheng
    [J]. 2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL I, PROCEEDINGS, 2009, : 626 - +
  • [18] TEXTURE BASED CLASSIFICATION OF HYPERSPECTRAL COLON BIOPSY SAMPLES USING CLBP
    Masood, Khalid
    Rajpoot, Nasir
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, VOLS 1 AND 2, 2009, : 1011 - 1014
  • [19] Gene expression based cancer classification
    Tarek, Sara
    Abd Elwahab, Reda
    Shoman, Mahmoud
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2017, 18 (03) : 151 - 159
  • [20] Facial Expression Classification Based on Ensemble Convolutional Neural Network
    Zhou Tao
    Lu Xiaoqi
    Ren Guoyin
    Gu Yu
    Zhang Ming
    Li Jing
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (14)