Ensemble dependence model for classification and prediction of cancer and normal gene expression data

被引:30
|
作者
Qiu, P [1 ]
Wang, ZJ
Liu, KJR
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V5Z 1M9, Canada
关键词
D O I
10.1093/bioinformatics/bti483
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: DNA microarray technologies make it possible to simultaneously monitor thousands of genes' expression levels. A topic of great interest is to study the different expression profiles between microarray samples from cancer patients and normal subjects, by classifying them at gene expression levels. Currently, various clustering methods have been proposed in the literature to classify cancer and normal samples based on microarray data, and they are predominantly data-driven approaches. In this paper, we propose an alternative approach, a model-driven approach, which can reveal the relationship between the global gene expression profile and the subject's health status, and thus is promising in predicting the early development of cancer. Results: In this work, we propose an ensemble dependence model, aimed at exploring the group dependence relationship of gene clusters. Under the framework of hypothesis-testing, we employ genes' dependence relationship as a feature to model and classify cancer and normal samples. The proposed classification scheme is applied to several real cancer datasets, including cDNA, Affymetrix microarray and proteomic data. It is noted that the proposed method yields very promising performance. We further investigate the eigen-value pattern of the proposed method, and we discover different patterns between cancer and normal samples. Moreover, the transition between cancer and normal patterns suggests that the eigen-value pattern of the proposed models may have potential to predict the early stage of cancer development. In addition, we examine the effects of possible model mismatch on the proposed scheme.
引用
收藏
页码:3114 / 3121
页数:8
相关论文
共 50 条
  • [1] Cancer Classification from Gene Expression Data by NPPC Ensemble
    Ghorai, Santanu
    Mukherjee, Anirban
    Sengupta, Sanghamitra
    Dutta, Pranab K.
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) : 659 - 671
  • [2] New ensemble machine learning method for classification and prediction on gene expression data
    Wang, Ching Wei
    [J]. 2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 60 - 63
  • [3] A support vector machine ensemble for cancer classification using gene expression data
    Liao, Chen
    Li, Shutao
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4463 : 488 - +
  • [4] An ensemble correlation-based gene selection algorithm for cancer classification with gene expression data
    Piao, Yongjun
    Piao, Minghao
    Park, Kiejung
    Ryu, Keun Ho
    [J]. BIOINFORMATICS, 2012, 28 (24) : 3306 - 3315
  • [5] Cancer Classification from Gene Expression Based Microarray Data Using SVM Ensemble
    Begum, Shemim
    Chakraborty, Debasis
    Sarkar, Ram
    [J]. 2015 International Conference on Condition Assessment Techniques in Electrical Systems (CATCON), 2015, : 13 - 16
  • [6] Design of a novel ensemble model of classification technique for gene-expression data of lung cancer with modified genetic algorithm
    Chandrakar, Prem Kumar
    Shrivas, Akhilesh Kumar
    Sahu, Neelam
    [J]. EAI Endorsed Transactions on Pervasive Health and Technology, 2021, 7 (25) : 1 - 13
  • [7] Ensemble classification for gene expression data based on parallel clustering
    Meng, Jun
    Jiang, Dingling
    Zhang, Jing
    Luan, Yushi
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 20 (03) : 213 - 229
  • [8] Bayesian ensemble methods for survival prediction in gene expression data
    Bonato, Vinicius
    Baladandayuthapani, Veerabhadran
    Broom, Bradley M.
    Sulman, Erik P.
    Aldape, Kenneth D.
    Do, Kim-Anh
    [J]. BIOINFORMATICS, 2011, 27 (03) : 359 - 367
  • [9] A Hybrid Ensemble Algorithm Combining AdaBoost and Genetic Algorithm for Cancer Classification with Gene Expression Data
    Lu, Huijuan
    Gao, Huiyun
    Ye, Minchao
    Yan, Ke
    Wang, Xiuhui
    [J]. 2018 NINTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME 2018), 2018, : 15 - 19
  • [10] Cancer Classification Ensemble System Based on Gene Expression Profiles
    Tarek, Sara
    Elwahab, Reda Abd
    Shoman, Mahmoud
    [J]. 2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,