GENE SELECTION USING LOGISTIC REGRESSIONS BASED ON AIC, BIC AND MDL CRITERIA

被引:17
|
作者
Zhou, Xiaobo [1 ]
Wang, Xiaodong [2 ]
Dougherty, Edward R. [1 ,3 ]
机构
[1] Texas A&M Univ, Dept Elect Engn, College Stn, TX 77843 USA
[2] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Pathol, Houston, TX 77030 USA
关键词
Gene microarray; logistic regression; Bayesian gene selection; cancer classification;
D O I
10.1142/S179300570500007X
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables (gene expressions) and the small number of experimental conditions. Many gene-selection and classification methods have been proposed; however most of these treat gene selection and classification separately, and not under the same model. We propose a Bayesian approach to gene selection using the logistic regression model. The Akaike information criterion (AIC), the Bayesian information criterion (BIC) and the minimum description length (MDL) principle are used in constructing the posterior distribution of the chosen genes. The same logistic regression model is then used for cancer classification. Fast implementation issues for these methods are discussed. The proposed methods are tested on several data sets including those arising from hereditary breast cancer, small round blue-cell tumors, lymphoma, and acute leukemia. The experimental results indicate that the proposed methods show high classification accuracies on these data sets. Some robustness and sensitivity properties of the proposed methods are also discussed. Finally, mixing logistic-regression based gene selection with other classification methods and mixing logistic-regression-based classification with other gene-selection methods are considered.
引用
收藏
页码:129 / 145
页数:17
相关论文
共 50 条
  • [1] FAST STEPWISE PROCEDURES OF SELECTION OF VARIABLES BY USING AIC AND BIC CRITERIA BIC CRITERIA
    安鸿志
    顾岚
    [J]. Acta Mathematicae Applicatae Sinica, 1989, (01) : 60 - 67
  • [2] Modified AIC and MDL model selection criteria for short data records
    De Ridder, F
    Pintelon, R
    Schoukens, J
    Gillikin, DP
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2005, 54 (01) : 144 - 150
  • [3] Model selection using modified AIC and BIC in joint modeling of paired functional data
    Wei, Jiawei
    Zhou, Lan
    [J]. STATISTICS & PROBABILITY LETTERS, 2010, 80 (23-24) : 1918 - 1924
  • [4] Bootstrap-based model selection criteria for beta regressions
    Bayer, Fabio M.
    Cribari-Neto, Francisco
    [J]. TEST, 2015, 24 (04) : 776 - 795
  • [5] Bootstrap-based model selection criteria for beta regressions
    Fábio M. Bayer
    Francisco Cribari-Neto
    [J]. TEST, 2015, 24 : 776 - 795
  • [6] Extreme value distribution based gene selection criteria for discriminant microarray data analysis using logistic regression
    Li, WT
    Sun, FZ
    Grosse, I
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (2-3) : 215 - 226
  • [7] Using Bonferroni, BIC and AIC to assess evidence for alternative biological pathways: covariate selection for the multilevel Embryo-Uterus model
    Christos Stylianou
    Andrew Pickles
    Stephen A Roberts
    [J]. BMC Medical Research Methodology, 13
  • [8] Using Bonferroni, BIC and AIC to assess evidence for alternative biological pathways: covariate selection for the multilevel Embryo-Uterus model
    Stylianou, Christos
    Pickles, Andrew
    Roberts, Stephen A.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2013, 13
  • [9] Model selection in multivariate adaptive regressions splines (MARS) using alternative information criteria
    Adiguzel, Meryem Bekar
    Cengiz, Mehmet Ali
    [J]. HELIYON, 2023, 9 (09)
  • [10] EPR-RCGA-based modelling of compression index and RMSE-AIC-BIC-based model selection for Chinese marine clays and their engineering application
    Wu, Ze-xiang
    Ji, Hui
    Yu, Chuang
    Zhou, Cheng
    [J]. JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2018, 19 (03): : 211 - 224