Robust Model Selection for Classification of Microarrays

被引:0
|
作者
Suzuki, Ikumi [1 ]
Takenouchi, Takashi [1 ]
Ohira, Miki [2 ]
Oba, Shigeyuki [3 ]
Ishii, Shin [1 ,3 ,4 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300192, Japan
[2] Chiba Canc Ctr Res Inst, Div Biochem, Chiba 2608717, Japan
[3] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
[4] PRESTO, Japan Sci & Technol Corp, Tokyo, Japan
关键词
gene expression; cancer diagnosis; mini-chip microarrays; supervised analysis;
D O I
暂无
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Recently, microarray-based cancer diagnosis systems have been increasingly investigated. However, cost reduction and reliability assurance of such diagnosis systems are still remaing problems in real clinical scenes. To reduce the cost, we need a supervised classifier involving the smallest number of genes, as long as the classifier is sufficiently reliable. To achieve a reliable classifier, we should assess candidate classifiers and select the best one. In the selection process of the best classifier, however, the assessment criterion must involve large variance because of limited number of samples and non-negligible observation noise. Therefore, even if a classifier with a very small number of genes exhibited the smallest leave-one-out cross-validation (LOO) error rate, it would not necessarily be reliable because classifiers based on a small number of genes tend to show large variance. We propose a robust model selection criterion, the min-max criterion, based on a resampling bootstrap simulation to assess the variance of estimation of classification error rates. We applied our assessment framework to four published real gene expression datasets and one synthetic dataset. We found that a state-of-the-art procedure, weighted voting classifiers with LOO criterion, had a non-negligible risk of selecting extremely poor classifiers and, on the other hand, that the new min-max criterion could eliminate that risk. These finding suggests that our criterion presents a safer procedure to design a practical cancer diagnosis system.
引用
下载
收藏
页码:141 / 157
页数:17
相关论文
共 50 条
  • [21] Robust and parallel Bayesian model selection
    Zhang, Michael Minyi
    Lam, Henry
    Lin, Lizhen
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 127 : 229 - 247
  • [22] Robust Model Selection for Stochastic Processes
    Garcia, Jesus E.
    Gonzalez-Lopez, V. A.
    Viola, M. L. L.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2014, 43 (10-12) : 2516 - 2526
  • [23] Robust concentration graph model selection
    Gottard, Anna
    Pacillo, Simona
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) : 3070 - 3079
  • [24] Robust model selection with flexible trimming
    Riani, Marco
    Atkinson, Anthony C.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) : 3300 - 3312
  • [25] A robust model for intelligent text classification
    Basili, R
    Moschitti, A
    ICTAI 2001: 13TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2001, : 265 - 272
  • [26] A Robust Geometric Model for Argument Classification
    Giannone, Cristina
    Croce, Danilo
    Basili, Roberto
    De Cao, Diego
    AI (ASTERISK) IA 2009: EMERGENT PERSPECTIVES IN ARTIFICIAL INTELLIGENCE, 2009, 5883 : 284 - 293
  • [27] Model Similarity, Model Selection, and Attribute Classification
    Ma, Wenchao
    Iaconangelo, Charles
    de la Torre, Jimmy
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2016, 40 (03) : 200 - 217
  • [28] Detecting differentially expressed genes in microarrays using Bayesian model selection
    Ishwaran, H
    Rao, JS
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2003, 98 (462) : 438 - 455
  • [29] Robust model selection criteria for robust S and LTS estimators
    Cetin, Meral
    HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2016, 45 (01): : 153 - 164
  • [30] Structural similarity-based noise-robust band selection model for hyperspectral image classification
    Liu, Yifan
    Qian, Longxia
    Hong, Mei
    Wang, Xianyue
    Journal of Applied Remote Sensing, 2024, 18 (03)