Robust Model Selection for Classification of Microarrays

被引:0
|
作者
Suzuki, Ikumi [1 ]
Takenouchi, Takashi [1 ]
Ohira, Miki [2 ]
Oba, Shigeyuki [3 ]
Ishii, Shin [1 ,3 ,4 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300192, Japan
[2] Chiba Canc Ctr Res Inst, Div Biochem, Chiba 2608717, Japan
[3] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
[4] PRESTO, Japan Sci & Technol Corp, Tokyo, Japan
关键词
gene expression; cancer diagnosis; mini-chip microarrays; supervised analysis;
D O I
暂无
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Recently, microarray-based cancer diagnosis systems have been increasingly investigated. However, cost reduction and reliability assurance of such diagnosis systems are still remaing problems in real clinical scenes. To reduce the cost, we need a supervised classifier involving the smallest number of genes, as long as the classifier is sufficiently reliable. To achieve a reliable classifier, we should assess candidate classifiers and select the best one. In the selection process of the best classifier, however, the assessment criterion must involve large variance because of limited number of samples and non-negligible observation noise. Therefore, even if a classifier with a very small number of genes exhibited the smallest leave-one-out cross-validation (LOO) error rate, it would not necessarily be reliable because classifiers based on a small number of genes tend to show large variance. We propose a robust model selection criterion, the min-max criterion, based on a resampling bootstrap simulation to assess the variance of estimation of classification error rates. We applied our assessment framework to four published real gene expression datasets and one synthetic dataset. We found that a state-of-the-art procedure, weighted voting classifiers with LOO criterion, had a non-negligible risk of selecting extremely poor classifiers and, on the other hand, that the new min-max criterion could eliminate that risk. These finding suggests that our criterion presents a safer procedure to design a practical cancer diagnosis system.
引用
收藏
页码:141 / 157
页数:17
相关论文
共 50 条
  • [1] Robust model selection and the statistical classification of languages
    Garcia, Jesus E.
    Gonzalez-Lopez, V. A.
    Viola, M. L. L.
    [J]. XI BRAZILIAN MEETING ON BAYESIAN STATISTICS (EBEB 2012), 2012, 1490 : 160 - 170
  • [2] Selection of Accurate and Robust Classification Model for Binary Classification Problems
    Khan, Muhammad A.
    Jan, Zahoor
    Ishtiaq, M.
    Khan, M. Asif
    Mirza, Anwar M.
    [J]. SIGNAL PROCESSING, IMAGE PROCESSING, AND PATTERN RECOGNITION, 2009, 61 : 161 - 168
  • [3] Classification of mislabelled microarrays using robust sparse logistic regression
    Bootkrajang, Jakramate
    Kaban, Ata
    [J]. BIOINFORMATICS, 2013, 29 (07) : 870 - 877
  • [4] A robust classification model with Voting based feature selection for Diagnosis of Epilepsy
    Hassan, Ali
    Riaz, Farhan
    Basit, Abdul
    [J]. 2015 IEEE 28TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2015, : 176 - 179
  • [5] Model Classification-and-Selection Assisted Robust Receiver for OFDM Systems
    Zhang, Xiaoying
    Mei, Kai
    Liu, Xiaoran
    Zhang, Lei
    Wei, Jibo
    [J]. IEEE ACCESS, 2019, 7 : 85746 - 85754
  • [6] Robust Feature Selection Method for Music Classification
    Rameshkumar, P.
    Monisha, M.
    Santhi, B.
    Vigneshwaran, T.
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2014,
  • [7] Robust Classification Under Sample Selection Bias
    Liu, Anqi
    Ziebart, Brian D.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [8] ROBUST MODEL SELECTION IN REGRESSION
    RONCHETTI, E
    [J]. STATISTICS & PROBABILITY LETTERS, 1985, 3 (01) : 21 - 23
  • [9] Classification of microarrays; synergistic effects between normalization, gene selection and machine learning
    Onskog, Jenny
    Freyhult, Eva
    Landfors, Mattias
    Ryden, Patrik
    Hvidsten, Torgeir R.
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [10] Selection of Characteristics and Classification of DNA Microarrays Using Bioinspired Algorithms and the Generalized Neuron
    Alejandra Romero-Montiel, Flor
    Rodriguez-Vazquez, Katya
    [J]. ADVANCES IN SOFT COMPUTING, MICAI 2018, PT I, 2018, 11288 : 86 - 97