Model selection for linear classifiers using Bayesian error estimation

被引:13
|
作者
Huttunen, Heikki [1 ]
Tohka, Jussi [2 ,3 ]
机构
[1] Tampere Univ Technol, Dept Signal Proc, FIN-33101 Tampere, Finland
[2] Univ Carlos III Madrid, Dept Bioengn & Aerosp Engn, E-28903 Getafe, Spain
[3] Inst Invest Sanitaria Gregorio Maranon, Madrid, Spain
关键词
Logistic regression; Support vector machine; Regularization; Bayesian error estimator; Linear classifier; CLASSIFICATION; PERFORMANCE;
D O I
10.1016/j.patcog.2015.05.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Regularized linear models are important classification methods for high dimensional problems, where regularized linear classifiers are often preferred due to their ability to avoid overfitting. The degree of freedom of the model dis determined by a regularization parameter, which is typically selected using counting based approaches, such as K-fold cross-validation. For large data, this can be very time consuming, and, for small sample sizes, the accuracy of the model selection is limited by the large variance of CV error estimates. In this paper, we study the applicability of a recently proposed Bayesian error estimator for the selection of the best model along the regularization path. We also propose an extension of the estimator that allows model selection in multiclass cases and study its efficiency with L-1 regularized logistic regression and L-2 regularized linear support vector machine. The model selection by the new Bayesian error estimator is experimentally shown to improve the classification accuracy, especially in small sample-size situations, and is able to avoid the excess variability inherent to traditional cross-validation approaches. Moreover, the method has significantly smaller computational complexity than cross-validation. (C) 2015 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页码:3739 / 3748
页数:10
相关论文
共 50 条
  • [21] Bayesian model selection for optical flow estimation
    Krajsek, Kai
    Mester, Rudolf
    PATTERN RECOGNITION, PROCEEDINGS, 2007, 4713 : 142 - +
  • [22] Bayesian Post-Model-Selection Estimation
    Harel, Nadav
    Routtenberg, Tirza
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 175 - 179
  • [23] Model selection for the mixed logit with Bayesian estimation
    Balcombe, Kelvin
    Chalak, Ali
    Fraser, Iain
    JOURNAL OF ENVIRONMENTAL ECONOMICS AND MANAGEMENT, 2009, 57 (02) : 226 - 237
  • [24] Wavelet estimation by Bayesian thresholding and model selection
    Cinquemani, Eugenio
    Pillonetto, Gianluigi
    AUTOMATICA, 2008, 44 (09) : 2288 - 2297
  • [25] BAYESIAN MODEL SELECTION AND PARAMETER ESTIMATION IN PENALIZED REGRESSION MODEL USING SMC SAMPLERS
    Thi Le Thu Nguyen
    Septier, Francois
    Peters, Gareth W.
    Delignon, Yves
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [26] Valid Bayesian estimation of the cointegrating error correction model
    Strachan, RW
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2003, 21 (01) : 185 - 195
  • [27] Bayesian model selection for linear and non-linear time series using the Gibbs sampler
    Troughton, PT
    Godsill, SJ
    MATHEMATICS IN SIGNAL PROCESSING IV, 1998, 67 : 249 - 261
  • [28] Estimation Performance for the Bayesian Hierarchical Linear Model
    El Korso, Mohammed Nabil
    Boyer, Remy
    Larzabal, Pascal
    Fleury, Bernard-Henri
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (04) : 488 - 492
  • [29] On the generalization error by a layered statistical model with Bayesian estimation
    Watanabe, Sumio
    Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi), 2000, 83 (06): : 95 - 106
  • [30] Bayesian classifiers based on kernel density estimation: Flexible classifiers
    Perez, Aritz
    Larranaga, Pedro
    Inza, Inaki
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2009, 50 (02) : 341 - 362