Generalization Error Bounds for Multiclass Sparse Linear Classifiers

被引:0
|
作者
Levy, Tomer [1 ]
Abramovich, Felix [1 ]
机构
[1] Tel Aviv Univ, Dept Stat & Operat Res, Tel Aviv, Israel
基金
以色列科学基金会;
关键词
Feature selection; high-dimensionality; minimaxity; misclassification excess risk; sparsity; HIGH-DIMENSIONAL CLASSIFICATION; MODELS; SLOPE; REGULARIZATION; CONSISTENCY; SELECTION; LASSO;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider high-dimensional multiclass classification by sparse multinomial logistic regression. Unlike binary classification, in the multiclass setup one can think about an entire spectrum of possible notions of sparsity associated with different structural assumptions on the regression coefficients matrix. We propose a computationally feasible feature selection procedure based on penalized maximum likelihood with convex penalties capturing a specific type of sparsity at hand. In particular, we consider global row-wise sparsity, double row-wise sparsity, and low-rank sparsity, and show that with the properly chosen tuning parameters the derived plug-in classifiers attain the minimax generalization error bounds (in terms of misclassification excess risk) within the corresponding classes of multiclass sparse linear classifiers. The developed approach is general and can be adapted to other types of sparsity as well.
引用
收藏
页数:35
相关论文
共 50 条
  • [1] Generalization error bounds for aggregate classifiers
    Blanchard, G
    [J]. NONLINEAR ESTIMATION AND CLASSIFICATION, 2003, 171 : 357 - 367
  • [2] Some new bounds on the generalization error of combined classifiers
    Koltchinskii, V
    Panchenko, D
    Lozano, F
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 245 - 251
  • [3] Error bounds for sparse classifiers in high-dimensions
    Dedieu, Antoine
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 48 - 56
  • [4] New bounds and approximations for the error of linear classifiers
    Rueda, L
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, 2004, 3287 : 342 - 349
  • [5] Upper bounds for error rates of linear combinations of classifiers
    Murua, A
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) : 591 - 602
  • [6] Generalization bounds for averaged classifiers
    Freund, Y
    Mansour, Y
    Schapire, RE
    [J]. ANNALS OF STATISTICS, 2004, 32 (04): : 1698 - 1722
  • [7] Generalization error of combined classifiers
    Mason, L
    Bartlett, PL
    Golea, M
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2002, 65 (02) : 415 - 438
  • [8] Generalization Error of Invariant Classifiers
    Sokolic, Jure
    Giryes, Raja
    Sapiro, Guillermo
    Rodrigues, Miguel R. D.
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 1094 - 1103
  • [9] RECYCLED LINEAR CLASSIFIERS FOR MULTICLASS CLASSIFICATION
    Soni, Akshay
    Haupt, Jarvis
    Porikli, Fatih
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [10] On the generalization error of fixed combinations of classifiers
    Anthony, Martin
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2007, 73 (05) : 725 - 734