Kernel variable selection for multicategory support vector machines

被引:5
|
作者
Park, Beomjin [1 ]
Park, Changyi [1 ]
机构
[1] Univ Seoul, Dept Stat, Seoul 130743, South Korea
基金
新加坡国家研究基金会;
关键词
Multicategory classification; Statistical learning; Variable selection consistency; GENE SELECTION; LYMPHOBLASTIC-LEUKEMIA; SVM-RFE; CLASSIFICATION; CD9; DIAGNOSIS;
D O I
10.1016/j.jmva.2021.104800
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Variable selection is important in statistical learning because it can increase predictive performances and yield interpretable models. Since support vector machines construct a classification model through a mapping from an original space to a high-dimensional feature space, it is difficult to select informative variables and interpret the relation between covariates and class labels. In this paper, we suggest a variable selection method for support vector machines, focusing on the multicategory problem. We study asymptotic properties of the proposed method. Also we illustrate that our method can accurately select relevant variables and yield interpretable models on both simulated and real data sets. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页数:22
相关论文
共 50 条