Structured variable selection in support vector machines

被引:8
|
作者
Wu, Seongho [1 ]
Zou, Hui [1 ]
Yuan, Ming [2 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[2] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA
来源
基金
美国国家科学基金会;
关键词
Classification; Heredity; Nonparametric estimation; Support vector machine; Variable selection;
D O I
10.1214/07-EJS125
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
When applying the support vector machine (SVM) to high-dimensional classification problems, we often impose a sparse structure in the SVM to eliminate the influences of the irrelevant predictors. The lasso and other variable selection techniques have been successfully used in the SVM to perform automatic variable selection. In some problems, there is a natural hierarchical structure among the variables. Thus, in order to have an interpretable SVM classifier, it is important to respect the heredity principle when enforcing the sparsity in the SVM. Many variable selection methods, however, do not respect the heredity principle. In this paper we enforce both sparsity and the heredity principle in the SVM by using the so-called structured variable selection (SVS) framework originally proposed in [20]. We minimize the empirical hinge loss under a set of linear inequality constraints and a lasso-type penalty. The solution always obeys the desired heredity principle and enjoys sparsity. The new SVM classifier can be efficiently fitted, because the optimization problem is a linear program. Another contribution of this work is to present a nonparametric extension of the SVS framework, and we propose nonparametric heredity SVMs. Simulated and real data are used to illustrate the merits of the proposed method.
引用
收藏
页码:103 / 117
页数:15
相关论文
共 50 条
  • [31] Bilevel model selection for Support Vector Machines
    Kunapuli, Gautam
    Bennett, Kristin P.
    Hu, Jing
    Pang, Jong-Shi
    DATA MINING AND MATHEMATICAL PROGRAMMING, 2008, 45 : 129 - 158
  • [32] Evolutionary selection of kernels in Support Vector Machines
    Thadani, Kanchan
    Ashutosh
    Jayaraman, V. K.
    Sundararajan, V.
    2006 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, VOLS 1 AND 2, 2007, : 18 - +
  • [33] Variable Selection and Oversampling in the Use of Smooth Support Vector Machines for Predicting the Default Risk of Companies
    Haerdle, Wolfgang
    Lee, Yuh-Jye
    Schaefer, Dorothea
    Yeh, Yi-Ren
    JOURNAL OF FORECASTING, 2009, 28 (06) : 512 - 534
  • [34] Learning layered ranking functions with structured support vector machines
    Waegeman, Willem
    De Baets, Bernard
    Boullart, Luc
    NEURAL NETWORKS, 2008, 21 (10) : 1511 - 1523
  • [35] Structured multicategory support vector machines with analysis of variance decomposition
    Lee, Yoonkyung
    Kim, Yuwon
    Lee, Sangjun
    Koo, Ja-Yong
    BIOMETRIKA, 2006, 93 (03) : 555 - 571
  • [36] Variable selection for the linear support vector machine
    Zhu, Ji
    Zou, Hui
    TRENDS IN NEURAL COMPUTATION, 2007, 35 : 35 - +
  • [37] Research on parameter selection method for support vector machines
    Ling Sun
    Jian Bao
    Yangyang Chen
    Mingming Yang
    Applied Intelligence, 2018, 48 : 331 - 342
  • [38] Optimizing resources in model selection for support vector machines
    Adankon, MM
    Cheriet, M
    Ayat, NE
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 925 - 930
  • [39] Candidate vectors selection for training support vector machines
    Li, Minqiang
    Chen, Fuzan
    Kou, Jisong
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 1, PROCEEDINGS, 2007, : 538 - +
  • [40] Linear penalization support vector machines for feature selection
    Miranda, J
    Montoya, R
    Weber, R
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 188 - 192