Structured variable selection in support vector machines

被引:8
|
作者
Wu, Seongho [1 ]
Zou, Hui [1 ]
Yuan, Ming [2 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[2] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA
来源
基金
美国国家科学基金会;
关键词
Classification; Heredity; Nonparametric estimation; Support vector machine; Variable selection;
D O I
10.1214/07-EJS125
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
When applying the support vector machine (SVM) to high-dimensional classification problems, we often impose a sparse structure in the SVM to eliminate the influences of the irrelevant predictors. The lasso and other variable selection techniques have been successfully used in the SVM to perform automatic variable selection. In some problems, there is a natural hierarchical structure among the variables. Thus, in order to have an interpretable SVM classifier, it is important to respect the heredity principle when enforcing the sparsity in the SVM. Many variable selection methods, however, do not respect the heredity principle. In this paper we enforce both sparsity and the heredity principle in the SVM by using the so-called structured variable selection (SVS) framework originally proposed in [20]. We minimize the empirical hinge loss under a set of linear inequality constraints and a lasso-type penalty. The solution always obeys the desired heredity principle and enjoys sparsity. The new SVM classifier can be efficiently fitted, because the optimization problem is a linear program. Another contribution of this work is to present a nonparametric extension of the SVS framework, and we propose nonparametric heredity SVMs. Simulated and real data are used to illustrate the merits of the proposed method.
引用
收藏
页码:103 / 117
页数:15
相关论文
共 50 条
  • [41] Feature Selection using Fuzzy Support Vector Machines
    Hong Xia
    Bao Qing Hu
    Fuzzy Optimization and Decision Making, 2006, 5 (2) : 187 - 192
  • [42] Optimal kernel selection in twin support vector machines
    Reshma Khemchandani
    Suresh Jayadeva
    Optimization Letters, 2009, 3 : 77 - 88
  • [43] Training Data Selection for Support Vector Machines Model
    Dang Huu Nghi
    Luong Chi Mai
    INFORMATION AND ELECTRONICS ENGINEERING, 2011, 6 : 28 - 32
  • [44] Complexity reduction and parameter selection in support vector machines
    Ancona, N
    Cicirelli, G
    Distante, A
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 2375 - 2380
  • [45] Comparison of Feature Selection Methods in Support Vector Machines
    Kim, Kwangsu
    Park, Changyi
    KOREAN JOURNAL OF APPLIED STATISTICS, 2013, 26 (01) : 131 - 139
  • [46] Feature selection for support vector machines in text categorization
    Liu, Y
    Lu, HM
    Lu, ZX
    Wang, P
    MLMTA'03: INTERNATIONAL CONFERENCE ON MACHINE LEARNING; MODELS, TECHNOLOGIES AND APPLICATIONS, 2003, : 129 - 134
  • [47] Uncertainty Handling in Model Selection for Support Vector Machines
    Glasmachers, Tobias
    Igel, Christian
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN X, PROCEEDINGS, 2008, 5199 : 185 - 194
  • [48] AUC Maximizing Support Vector Machines with Feature Selection
    Tian, Yingjie
    Shi, Yong
    Chen, Xiaojun
    Chen, Wenjing
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 1691 - 1698
  • [49] Feature selection for support vector machines with RBF kernel
    Liu, Quanzhong
    Chen, Chihau
    Zhang, Yang
    Hu, Zhengguo
    ARTIFICIAL INTELLIGENCE REVIEW, 2011, 36 (02) : 99 - 115
  • [50] ε-tube based pattern selection for support vector machines
    Kim, Dongil
    Cho, Sungzoon
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 215 - 224