Structured variable selection in support vector machines

Cited by: 8
Authors
Wu, Seongho [1]
Zou, Hui [1]
Yuan, Ming [2]
Affiliations
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[2] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA
Source
Funding
National Science Foundation (USA);
Keywords
Classification; Heredity; Nonparametric estimation; Support vector machine; Variable selection;
DOI
10.1214/07-EJS125
Chinese Library Classification (CLC) number
O21 [Probability and Mathematical Statistics]; C8 [Statistics];
Subject classification codes
020208 ; 070103 ; 0714 ;
Abstract
When applying the support vector machine (SVM) to high-dimensional classification problems, we often impose a sparse structure on the SVM to eliminate the influence of irrelevant predictors. The lasso and other variable selection techniques have been used successfully in the SVM to perform automatic variable selection. In some problems, there is a natural hierarchical structure among the variables. Thus, to obtain an interpretable SVM classifier, it is important to respect the heredity principle when enforcing sparsity in the SVM. Many variable selection methods, however, do not respect the heredity principle. In this paper we enforce both sparsity and the heredity principle in the SVM by using the structured variable selection (SVS) framework originally proposed in [20]. We minimize the empirical hinge loss under a set of linear inequality constraints and a lasso-type penalty. The solution always obeys the desired heredity principle and is sparse. The new SVM classifier can be fitted efficiently because the optimization problem is a linear program. Another contribution of this work is a nonparametric extension of the SVS framework, from which we propose nonparametric heredity SVMs. Simulated and real data are used to illustrate the merits of the proposed method.
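The abstract's central computational claim, that hinge loss plus a lasso-type penalty plus linear inequality constraints yields a linear program, can be illustrated with a minimal sketch. Everything specific here is an assumption and not taken from the paper: the function name `fit_heredity_svm_lp`, the split of each coefficient into non-negative parts `beta_pos` and `beta_neg`, the use of `beta_pos + beta_neg` as the magnitude that enters the heredity constraints, the toy data, and the choice of `scipy.optimize.linprog` as solver. The authors' exact constraint set may differ.

```python
import numpy as np
from scipy.optimize import linprog


def fit_heredity_svm_lp(X, y, parents, lam=0.05):
    """Hypothetical sketch: L1-penalized hinge loss with heredity-style
    linear constraints, solved as a linear program.

    X       : (n, p) design matrix (main effects and interactions as columns)
    y       : (n,) labels in {-1, +1}
    parents : list of (child, parent) column-index pairs; the child's
              magnitude is constrained not to exceed the parent's
    """
    n, p = X.shape
    # Variable layout, all non-negative:
    # [b0_pos, b0_neg, beta_pos (p), beta_neg (p), xi (n)]
    n_var = 2 + 2 * p + n
    # Objective: mean slack (empirical hinge loss) + lam * sum of magnitudes
    c = np.concatenate([[0.0, 0.0], lam * np.ones(2 * p), np.ones(n) / n])

    # Margin constraints y_i (b0 + x_i' beta) >= 1 - xi_i, written as A z <= b
    yX = y[:, None] * X
    A_margin = np.hstack([-y[:, None], y[:, None], -yX, yX, -np.eye(n)])
    b_margin = -np.ones(n)

    # Heredity-style constraints: magnitude(child) - magnitude(parent) <= 0
    A_her = np.zeros((len(parents), n_var))
    for row, (child, parent) in enumerate(parents):
        A_her[row, [2 + child, 2 + p + child]] = 1.0
        A_her[row, [2 + parent, 2 + p + parent]] = -1.0
    b_her = np.zeros(len(parents))

    res = linprog(
        c,
        A_ub=np.vstack([A_margin, A_her]),
        b_ub=np.concatenate([b_margin, b_her]),
        bounds=(0, None),
    )
    if not res.success:
        raise RuntimeError(res.message)

    z = res.x
    b0 = z[0] - z[1]
    beta_pos, beta_neg = z[2:2 + p], z[2 + p:2 + 2 * p]
    return b0, beta_pos - beta_neg, beta_pos + beta_neg


# Toy data (assumed): two main effects and their interaction
rng = np.random.default_rng(0)
x1, x2 = rng.normal(size=200), rng.normal(size=200)
X = np.column_stack([x1, x2, x1 * x2])
y = np.where(x1 + x2 + 0.5 * x1 * x2 + 0.3 * rng.normal(size=200) > 0, 1.0, -1.0)

# Column 2 (interaction) has both main effects as parents
b0, beta, magnitude = fit_heredity_svm_lp(X, y, parents=[(2, 0), (2, 1)])
print("intercept:", b0)
print("coefficients:", beta)
```

In this encoding the interaction column can carry weight only if both of its parent columns carry at least as much magnitude, which is one way to express a strong-heredity requirement with linear inequalities. Note that the constraint acts on `beta_pos + beta_neg`, which can exceed `|beta|` at a solution, so this sketch is looser than a constraint on the coefficients themselves.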
Pages: 103-117
Page count: 15