Bayesian approach to feature selection and parameter tuning for support vector machine classifiers

Cited by: 50
Authors
Gold, C [1]
Holub, A
Sollich, P
Affiliations
[1] CALTECH, Pasadena, CA 91125 USA
[2] Kings Coll London, Dept Math, London WC2R 2LS, England
Keywords
DOI
10.1016/j.neunet.2005.06.044
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A Bayesian point of view of SVM classifiers allows the definition of a quantity analogous to the evidence in probabilistic models. By maximizing this one can systematically tune hyperparameters and, via automatic relevance determination (ARD), select relevant input features. Evidence gradients are expressed as averages over the associated posterior and can be approximated using Hybrid Monte Carlo (HMC) sampling. We describe how a Nyström approximation of the Gram matrix can be used to speed up sampling times significantly while maintaining almost unchanged classification accuracy. In experiments on classification problems with a significant number of irrelevant features this approach to ARD can give a significant improvement in classification performance over more traditional, non-ARD, SVM systems. The final tuned hyperparameter values provide a useful criterion for pruning irrelevant features, and we define a measure of relevance with which to determine systematically how many features should be removed. This use of ARD for hard feature selection can improve classification accuracy in non-ARD SVMs. In the majority of cases, however, we find that in data sets constructed by human domain experts the performance of non-ARD SVMs is largely insensitive to the presence of some less relevant features. Eliminating such features via ARD then does not improve classification accuracy, but leads to impressive reductions in the number of features required, by up to 75%. © 2005 Elsevier Ltd. All rights reserved.
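As a rough illustration of the ARD idea mentioned in the abstract, the sketch below builds a Gram matrix from a Gaussian kernel with one length-scale per input feature. This kernel form is an assumption (a common ARD parameterisation); the record does not state which kernel the authors use, and the names `ard_rbf_kernel`, `gram_matrix`, and `length_scales` are hypothetical. A feature whose length-scale is tuned to a very large value contributes almost nothing to the kernel, which is what makes the tuned hyperparameters usable as a pruning criterion.

```python
import math


def ard_rbf_kernel(x, y, length_scales):
    """Gaussian kernel with a separate length-scale per feature (assumed form)."""
    if not (len(x) == len(y) == len(length_scales)):
        raise ValueError("x, y and length_scales must have the same length")
    # Each squared difference is scaled by its own length-scale, so a large
    # length-scale shrinks that feature's contribution towards zero.
    scaled_sq_dist = sum(((a - b) / l) ** 2 for a, b, l in zip(x, y, length_scales))
    return math.exp(-0.5 * scaled_sq_dist)


def gram_matrix(X, length_scales):
    """Full Gram matrix K[i][j] = k(X[i], X[j]) for a list of input vectors."""
    return [[ard_rbf_kernel(xi, xj, length_scales) for xj in X] for xi in X]


# Toy data: the second feature varies widely, but its length-scale is huge,
# so the kernel effectively ignores it.
X = [[0.0, 5.0], [1.0, -3.0], [0.0, -4.0]]
K = gram_matrix(X, length_scales=[1.0, 1e6])
```

Here points 0 and 2 share the same first feature, so their kernel value is close to 1 even though their second features differ by 9 units.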
Pages: 693-701
Number of pages: 9