A comparative study on large scale kernelized support vector machines

被引:0
|
作者
Daniel Horn
Aydın Demircioğlu
Bernd Bischl
Tobias Glasmachers
Claus Weihs
机构
[1] Technische Universität Dortmund,Fakultät Statistik
[2] Ruhr-Universität Bochum,Department of Statistics
[3] LMU München,undefined
关键词
Support vector machine; Multi-objective optimization; Supervised learning; Machine learning; Large scale; Nonlinear SVM; Parameter tuning; 62-07 Data analysis;
D O I
暂无
中图分类号
学科分类号
摘要
Kernelized support vector machines (SVMs) belong to the most widely used classification methods. However, in contrast to linear SVMs, the computation time required to train such a machine becomes a bottleneck when facing large data sets. In order to mitigate this shortcoming of kernel SVMs, many approximate training algorithms were developed. While most of these methods claim to be much faster than the state-of-the-art solver LIBSVM, a thorough comparative study is missing. We aim to fill this gap. We choose several well-known approximate SVM solvers and compare their performance on a number of large benchmark data sets. Our focus is to analyze the trade-off between prediction error and runtime for different learning and accuracy parameter settings. This includes simple subsampling of the data, the poor-man’s approach to handling large scale problems. We employ model-based multi-objective optimization, which allows us to tune the parameters of learning machine and solver over the full range of accuracy/runtime trade-offs. We analyze (differences between) solvers by studying and comparing the Pareto fronts formed by the two objectives classification error and training time. Unsurprisingly, given more runtime most solvers are able to find more accurate solutions, i.e., achieve a higher prediction accuracy. It turns out that LIBSVM with subsampling of the data is a strong baseline. Some solvers systematically outperform others, which allows us to give concrete recommendations of when to use which solver.
引用
收藏
页码:867 / 883
页数:16
相关论文
共 50 条
  • [41] A study on reduced support vector machines
    Lin, KM
    Lin, CJ
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (06): : 1449 - 1459
  • [42] A Comparative Study of Extreme Learning Machines and Support Vector Machines in Prediction of Sediment Transport in Open Channels
    Ebtehaj, I.
    Bonakdari, H.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2016, 29 (11): : 1499 - 1506
  • [43] Large-scale image colorization based on divide-and-conquer support vector machines
    Chen, Bo-Wei
    He, Xinyu
    Ji, Wen
    Rho, Seungmin
    Kung, Sun-Yuan
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (08): : 2942 - 2961
  • [44] Large-scale ligand-based predictive modelling using support vector machines
    Jonathan Alvarsson
    Samuel Lampa
    Wesley Schaal
    Claes Andersson
    Jarl E. S. Wikberg
    Ola Spjuth
    Journal of Cheminformatics, 8
  • [45] Large-scale image colorization based on divide-and-conquer support vector machines
    Bo-Wei Chen
    Xinyu He
    Wen Ji
    Seungmin Rho
    Sun-Yuan Kung
    The Journal of Supercomputing, 2016, 72 : 2942 - 2961
  • [46] An Efficient Algorithm for a Class of Large-Scale Support Vector Machines Exploiting Hidden Sparsity
    Niu, Dunbiao
    Wang, Chengjing
    Tang, Peipei
    Wang, Qingsong
    Song, Enbin
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 5608 - 5623
  • [47] Sphere Support Vector Machines for large classification tasks
    Strack, Robert
    Kecman, Vojislav
    Strack, Beata
    Li, Qi
    NEUROCOMPUTING, 2013, 101 : 59 - 67
  • [48] Large-Scale Patent Classification with Min-Max Modular Support Vector Machines
    Chu, Xiao-Lei
    Ma, Chao
    Li, Jing
    Lu, Bao-Liang
    Utiyama, Masao
    Isahara, Hitoshi
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 3973 - +
  • [49] Large-scale ligand-based predictive modelling using support vector machines
    Alvarsson, Jonathan
    Lampa, Samuel
    Schaal, Wesley
    Andersson, Claes
    Wikberg, Jarl E. S.
    Spjuth, Ola
    JOURNAL OF CHEMINFORMATICS, 2016, 8
  • [50] Fast Local Support Vector Machines for Large Datasets
    Segata, Nicola
    Blanzieri, Enrico
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2009, 5632 : 295 - +