SVM Parameter Tuning with Grid Search and Its Impact on Reduction of Model Over-fitting

被引:67
|
作者
Lameski, Petre [1 ]
Zdravevski, Eftim [1 ]
Mingov, Riste [2 ]
Kulakov, Andrea [1 ]
机构
[1] St Cyril & Methodius Univ, Fac Comp Sci & Engn, Skopje, North Macedonia
[2] NI TEKNA Intelligent Technol, Negotino, North Macedonia
关键词
Support Vector Machines; SVM; Grid search; Over-fitting; Parameter tuning; Time series; Coalminig;
D O I
10.1007/978-3-319-25783-9_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we describe our submission to the IJCRS' 15 Data Mining Competition, which is concerned with prediction of dangerous concentrations of methane in longwalls of a Polish coalmine. We address the challenge of building robust classification models with support vector machines (SVMs) that are built from time series data. Moreover, we investigate the impact of parameter tuning of SVMs with grid search on the classification performance and its effect on preventing over-fitting. Our results show improvements of predictive performance with proper parameter tuning but also improved stability of the classification models even when the test data comes from a different time period and class distribution. By applying the proposed method we were able to build a classification model that predicts unseen test data even better than the training data, thus highlighting the non-over-fitting properties of the model. The submitted solution was about 2% behind the winning solution.
引用
收藏
页码:464 / 474
页数:11
相关论文
共 50 条
  • [1] ALLEVIATING OVER-FITTING IN ATTRIBUTE REDUCTION: AN EARLY STOPPING STRATEGY
    Liu, Keyu
    Song, Jingjing
    Zhang, Wendong
    Yang, Xibe
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2018, : 190 - 195
  • [2] Policy evaluation using model over-fitting: the Nordic case
    Tapia, Armando
    Gonzalez, Silvestre L. L.
    Vergara, Jose R. R.
    Villafuerte, Mariano
    Montiel, Luis V. V.
    [J]. COMPUTATIONAL STATISTICS, 2023,
  • [3] On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation
    Cawley, Gavin C.
    Talbot, Nicola L. C.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 2079 - 2107
  • [4] Under-Fitting and Over-Fitting: The Performance of Bayesian Model Selection and Fit Indices in SEM
    Depaoli, Sarah
    Winter, Sonja D.
    Liu, Haiyan
    [J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2024, 31 (04) : 604 - 625
  • [5] On over-fitting in model selection and subsequent selection bias in performance evaluation
    Cawley, Gavin C.
    Talbot, Nicola L. C.
    [J]. Journal of Machine Learning Research, 2010, 11 : 2079 - 2107
  • [6] Effectiveness of Random Search in SVM hyper-parameter tuning
    Mantovani, Rafael G.
    Rossi, Andre L. D.
    Vanschoren, Joaquin
    Bischl, Bernd
    de Carvalho, Andre C. P. L. F.
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [7] Study of the Over-Fitting in Building PLS Model Using Orthogonal Signal Correction
    Zhang Xian
    Yuan Hong-fu
    Guo Zheng
    Song Chun-feng
    Li Xiao-yu
    Xie Jin-chun
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2011, 31 (06) : 1688 - 1691
  • [8] Agricultural Case Studies of Classification Accuracy, Spectral Resolution, and Model Over-Fitting
    Nansen, Christian
    Geremias, Leandro Delalibera
    Xue, Yingen
    Huang, Fangneng
    Parra, Jose Roberto
    [J]. APPLIED SPECTROSCOPY, 2013, 67 (11) : 1332 - 1338
  • [9] Restricted Boltzmann Machine with Multivalued Hidden VariablesA Model Suppressing Over-Fitting
    Yuuki Yokoyama
    Tomu Katsumata
    Muneki Yasuda
    [J]. The Review of Socionetwork Strategies, 2019, 13 : 253 - 266
  • [10] Restricted Boltzmann Machine with Multivalued Hidden Variables A Model Suppressing Over-Fitting
    Yokoyama, Yuuki
    Katsumata, Tomu
    Yasuda, Muneki
    [J]. REVIEW OF SOCIONETWORK STRATEGIES, 2019, 13 (02): : 253 - 266