Bilevel hyperparameter optimization for support vector classification: theoretical analysis and a solution method

Cited by: 3
Authors
Li, Qingna [1]
Li, Zhen [2]
Zemkoho, Alain [3]
Affiliations
[1] Beijing Inst Technol, Sch Math & Stat, Beijing Key Lab MCAACl, Key Lab Math Theory & Computat Informat Secur, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Math & Stat, Beijing 100081, Peoples R China
[3] Univ Southampton, Sch Math Sci, Southampton SO17 1BJ, Hants, England
Funding
National Science Foundation (USA); Engineering and Physical Sciences Research Council (UK)
Keywords
Support vector classification; Hyperparameter selection; Bilevel optimization; Mathematical program with equilibrium constraints; C-stationarity; MATHEMATICAL PROGRAMS; MODEL SELECTION; OPTIMALITY CONDITIONS; REFORMULATION; CONVERGENCE; ALGORITHM;
DOI
10.1007/s00186-022-00798-6
Chinese Library Classification (CLC) number
C93 [Management science]; O22 [Operations research];
Discipline classification code
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Abstract
Support vector classification (SVC) is a classical and well-performing learning method for classification problems. A regularization parameter, which significantly affects the classification performance, has to be chosen, and this is usually done by cross-validation. In this paper, we reformulate the hyperparameter selection problem for support vector classification as a bilevel optimization problem in which the upper-level problem minimizes the average number of misclassified data points over all the cross-validation folds, and the lower-level problems are the ℓ1-loss SVC problems, one for each fold in T-fold cross-validation. The resulting bilevel optimization model is then converted to a mathematical program with equilibrium constraints (MPEC). To solve this MPEC, we propose a global relaxation cross-validation algorithm (GR-CV) based on the well-known Scholtes-type global relaxation method (GRM). It is proven to converge to a C-stationary point. Moreover, we prove that the MPEC-tailored version of the Mangasarian-Fromovitz constraint qualification (MFCQ), which is a key property to guarantee the convergence of the GRM, automatically holds at each feasible point of this MPEC. Extensive numerical results verify the efficiency of the proposed approach. In particular, compared with other methods, our algorithm enjoys superior generalization performance over almost all the data sets used in this paper.
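The bilevel structure described in the abstract can be sketched as follows. This is an illustrative formulation only, not the paper's exact model: the symbols (fold index t, validation index set Ω_t, training index set Ω̄_t, weight vectors w^t, data points (x_i, y_i) with labels y_i in {-1, +1}) are notation assumed here, the bias term is omitted for brevity, and any bounds the authors place on C are not shown.

```latex
% Upper level: average misclassification rate over the T validation folds.
% Lower level: one l1-loss (hinge-loss) SVC problem per fold, sharing the same C.
% All symbols are assumed notation, not taken from the paper.
\begin{aligned}
\min_{C > 0,\; w^1, \dots, w^T} \quad
  & \frac{1}{T} \sum_{t=1}^{T} \frac{1}{|\Omega_t|}
    \sum_{i \in \Omega_t} \mathbf{1}\!\left[\, y_i \, x_i^{\top} w^t < 0 \,\right] \\
\text{s.t.} \quad
  & w^t \in \operatorname*{arg\,min}_{w}
    \left\{ \frac{1}{2} \lVert w \rVert_2^2
    + C \sum_{i \in \bar{\Omega}_t} \max\!\left(0,\; 1 - y_i \, x_i^{\top} w \right) \right\},
    \qquad t = 1, \dots, T.
\end{aligned}
```

Replacing each lower-level problem by its optimality conditions introduces complementarity constraints, which is how a model of this shape leads to the MPEC mentioned in the abstract.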
Pages: 315-350
Page count: 36
Related papers
50 in total
  • [1] Bilevel hyperparameter optimization for support vector classification: theoretical analysis and a solution method
    Qingna Li
    Zhen Li
    Alain Zemkoho
    Mathematical Methods of Operations Research, 2022, 96 : 315 - 350
  • [2] Handling Imbalanced Classification Problems With Support Vector Machines via Evolutionary Bilevel Optimization
    Rosales-Perez, Alejandro
    Garcia, Salvador
    Herrera, Francisco
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (08) : 4735 - 4747
  • [3] A Modification of Solution Optimization in Support Vector Machine Simplification for Classification
    Pham Quoc Thang
    Nguyen Thanh Thuy
    Hoang Thi Lam
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 149 - 158
  • [4] Theoretical analysis for solution of support vector data description
    Wang, Xiaoming
    Chung, Fu-lai
    Wang, Shitong
    NEURAL NETWORKS, 2011, 24 (04) : 360 - 369
  • [5] Rock Burst Intensity Classification Prediction Model Based on a Bayesian Hyperparameter Optimization Support Vector Machine
    Yan, Shaohong
    Zhang, Yanbo
    Liu, Xiangxin
    Liu, Runze
    MATHEMATICS, 2022, 10 (18)
  • [6] A fast smoothing newton method for bilevel hyperparameter optimization for SVC with logistic loss
    Wang, Yixin
    Li, Qingna
    OPTIMIZATION, 2024,
  • [7] Improved Penalty Method via Doubly Stochastic Gradients for Bilevel Hyperparameter Optimization
    Shi, Wanli
    Gu, Bin
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9621 - 9629
  • [8] Local Binary Pattern with Hyperparameter Tuned Support Vector Machine for Fingerprint Classification
    Chougule, Abhijeet
    Shah, Medha
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 1084 - 1087
  • [9] Hyperparameter Black-Box Optimization to Improve the Automatic Classification of Support Tickets
    Bruni, Renato
    Bianchi, Gianpiero
    Papa, Pasquale
    ALGORITHMS, 2023, 16 (01)
  • [10] Hyperparameter Learning Under Data Poisoning: Analysis of the Influence of Regularization via Multiobjective Bilevel Optimization
    Carnerero-Cano, Javier
    Munoz-Gonzalez, Luis
    Spencer, Phillippa
    Lupu, Emil C.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16008 - 16022