Training Data Selection for Support Vector Machines Model

被引:0
|
作者
Dang Huu Nghi [1 ]
Luong Chi Mai [2 ]
机构
[1] Hanoi Univ Min & Geol, Hanoi, Vietnam
[2] Vietnamse Acad Sci & Technol, Inst Informat Technol, 18 Hoang Quoc Viet Rd, Hanoi, Vietnam
关键词
Support vector machines; model selection; training data selection;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In recent years, Support Vector Machines (SVM) have become a popular tool for pattern recognition and machine learning. In training a support vector machine we need to select the parameters of the kernel function as well as soft margin parameter C of SVM. Thus to develop the optimal classifier we need to determine the optimal kernel parameter and the optimal value of C. Determining the optimal classifier is called model selection. When applied to a large data set, however, it requires a long time for training so the model selection task and its performance can be degraded a long time. To speed up training thereby shortening the time for model selection, several methods have been proposed, one of which is to reduce the training set size called training data selection. Recently, there has been considerable research on data selection for SVM training. The main idea is to select only the patterns that are likely to be located near the decision boundary. In this paper we propose a methods that select a subset of data for SVM training. Our experimental results show that a significant amount of training data can be removed by our proposed method without degrading the performance of the resulting SVM classifiers.
引用
收藏
页码:28 / 32
页数:5
相关论文
共 50 条
  • [1] Training data selection for support vector machines
    Wang, JG
    Neskovic, P
    Cooper, LN
    [J]. ADVANCES IN NATURAL COMPUTATION, PT 1, PROCEEDINGS, 2005, 3610 : 554 - 564
  • [2] Support Vector Machines Training Data Selection Using a Genetic Algorithm
    Kawulok, Michal
    Nalepa, Jakub
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 557 - 565
  • [3] Model selection for support vector machines
    Chapelle, O
    Vapnik, V
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 230 - 236
  • [4] Expediting model selection for support vector machines based on data reduction
    Ou, YY
    Chen, CY
    Hwang, SC
    Oyang, YJ
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 786 - 791
  • [5] Candidate vectors selection for training support vector machines
    Li, Minqiang
    Chen, Fuzan
    Kou, Jisong
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 1, PROCEEDINGS, 2007, : 538 - +
  • [6] Fast Model Selection for MaxMinOver-based Training of Support Vector Machines
    Timm, Fabian
    Klement, Sascha
    Martinetz, Thomas
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2316 - 2319
  • [7] Model Parameter Selection of Support Vector Machines
    Zhao, Mingyuan
    Tang, Ke
    Zhou, Mingtian
    Zhang, Fengli
    Zeng, Ling
    [J]. 2008 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 128 - +
  • [8] Bilevel model selection for Support Vector Machines
    Kunapuli, Gautam
    Bennett, Kristin P.
    Hu, Jing
    Pang, Jong-Shi
    [J]. DATA MINING AND MATHEMATICAL PROGRAMMING, 2008, 45 : 129 - 158
  • [9] Optimizing resources in model selection for support vector machines
    Adankon, MM
    Cheriet, M
    Ayat, NE
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 925 - 930
  • [10] In-sample Model Selection for Support Vector Machines
    Anguita, Davide
    Ghio, Alessandro
    Oneto, Luca
    Ridella, Sandro
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 1154 - 1161