Accelerating the SVM learning for very large data sets

被引：0

作者：

Sung, Eric ^{[1
]}

Yan, Zhu ^{[1
]}

Li Xuchun ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

来源：

18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS | 2006年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose an original sequential learning algorithm, SBA, that enables the SVM to efficiently learn from only a small subset of the input data set. The principle is based on sequentially adding convex hull points of the binary classes to a small subset. The SVM is trained on the current training pool and its result. is used to find the data which is wrongly classsified and furthest away from the current optimal hyperplane. This point is added to the training pool and the SVM is retrained on it. The iteration stops when no more suchpoints are found A formal proof of strict convergence is provided and we derive a geometric bound on the training time. It will be explained how SBA can be extended to handle non-linearly and non-separable class distributions. Experimental trials on some well known data sets verify the speed advantage of our method coupled to any SVM over that of that SVM used and the core vector machine.

引用

页码：484 / +

页数：2

共 50 条

[1] A Geometric Approach to Train SVM on Very Large Data Sets
Zeng, Zhi-Qiang
Xu, Hua-Rong
Xie, Yan-Qi
Gao, Ji
2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 991 - +
[2] Fast SVM training algorithm with decomposition on very large data sets
Dong, JX
Krzyzak, A
Suen, CY
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (04) : 603 - 618
[3] Decision tree learning on very large data sets
Hall, LO
Chawla, N
Bowyer, KW
1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 2579 - 2584
[4] On-line learning for very large data sets
Bottou, U
Le Cun, Y
APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2005, 21 (02) : 137 - 151
[5] Core vector machines: Fast SVM training on very large data sets
Tsang, IW
Kwok, JT
Cheung, PM
JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 363 - 392
[6] Comments on the Core vector machines: Fast SVM training on very large data sets
LITIS, INSA de Rouen, Avenue de l'Université, 76801 Saint-Etienne du Rouvray, France
J. Mach. Learn. Res., 2007, (291-301):
[7] Comments on the "Core Vector Machines: Fast SVM training on very large data sets"
Loosli, Gaelle
Canu, Stephane
JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 291 - 301
[8] Sequential learning with LS-SVM for large-scale data sets
Jung, Tobias
Polani, Daniel
ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 2, 2006, 4132 : 381 - 390
[9] Diversified SVM ensembles for large data sets
Tsang, Ivor W.
Kocsor, Andras
Kwok, James T.
MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 792 - 800
[10] Data mining from extreme data sets: Very large and/or very skewed data sets
Hall, LO
2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 2555 - 2555

← 1 2 3 4 5 →