Accelerating the SVM learning for very large data sets

Cited by: 0
Authors
Sung, Eric [1]
Yan, Zhu [1]
Li, Xuchun [1]
Affiliations
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose an original sequential learning algorithm, SBA, that enables the SVM to learn efficiently from only a small subset of the input data set. The principle is to sequentially add convex hull points of the two classes to a small training pool. The SVM is trained on the current training pool, and its result is used to find the data point that is wrongly classified and furthest from the current optimal hyperplane. This point is added to the training pool and the SVM is retrained on the enlarged pool. The iteration stops when no more such points are found. A formal proof of strict convergence is provided, and we derive a geometric bound on the training time. We also explain how SBA can be extended to handle nonlinear and non-separable class distributions. Experimental trials on some well-known data sets verify the speed advantage of our method, coupled with any SVM, over both that same SVM used alone and the core vector machine.
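The loop described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`train_linear_svm`, `sba_select`), the initialization of the pool with the first point of each class, and the small subgradient-descent trainer used as a stand-in for "any SVM" are all assumptions made for illustration. Only the linear case is covered.

```python
from typing import List, Sequence, Tuple

Point = Sequence[float]


def decision(w: Sequence[float], b: float, x: Point) -> float:
    """Signed value of the linear decision function w.x + b."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def train_linear_svm(X: List[Point], y: List[int], lam: float = 0.01,
                     lr: float = 0.1, epochs: int = 500) -> Tuple[List[float], float]:
    """Stand-in trainer (assumption): batch subgradient descent on the
    L2-regularized hinge loss. Any SVM solver could be substituted here."""
    dim = len(X[0])
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]
        grad_b = 0.0
        for xi, yi in zip(X, y):
            if yi * decision(w, b, xi) < 1.0:  # point violates the margin
                for j in range(dim):
                    grad_w[j] -= yi * xi[j] / len(X)
                grad_b -= yi / len(X)
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def sba_select(X: List[Point], y: List[int]) -> Tuple[List[int], List[float], float]:
    """Grow a training pool one point at a time, as the abstract describes.

    Labels must be +1 / -1. Returns the pool indices and the last (w, b).
    """
    # Assumed initialization: one arbitrary point from each class.
    pool = [y.index(+1), y.index(-1)]
    while True:
        w, b = train_linear_svm([X[i] for i in pool], [y[i] for i in pool])
        in_pool = set(pool)
        # Signed margins of all points not yet in the pool.
        margins = [(y[i] * decision(w, b, X[i]), i)
                   for i in range(len(X)) if i not in in_pool]
        wrong = [(m, i) for m, i in margins if m <= 0.0]
        if not wrong:
            return pool, w, b  # no misclassified point left outside the pool
        # Most negative margin = misclassified point furthest from the hyperplane.
        pool.append(min(wrong)[1])
```

Because each pass adds a point that is not yet in the pool, the loop in this sketch ends after at most as many passes as there are data points; the convergence proof and geometric bound mentioned in the abstract are separate results that the sketch does not reproduce.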
Pages: 484 / +
Page count: 2
Related papers
50 in total
  • [1] A Geometric Approach to Train SVM on Very Large Data Sets
    Zeng, Zhi-Qiang
    Xu, Hua-Rong
    Xie, Yan-Qi
    Gao, Ji
    2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 991 - +
  • [2] Fast SVM training algorithm with decomposition on very large data sets
    Dong, JX
    Krzyzak, A
    Suen, CY
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (04) : 603 - 618
  • [3] Decision tree learning on very large data sets
    Hall, LO
    Chawla, N
    Bowyer, KW
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 2579 - 2584
  • [4] On-line learning for very large data sets
    Bottou, L
    Le Cun, Y
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2005, 21 (02) : 137 - 151
  • [5] Core vector machines: Fast SVM training on very large data sets
    Tsang, IW
    Kwok, JT
    Cheung, PM
    JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 363 - 392
  • [6] Comments on the Core vector machines: Fast SVM training on very large data sets
    LITIS, INSA de Rouen, Avenue de l'Université, 76801 Saint-Etienne du Rouvray, France
    J. Mach. Learn. Res., 2007, (291-301):
  • [7] Comments on the "Core Vector Machines: Fast SVM training on very large data sets"
    Loosli, Gaelle
    Canu, Stephane
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 291 - 301
  • [8] Sequential learning with LS-SVM for large-scale data sets
    Jung, Tobias
    Polani, Daniel
    ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 2, 2006, 4132 : 381 - 390
  • [9] Diversified SVM ensembles for large data sets
    Tsang, Ivor W.
    Kocsor, Andras
    Kwok, James T.
    MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 792 - 800
  • [10] Data mining from extreme data sets: Very large and/or very skewed data sets
    Hall, LO
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 2555 - 2555