Kernel Conjugate Gradient for Fast Kernel Machines

Citations: 0
Authors
Ratliff, Nathan D. [1 ]
Bagnell, J. Andrew [1 ]
Affiliations
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
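Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes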
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a novel variant of the conjugate gradient algorithm, Kernel Conjugate Gradient (KCG), designed to speed up learning for kernel machines with differentiable loss functions. This approach leads to a better conditioned optimization problem during learning. We establish an upper bound on the number of iterations for KCG that indicates it should require less than the square root of the number of iterations that standard conjugate gradient requires. In practice, for various differentiable kernel learning problems, we find KCG consistently, and significantly, outperforms existing techniques. The algorithm is simple to implement, requires no more computation per iteration than standard approaches, and is well motivated by Reproducing Kernel Hilbert Space (RKHS) theory. We further show that data-structure techniques recently used to speed up kernel machine approaches are well matched to the algorithm by reducing the dominant costs of training: function evaluation and RKHS inner product computation.
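To make the idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes a squared loss with an RKHS-norm regularizer (the abstract covers general differentiable losses), for which conjugate gradient has a closed-form step size. The function f = sum_i alpha_i k(x_i, .) is represented by its coefficient vector, search directions are functional gradients, and every inner product is the RKHS inner product u^T K v rather than the Euclidean u^T v. The names `rbf_kernel`, `kernel_cg_ridge`, `gamma`, and `lam` are hypothetical and chosen only for illustration.

```python
import numpy as np


def rbf_kernel(X, Z, gamma=1.0):
    """Gaussian (RBF) Gram matrix with entries exp(-gamma * ||x - z||^2)."""
    sq_dists = (
        np.sum(X**2, axis=1)[:, None]
        + np.sum(Z**2, axis=1)[None, :]
        - 2.0 * X @ Z.T
    )
    # Clip tiny negative values caused by floating-point rounding.
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


def kernel_cg_ridge(K, y, lam=0.1, tol=1e-10, max_iter=200):
    """Conjugate gradient in the RKHS for regularized least squares (a sketch).

    Assumed objective (not taken from the source):
        0.5 * sum_i (f(x_i) - y_i)^2 + 0.5 * lam * ||f||_H^2,
    with f = sum_i alpha_i k(x_i, .). Its functional gradient has
    coefficients (K + lam * I) @ alpha - y.
    """
    y = np.asarray(y, dtype=float)
    alpha = np.zeros_like(y)
    r = y.copy()           # negative functional gradient at alpha = 0
    d = r.copy()           # first search direction
    rs = r @ (K @ r)       # squared RKHS norm of the gradient
    rs0 = rs
    for _ in range(max_iter):
        # Relative stopping test on the RKHS norm of the gradient.
        if rs <= tol**2 * rs0:
            break
        Ad = K @ d + lam * d              # Hessian operator applied to d
        step = rs / (d @ (K @ Ad))        # exact line search in the RKHS
        alpha += step * d
        r -= step * Ad
        rs_new = r @ (K @ r)
        d = r + (rs_new / rs) * d         # conjugate direction update
        rs = rs_new
    return alpha
```

Under these assumptions the fitted values are `K @ alpha`, and they can be checked against a direct solve of `(K + lam * I) alpha = y`. For a non-quadratic differentiable loss, the closed-form step would have to be replaced by a line search.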
Pages: 1017-1022
Number of pages: 6
Related papers
50 items in total
  • [1] Conjugate gradients for kernel machines
    Bartels, Simon
    Hennig, Philipp
    [J]. Journal of Machine Learning Research, 2020, 21
  • [2] Conjugate Gradients for Kernel Machines
    Bartels, Simon
    Hennig, Philipp
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [3] The Kernel Conjugate Gradient Algorithms
    Zhang, Ming
    Wang, Xiaojian
    Chen, Xiaoming
    Zhang, Anxue
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (16) : 4377 - 4387
  • [4] A fast kernel extreme learning machine based on conjugate gradient
    He, Chunmei
    Xu, Fanhua
    Liu, Yaqi
    Zheng, Jinhua
    [J]. NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2018, 29 (1-4) : 70 - 80
  • [5] Fast and Scalable Local Kernel Machines
    Segata, Nicola
    Blanzieri, Enrico
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 1883 - 1926
  • [6] Kernel conjugate gradient methods with random projections
    Lin, Junhong
    Cevher, Volkan
    [J]. APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2021, 55 : 223 - 269
  • [7] DATA SELECTION KERNEL CONJUGATE GRADIENT ALGORITHM
    Diniz, Paulo S. R.
    Ferreira, Jonathas O.
    Mendonca, Marcele O. K.
    Ferreira, Tadeu N.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 5440 - 5444
  • [8] Fuzzy Kernel Stochastic Gradient Descent Machines
    Tuan Nguyen
    Phuong Duong
    Trung Le
    Anh Le
    Viet Ngo
    Dat Tran
    Ma, Wanli
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3226 - 3232
  • [9] Fast forecasting with simplified kernel regression machines
    He, Wenwu
    Wang, Zhizhong
    [J]. CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 60 - +
  • [10] KERNEL LEAST MEAN SQUARE BASED ON CONJUGATE GRADIENT
    Peng, Siyuan
    Wu, Zongze
    Ma, Wentao
    Chen, Badong
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2796 - 2800