Kernel Conjugate Gradient for Fast Kernel Machines

被引:0
|
作者
Ratliff, Nathan D. [1 ]
Bagnell, J. Andrew [1 ]
机构
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel variant of the conjugate gradient algorithm, Kernel Conjugate Gradient (KCG), designed to speed up learning for kernel machines with differentiable loss functions. This approach leads to a better conditioned optimization problem during learning. We establish an upper bound on the number of iterations for KCG that indicates it should require less than the square root of the number of iterations that standard conjugate gradient requires. In practice, for various differentiable kernel learning problems, we find KCG consistently, and significantly, outperforms existing techniques. The algorithm is simple to implement, requires no more computation per iteration than standard approaches, and is well motivated by Reproducing Kernel Hilbert Space (RKHS) theory. We further show that data-structure techniques recently used to speed up kernel machine approaches are well matched to the algorithm by reducing the dominant costs of training: function evaluation and RKHS inner product computation.
引用
收藏
页码:1017 / 1022
页数:6
相关论文
共 50 条
  • [41] Deep Kernel machines: a survey
    Nikhitha, Nair K.
    Afzal, A. L.
    Asharaf, S.
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (02) : 537 - 556
  • [42] Bridging logic and kernel machines
    Diligenti, Michelangelo
    Gori, Marco
    Maggini, Marco
    Rigutini, Leonardo
    [J]. MACHINE LEARNING, 2012, 86 (01) : 57 - 88
  • [43] Constraint Verification With Kernel Machines
    Gori, Marco
    Melacci, Stefano
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (05) : 825 - 831
  • [44] Kernel machines with missing responses
    Liu, Tiantian
    Goldberg, Yair
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (02): : 3766 - 3820
  • [45] Kernel machines with missing covariates
    Liu, Tiantian
    Goldberg, Yair
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (02): : 2485 - 2538
  • [46] Nonlinear knowledge in kernel machines
    Mangasarian, Olvi L.
    Wild, Edward W.
    [J]. DATA MINING AND MATHEMATICAL PROGRAMMING, 2008, 45 : 181 - 198
  • [47] Sparse Representation in Kernel Machines
    Sun, Hongwei
    Wu, Qiang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) : 2576 - 2582
  • [48] Kernel machines and Boolean functions
    Kowalczyk, A
    Smola, AJ
    Williamson, RC
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 439 - 446
  • [49] Accelerated Stochastic Gradient Method for Support Vector Machines Classification with Additive Kernel
    Wang, Xufeng
    Zhou, Shuisheng
    [J]. PROCEEDINGS FIRST INTERNATIONAL CONFERENCE ON ELECTRONICS INSTRUMENTATION & INFORMATION SYSTEMS (EIIS 2017), 2017, : 855 - 860
  • [50] The Fast Kernel Transform
    Ryan, John Paul
    Ament, Sebastian
    Gomes, Carla P.
    Damle, Anil
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151