Kernel Conjugate Gradient for Fast Kernel Machines

被引：0

作者：

Ratliff, Nathan D. ^{[1
]}

Bagnell, J. Andrew ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA

来源：

20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2007年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel variant of the conjugate gradient algorithm, Kernel Conjugate Gradient (KCG), designed to speed up learning for kernel machines with differentiable loss functions. This approach leads to a better conditioned optimization problem during learning. We establish an upper bound on the number of iterations for KCG that indicates it should require less than the square root of the number of iterations that standard conjugate gradient requires. In practice, for various differentiable kernel learning problems, we find KCG consistently, and significantly, outperforms existing techniques. The algorithm is simple to implement, requires no more computation per iteration than standard approaches, and is well motivated by Reproducing Kernel Hilbert Space (RKHS) theory. We further show that data-structure techniques recently used to speed up kernel machine approaches are well matched to the algorithm by reducing the dominant costs of training: function evaluation and RKHS inner product computation.

引用

页码：1017 / 1022

页数：6

共 50 条

[1] Conjugate gradients for kernel machines
Bartels, Simon
Hennig, Philipp
[J]. Journal of Machine Learning Research, 2020, 21
[2] Conjugate Gradients for Kernel Machines
Bartels, Simon
Hennig, Philipp
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
[3] The Kernel Conjugate Gradient Algorithms
Zhang, Ming
Wang, Xiaojian
Chen, Xiaoming
Zhang, Anxue
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (16) : 4377 - 4387
[4] A fast kernel extreme learning machine based on conjugate gradient
He, Chunmei
Xu, Fanhua
Liu, Yaqi
Zheng, Jinhua
[J]. NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2018, 29 (1-4) : 70 - 80
[5] Fast and Scalable Local Kernel Machines
Segata, Nicola
Blanzieri, Enrico
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 1883 - 1926
[6] Kernel conjugate gradient methods with random projections
Lin, Junhong
Cevher, Volkan
[J]. APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2021, 55 : 223 - 269
[7] DATA SELECTION KERNEL CONJUGATE GRADIENT ALGORITHM
Diniz, Paulo S. R.
Ferreira, Jonathas O.
Mendonca, Marcele O. K.
Ferreira, Tadeu N.
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 5440 - 5444
[8] Fuzzy Kernel Stochastic Gradient Descent Machines
Tuan Nguyen
Phuong Duong
Trung Le
Anh Le
Viet Ngo
Dat Tran
Ma, Wanli
[J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3226 - 3232
[9] Fast forecasting with simplified kernel regression machines
He, Wenwu
Wang, Zhizhong
[J]. CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 60 - +
[10] KERNEL LEAST MEAN SQUARE BASED ON CONJUGATE GRADIENT
Peng, Siyuan
Wu, Zongze
Ma, Wentao
Chen, Badong
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2796 - 2800

← 1 2 3 4 5 →