Hyper-Parameter Initialization for Squared Exponential Kernel-based Gaussian Process Regression

被引：0

作者：

Ulapane, Nalika ^{[1
]}

Thiyagarajan, Karthick ^{[2
]}

Kodagoda, Sarath ^{[2
]}

机构：

[1] Univ Melbourne, Elect & Elect Engn, Parkville, Vic 3010, Australia

[2] Univ Technol Sydney, UTS Robot Inst, Ultimo, NSW 2007, Australia

来源：

PROCEEDINGS OF THE 15TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2020) | 2020年

关键词：

Gaussian Process; hyper-parameters; kernel; machine learning; nonlinear regression; optimization; squared exponential;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Hyper-parameter optimization is an essential task in the use of machine learning techniques. Such optimizations are typically done starting with an initial guess provided to hyper-parameter values followed by optimization (or minimization) of some cost function via gradient-based methods. The initial values become crucial since there is every chance for reaching local minimums in the cost functions being minimized, especially since gradient-based optimizing is done. Therefore, initializing hyper-parameters several limes and repeating Optimization to achieve the best solutions is usually attempted. Repetition of optimization can be computationally expensive when using techniques like Gaussian Process (GP) which has an O(n(3)) complexity, and not having a formal strategy to initialize hyperparameter values is an additional challenge. In general, re-initialization of hyper-parameter values in the contexts of many machine learning techniques including GP has been done at random over the years; some recent developments have proposed some initialization strategies based on the optimization of some meta loss cost functions. To simplify this challenge of hyperparameter initialization, this paper introduces a data-dependent deterministic initialization technique. The specific case of the squared exponential kernel-based GP regression problem is focused on, and the proposed technique brings novelty by being deterministic as opposed to random initialization, and fast (due to the deterministic nature) as opposed to optimizing some form of meta cost function as done in some previous works. Although global suitability of this initialization technique is not proven in this paper, as a preliminary study the technique's effectiveness is demonstrated via several synthetic as well as real data-based nonlinear regression examples, hinting that the technique may have the effectiveness for broader usage.

引用

页码：1154 / 1159

页数：6

共 50 条

[1] On the Influence of Ill-conditioned Regression Matrix on Hyper-parameter Estimators for Kernel-based Regularization Methods
Ju, Yue
Chen, Tianshi
Mu, Biqiang
Ljung, Lennart
2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 300 - 305
[2] A Family of Hyper-parameter Estimators Linking EB and SURE for Kernel-based Regularization Methods
Zhang M.
Chen T.
Mu B.
IEEE Transactions on Automatic Control, 2024, 69 (12) : 1 - 16
[3] EXACT O(N2) HYPER-PARAMETER OPTIMIZATION FOR GAUSSIAN PROCESS REGRESSION
Xu, Linning
Dai, Yijue
Zhang, Jiawei
Zhang, Ceyao
Yin, Feng
PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
[4] Simple, Fast and Accurate Hyper-parameter Tuning in Gaussian-kernel SVM
Chen, Guangliang
Florero-Salinas, Wilson
Li, Dan
2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 348 - 355
[5] TRANSITIONAL ANNEALED ADAPTIVE SLICE SAMPLING FOR GAUSSIAN PROCESS HYPER-PARAMETER ESTIMATION
Garbuno-Inigo, A.
DiazDelaO, F. A.
Zuev, K. M.
INTERNATIONAL JOURNAL FOR UNCERTAINTY QUANTIFICATION, 2016, 6 (04) : 341 - 359
[6] Nonparametric identification of batch process using two-dimensional kernel-based Gaussian process regression
Chen, Minghao
Xu, Zuhua
Zhao, Jun
Zhu, Yucai
Shao, Zhijiang
CHEMICAL ENGINEERING SCIENCE, 2022, 250
[7] Gaussian process hyper-parameter estimation using Parallel Asymptotically Independent Markov Sampling
Garbuno-Inigo, A.
DiazDelaO, F. A.
Zuev, K. M.
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 103 : 367 - 383
[8] Multiple Kernel Based Regularized System Identification with SURE Hyper-parameter Estimator
Hong, Shiying
Mu, Biqiang
Yin, Feng
Andersen, Martin S.
Chen, Tianshi
IFAC PAPERSONLINE, 2018, 51 (15): : 13 - 18
[9] A Gaussian Kernel-based Clustering Algorithm with Automatic Hyper-parameters Computation
de Carvalho, Francisco de A. T.
Ferreira, Marcelo R. P.
Simoes, Eduardo C.
ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 393 - 400
[10] Gaussian kernel-based fuzzy inference systems for high dimensional regression
Cai, Qianfeng
Hao, Zhifeng
Yang, Xiaowei
NEUROCOMPUTING, 2012, 77 (01) : 197 - 204

← 1 2 3 4 5 →