Hyper-Parameter Initialization for Squared Exponential Kernel-based Gaussian Process Regression

被引:0
|
作者
Ulapane, Nalika [1 ]
Thiyagarajan, Karthick [2 ]
Kodagoda, Sarath [2 ]
机构
[1] Univ Melbourne, Elect & Elect Engn, Parkville, Vic 3010, Australia
[2] Univ Technol Sydney, UTS Robot Inst, Ultimo, NSW 2007, Australia
关键词
Gaussian Process; hyper-parameters; kernel; machine learning; nonlinear regression; optimization; squared exponential;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hyper-parameter optimization is an essential task in the use of machine learning techniques. Such optimizations are typically done starting with an initial guess provided to hyper-parameter values followed by optimization (or minimization) of some cost function via gradient-based methods. The initial values become crucial since there is every chance for reaching local minimums in the cost functions being minimized, especially since gradient-based optimizing is done. Therefore, initializing hyper-parameters several limes and repeating Optimization to achieve the best solutions is usually attempted. Repetition of optimization can be computationally expensive when using techniques like Gaussian Process (GP) which has an O(n(3)) complexity, and not having a formal strategy to initialize hyperparameter values is an additional challenge. In general, re-initialization of hyper-parameter values in the contexts of many machine learning techniques including GP has been done at random over the years; some recent developments have proposed some initialization strategies based on the optimization of some meta loss cost functions. To simplify this challenge of hyperparameter initialization, this paper introduces a data-dependent deterministic initialization technique. The specific case of the squared exponential kernel-based GP regression problem is focused on, and the proposed technique brings novelty by being deterministic as opposed to random initialization, and fast (due to the deterministic nature) as opposed to optimizing some form of meta cost function as done in some previous works. Although global suitability of this initialization technique is not proven in this paper, as a preliminary study the technique's effectiveness is demonstrated via several synthetic as well as real data-based nonlinear regression examples, hinting that the technique may have the effectiveness for broader usage.
引用
收藏
页码:1154 / 1159
页数:6
相关论文
共 50 条
  • [1] On the Influence of Ill-conditioned Regression Matrix on Hyper-parameter Estimators for Kernel-based Regularization Methods
    Ju, Yue
    Chen, Tianshi
    Mu, Biqiang
    Ljung, Lennart
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 300 - 305
  • [2] A Family of Hyper-parameter Estimators Linking EB and SURE for Kernel-based Regularization Methods
    Zhang M.
    Chen T.
    Mu B.
    IEEE Transactions on Automatic Control, 2024, 69 (12) : 1 - 16
  • [3] EXACT O(N2) HYPER-PARAMETER OPTIMIZATION FOR GAUSSIAN PROCESS REGRESSION
    Xu, Linning
    Dai, Yijue
    Zhang, Jiawei
    Zhang, Ceyao
    Yin, Feng
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [4] Simple, Fast and Accurate Hyper-parameter Tuning in Gaussian-kernel SVM
    Chen, Guangliang
    Florero-Salinas, Wilson
    Li, Dan
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 348 - 355
  • [5] TRANSITIONAL ANNEALED ADAPTIVE SLICE SAMPLING FOR GAUSSIAN PROCESS HYPER-PARAMETER ESTIMATION
    Garbuno-Inigo, A.
    DiazDelaO, F. A.
    Zuev, K. M.
    INTERNATIONAL JOURNAL FOR UNCERTAINTY QUANTIFICATION, 2016, 6 (04) : 341 - 359
  • [6] Nonparametric identification of batch process using two-dimensional kernel-based Gaussian process regression
    Chen, Minghao
    Xu, Zuhua
    Zhao, Jun
    Zhu, Yucai
    Shao, Zhijiang
    CHEMICAL ENGINEERING SCIENCE, 2022, 250
  • [7] Gaussian process hyper-parameter estimation using Parallel Asymptotically Independent Markov Sampling
    Garbuno-Inigo, A.
    DiazDelaO, F. A.
    Zuev, K. M.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 103 : 367 - 383
  • [8] Multiple Kernel Based Regularized System Identification with SURE Hyper-parameter Estimator
    Hong, Shiying
    Mu, Biqiang
    Yin, Feng
    Andersen, Martin S.
    Chen, Tianshi
    IFAC PAPERSONLINE, 2018, 51 (15): : 13 - 18
  • [9] A Gaussian Kernel-based Clustering Algorithm with Automatic Hyper-parameters Computation
    de Carvalho, Francisco de A. T.
    Ferreira, Marcelo R. P.
    Simoes, Eduardo C.
    ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 393 - 400
  • [10] Gaussian kernel-based fuzzy inference systems for high dimensional regression
    Cai, Qianfeng
    Hao, Zhifeng
    Yang, Xiaowei
    NEUROCOMPUTING, 2012, 77 (01) : 197 - 204