Group Kernels for Gaussian Process Metamodels with Categorical Inputs

被引:26
|
作者
Roustant, Olivier [1 ,2 ]
Padonou, Esperan [2 ]
Deville, Yves [3 ]
Clement, Alois [4 ]
Perrin, Guillaume [5 ]
Giorla, Jean [4 ]
Wynn, Henry [6 ]
机构
[1] Univ Toulouse, INSA, Inst Math Toulouse, F-31077 Toulouse 4, France
[2] Univ Clermont Auvergne, CNRS, Mines St Etienne, UMR 6158 Limos, F-42023 St Etienne, France
[3] AlpeStat, F-73000 Chambery, France
[4] CEA, DAM, VA, F-21120 Is Sur Tille, France
[5] CEA, DAM, DIF, F-91297 Arpajon, France
[6] London Sch Econ, London WC2A 2AE, England
来源
基金
英国工程与自然科学研究理事会;
关键词
Gaussian process regression; categorical data; hierarchical model; kriging; qualitative data;
D O I
10.1137/18M1209386
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Gaussian processes (GPs) are widely used as a metamodel for emulating time-consuming computer codes. We focus on problems involving categorical inputs, with a potentially large number L of levels (typically several tens), partitioned in G << L groups of various sizes. Parsimonious covariance functions, or kernels, can then be defined by block covariance matrices T with constant covariances between pairs of blocks and within blocks. We study the positive definiteness of such matrices to encourage their practical use. The hierarchical group/level structure, equivalent to a nested Bayesian linear model, provides a parameterization of valid block matrices T. The same model can then be used when the assumption within blocks is relaxed, giving a flexible parametric family of valid covariance matrices with constant covariances between pairs of blocks. The positive definiteness of T is equivalent to the positive definiteness of a smaller matrix of size G, obtained by averaging each block. The model is applied to a problem in nuclear waste analysis, where one of the categorical inputs is atomic number, which has more than 90 levels.
引用
收藏
页码:775 / 806
页数:32
相关论文
共 50 条
  • [1] Classification and Categorical Inputs with Treed Gaussian Process Models
    Tamara Broderick
    Robert B. Gramacy
    Journal of Classification, 2011, 28 : 244 - 270
  • [2] Classification and Categorical Inputs with Treed Gaussian Process Models
    Broderick, Tamara
    Gramacy, Robert B.
    JOURNAL OF CLASSIFICATION, 2011, 28 (02) : 244 - 270
  • [3] Gaussian Process Regression with Categorical Inputs for Predicting the Blood Glucose Level
    Tomczak, Jakub M.
    ADVANCES IN SYSTEMS SCIENCE, ICSS 2016, 2017, 539 : 98 - 108
  • [4] Gaussian process metamodels for floating offshore wind turbine platforms
    Rajiv, Gayathry
    Verma, Mohit
    Subbulakshmi, A.
    OCEAN ENGINEERING, 2023, 267
  • [5] A Gaussian Process Regression Model for Distribution Inputs
    Bachoc, Francois
    Gamboa, Fabrice
    Loubes, Jean-Michel
    Venet, Nil
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (10) : 6620 - 6637
  • [6] A mixed-categorical correlation kernel for Gaussian process
    Saves, P.
    Diouane, Y.
    Bartoli, N.
    Lefebvre, T.
    Morlier, J.
    NEUROCOMPUTING, 2023, 550
  • [7] Latent Gaussian process for anomaly detection in categorical data
    Lv, Fengmao
    Liang, Tao
    Zhao, Jiayi
    Zhuo, Zhongliu
    Wu, Jinzhao
    Yang, Guowu
    KNOWLEDGE-BASED SYSTEMS, 2021, 220
  • [8] Categorical Inputs, Sensitivity Analysis, Optimization and Importance Tempering with tgp Version 2, an R Package for Treed Gaussian Process Models
    Gramacy, Robert B.
    Taddy, Matthew
    JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (06): : 1 - 48
  • [9] Evaluating Gaussian process metamodels and sequential designs for noisy level set estimation
    Xiong Lyu
    Mickaël Binois
    Michael Ludkovski
    Statistics and Computing, 2021, 31
  • [10] Amortized Inference for Gaussian Process Hyperparameters of Structured Kernels
    Bitzer, Matthias
    Meister, Mona
    Zimmer, Christoph
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 184 - 194