STOCHASTIC GAUSSIAN PROCESS MODEL AVERAGING FOR HIGH-DIMENSIONAL INPUTS

被引:1
|
作者
Xuereb, Maxime [1 ]
Ng, Szu Hui [1 ]
Pedrielli, Giulia [2 ]
机构
[1] Natl Univ Singapore, Dept Ind Syst Engn & Management, 1 Engn Dr 2, Singapore 117576, Singapore
[2] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, 699 S Mill Ave, Tempe, AZ 85281 USA
关键词
SELECTION;
D O I
10.1109/WSC48552.2020.9384114
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Many statistical learning methodologies exhibit loss of efficiency and accuracy when applied to large, high-dimensional data-sets. Such loss is exacerbated by noisy data. In this paper, we focus on Gaussian Processes (GPs), a family of non-parametric approaches used in machine learning and Bayesian Optimization. In fact, GPs show difficulty scaling with the input data size and dimensionality. This paper presents, for the first time, the Stochastic GP Model Averaging (SGPMA) algorithm, to tackle both challenges. SGPMA uses a Bayesian approach to weight several predictors, each trained with an independent subset of the initial data-set (solving the large data-sets issue), and defined in a low-dimensional embedding of the original space (solving the high dimensionality). We conduct several experiments with different input size and dimensionality. The results show that our methodology is superior to naive averaging and that the embedding choice is critical to manage the computational cost / prediction accuracy trade-off.
引用
收藏
页码:373 / 384
页数:12
相关论文
共 50 条
  • [1] High-dimensional model averaging for quantile regression
    Xie, Jinhan
    Ding, Xianwen
    Jiang, Bei
    Yan, Xiaodong
    Kong, Linglong
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2024, 52 (02): : 618 - 635
  • [2] Model averaging with high-dimensional dependent data
    Zhao, Shangwei
    Zhou, Jianhong
    Li, Hongjun
    [J]. ECONOMICS LETTERS, 2016, 148 : 68 - 71
  • [3] Functional Martingale Residual Process for High-Dimensional Cox Regression with Model Averaging
    He, Baihua
    Liu, Yanyan
    Wu, Yuanshan
    Yin, Guosheng
    Zhao, Xingqiu
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [4] High-Dimensional Gaussian Process Inference with Derivatives
    de Roos, Filip
    Gessner, Alexandra
    Hennig, Philipp
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [5] High-dimensional Gaussian model selection on a Gaussian design
    Verzelen, Nicolas
    [J]. ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2010, 46 (02): : 480 - 524
  • [6] Jackknife model averaging for high-dimensional quantile regression
    Wang, Miaomiao
    Zhang, Xinyu
    Wan, Alan T. K.
    You, Kang
    Zou, Guohua
    [J]. BIOMETRICS, 2023, 79 (01) : 178 - 189
  • [7] A Model-Averaging Approach for High-Dimensional Regression
    Ando, Tomohiro
    Li, Ker-Chau
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (505) : 254 - 265
  • [8] Gaussian Process Emulation for High-Dimensional Coupled Systems
    Dolski, Tamara
    Spiller, Elaine T.
    Minkoff, Susan E.
    [J]. TECHNOMETRICS, 2024, 66 (03) : 455 - 469
  • [9] Regression on High-dimensional Inputs
    Kuleshov, Alexander
    Bernstein, Alexander
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 732 - 739
  • [10] HIGH-DIMENSIONAL DYNAMIC STOCHASTIC MODEL REPRESENTATION
    Eftekhari, Aryan
    Scheidegger, Simon
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2022, 44 (03): : C210 - C236