smashGP: Large-Scale Spatial Modeling via Matrix-Free Gaussian Processes

被引:0
|
作者
Erlandson, Lucas [1 ]
Gomez, Ana Maria Estrada [2 ]
Chow, Edmond [1 ]
Paynabar, Kamran [3 ]
机构
[1] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA USA
[2] Purdue Univ, Sch Ind Engn, W Lafayette, IN 47907 USA
[3] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA USA
关键词
Gaussian processes; Hierarchical matrices; Matrix-free methods; Spatial data analysis; RANDOM-FIELDS; CLUSTERS;
D O I
10.1080/10618600.2024.2353653
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Gaussian processes are essential for spatial data analysis. Not only do they allow the prediction of unknown values, but they also allow for uncertainty quantification. However, in the era of big data, directly using Gaussian processes has become computationally infeasible as cubic run times are required for dense matrix decomposition and inversion. Various alternatives have been proposed to reduce the computational burden of directly fitting Gaussian processes. These alternatives rely on assumptions on the underlying structure of the covariance or precision matrices, such as sparsity or low-rank. In contrast, this article uses hierarchical matrices and matrix-free methods to enable the computation of Gaussian processes for large spatial datasets by exploiting the underlying kernel properties. The proposed framework, smashGP, represents the covariance matrix as an H2 matrix in O(n) time and is able to estimate the unknown parameters of the model and predict the values of spatial observations at unobserved locations in O(n log n) time thanks to fast matrix-vector products. Additionally, it can be parallelized to take full advantage of shared-memory computing environments. With simulations and case studies, we illustrate the advantage of smashGP to model large-scale spatial datasets. Supplementary materials for this article are available online.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] laGP: Large-Scale Spatial Modeling via Local Approximate Gaussian Processes in R
    Gramacy, Robert B.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2016, 72 (01): : 1 - 46
  • [2] Matrix-free large-scale Bayesian inference in cosmology
    Jasche, Jens
    Lavaux, Guilhem
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2015, 447 (02) : 1204 - 1212
  • [3] Large-scale unconstrained optimization using separable cubic modeling and matrix-free subspace minimization
    C. P. Brás
    J. M. Martínez
    M. Raydan
    [J]. Computational Optimization and Applications, 2020, 75 : 169 - 205
  • [4] Large-scale unconstrained optimization using separable cubic modeling and matrix-free subspace minimization
    Bras, C. P.
    Martinez, J. M.
    Raydan, M.
    [J]. COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2020, 75 (01) : 169 - 205
  • [5] A matrix-free smoothing algorithm for large-scale support vector machines
    Ni, Tie
    Zhai, Jun
    [J]. INFORMATION SCIENCES, 2016, 358 : 29 - 43
  • [6] Large-scale simulation of mantle convection based on a new matrix-free approach
    Bauer, S.
    Huber, M.
    Ghelichkhan, S.
    Mohr, M.
    Ruede, U.
    Wohlmuth, B.
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2019, 31 : 60 - 76
  • [7] ALPAQA: A matrix-free solver for nonlinear MPC and large-scale nonconvex optimization
    Pas, Pieter
    Schuurmans, Mathijs
    Patrinos, Panagiotis
    [J]. 2022 EUROPEAN CONTROL CONFERENCE (ECC), 2022, : 417 - 422
  • [8] A new matrix-free algorithm for the large-scale trust-region subproblem
    Rojas, M
    Santos, SA
    Sorensen, DC
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2001, 11 (03) : 611 - 646
  • [9] A New Matrix-Free Approach for Large-Scale Geodynamic Simulations and its Performance
    Bauer, Simon
    Huber, Markus
    Mohr, Marcus
    Ruede, Ulrich
    Wohlmuth, Barbara
    [J]. COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 17 - 30
  • [10] Matrix-free algorithm for the large-scale constrained trust-region subproblem
    Kearsley, AJ
    [J]. OPTIMIZATION METHODS & SOFTWARE, 2006, 21 (02): : 233 - 245