PREDICTING PALEOCLIMATE FROM COMPOSITIONAL DATA USING MULTIVARIATE GAUSSIAN PROCESS INVERSE PREDICTION

被引:2
|
作者
Tipton, John R. [1 ]
Hooten, Mevin B. [2 ,3 ]
Nolan, Connor [4 ]
Booth, Robert K. [5 ]
McLachlan, Jason [6 ]
机构
[1] Univ Arkansas, Dept Math Sci, Fayetteville, AR 72701 USA
[2] Colorado State Univ, Dept Stat, Ft Collins, CO 80523 USA
[3] US Geol Survey, Colorado Cooperat Fish & Wildlife Res Unit, Dept Fish Wildlife & Conservat Biol, Ft Collins, CO 80523 USA
[4] Univ Arizona, Dept Geosci, Tucson, AZ 85721 USA
[5] Lehigh Univ, Earth & Environm Sci Dept, Bethlehem, PA 18015 USA
[6] Univ Notre Dame, Dept Biol, Notre Dame, IN 46556 USA
来源
ANNALS OF APPLIED STATISTICS | 2019年 / 13卷 / 04期
基金
美国国家科学基金会;
关键词
Bayesian hierarchical models; predictive validation; model comparison; ecological functional response model; BAYESIAN-INFERENCE; FOREST COMPOSITION; MODEL; PH; DIATOMS; RECONSTRUCTION; CALIBRATION; LIKELIHOOD; REGRESSION; ANALOGS;
D O I
10.1214/19-AOAS1281
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multivariate compositional count data arise in many applications including ecology, microbiology, genetics and paleoclimate. A frequent question in the analysis of multivariate compositional count data is what underlying values of a covariate(s) give rise to the observed composition. Learning the relationship between covariates and the compositional count allows for inverse prediction of unobserved covariates given compositional count observations. Gaussian processes provide a flexible framework for modeling functional responses with respect to a covariate without assuming a functional form. Many scientific disciplines use Gaussian process approximations to improve prediction and make inference on latent processes and parameters. When prediction is desired on unobserved covariates given realizations of the response variable, this is called inverse prediction. Because inverse prediction is often mathematically and computationally challenging, predicting unobserved covariates often requires fitting models that are different from the hypothesized generative model. We present a novel computational framework that allows for efficient inverse prediction using a Gaussian process approximation to generative models. Our framework enables scientific learning about how the latent processes co-vary with respect to covariates while simultaneously providing predictions of missing covariates. The proposed framework is capable of efficiently exploring the high dimensional, multi-modal latent spaces that arise in the inverse problem. To demonstrate flexibility, we apply our method in a generalized linear model framework to predict latent climate states given multivariate count data. Based on cross-validation, our model has predictive skill competitive with current methods while simultaneously providing formal, statistical inference on the underlying community dynamics of the biological system previously not available.
引用
收藏
页码:2363 / 2388
页数:26
相关论文
共 50 条
  • [21] Gaussian Process Regression for a PMV Prediction Model using Environmental Monitoring Data
    Yoon, Young Ran
    Moon, Hyeun Jun
    Kim, Sun Ho
    Kim, Jeong Won
    PROCEEDINGS OF BUILDING SIMULATION 2019: 16TH CONFERENCE OF IBPSA, 2020, : 2540 - 2545
  • [22] Predicting target data rates for dynamic spectrum allocation using Gaussian process regression
    Njoku, Judith Nkechinyere
    Morocho-Cayamcela, Manuel Eugenio
    Caliwag, Angela
    Xiao, Pei
    Lim, Wansu
    ICT EXPRESS, 2022, 8 (02): : 207 - 212
  • [23] Multivariate data imputation using Gaussian mixture models
    Silva, Diogo S. F.
    Deutsch, Clayton, V
    SPATIAL STATISTICS, 2018, 27 : 74 - 90
  • [24] Reconstructing paleoclimate fields using online data assimilation with a linear inverse model
    Perkins, Walter A.
    Hakim, Gregory J.
    CLIMATE OF THE PAST, 2017, 13 (05) : 421 - 436
  • [25] Dynamic prediction of reliability of in-service RC bridges using the Bayesian updating and inverse gaussian process
    Chen L.
    Huang T.-L.
    Gongcheng Lixue/Engineering Mechanics, 2020, 37 (04): : 186 - 195
  • [26] Prediction of Plasma Simulation Data with the Gaussian Process Method
    Preuss, R.
    von Toussaint, U.
    BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, MAXENT 2013, 2014, 1636 : 118 - 123
  • [27] Modal Analysis of Spatiotemporal Data via Multivariate Gaussian Process Regression
    Song, Jiwoo
    Huang, Daning
    AIAA JOURNAL, 2025, 63 (02) : 732 - 749
  • [28] Predicting Encounters in Opportunistic Networks using Gaussian Process
    Chilipirea, Cristian
    Petre, Andreea-Cristina
    Dobre, Ciprian
    19TH INTERNATIONAL CONFERENCE ON CONTROL SYSTEMS AND COMPUTER SCIENCE (CSCS 2013), 2013, : 99 - 105
  • [29] Infinite Mixtures of Multivariate Normal-Inverse Gaussian Distributions for Clustering of Skewed Data
    Fang, Yuan
    Karlis, Dimitris
    Subedi, Sanjeena
    JOURNAL OF CLASSIFICATION, 2022, 39 (03) : 510 - 552
  • [30] Infinite Mixtures of Multivariate Normal-Inverse Gaussian Distributions for Clustering of Skewed Data
    Yuan Fang
    Dimitris Karlis
    Sanjeena Subedi
    Journal of Classification, 2022, 39 : 510 - 552