PREDICTING PALEOCLIMATE FROM COMPOSITIONAL DATA USING MULTIVARIATE GAUSSIAN PROCESS INVERSE PREDICTION

被引:2
|
作者
Tipton, John R. [1 ]
Hooten, Mevin B. [2 ,3 ]
Nolan, Connor [4 ]
Booth, Robert K. [5 ]
McLachlan, Jason [6 ]
机构
[1] Univ Arkansas, Dept Math Sci, Fayetteville, AR 72701 USA
[2] Colorado State Univ, Dept Stat, Ft Collins, CO 80523 USA
[3] US Geol Survey, Colorado Cooperat Fish & Wildlife Res Unit, Dept Fish Wildlife & Conservat Biol, Ft Collins, CO 80523 USA
[4] Univ Arizona, Dept Geosci, Tucson, AZ 85721 USA
[5] Lehigh Univ, Earth & Environm Sci Dept, Bethlehem, PA 18015 USA
[6] Univ Notre Dame, Dept Biol, Notre Dame, IN 46556 USA
来源
ANNALS OF APPLIED STATISTICS | 2019年 / 13卷 / 04期
基金
美国国家科学基金会;
关键词
Bayesian hierarchical models; predictive validation; model comparison; ecological functional response model; BAYESIAN-INFERENCE; FOREST COMPOSITION; MODEL; PH; DIATOMS; RECONSTRUCTION; CALIBRATION; LIKELIHOOD; REGRESSION; ANALOGS;
D O I
10.1214/19-AOAS1281
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multivariate compositional count data arise in many applications including ecology, microbiology, genetics and paleoclimate. A frequent question in the analysis of multivariate compositional count data is what underlying values of a covariate(s) give rise to the observed composition. Learning the relationship between covariates and the compositional count allows for inverse prediction of unobserved covariates given compositional count observations. Gaussian processes provide a flexible framework for modeling functional responses with respect to a covariate without assuming a functional form. Many scientific disciplines use Gaussian process approximations to improve prediction and make inference on latent processes and parameters. When prediction is desired on unobserved covariates given realizations of the response variable, this is called inverse prediction. Because inverse prediction is often mathematically and computationally challenging, predicting unobserved covariates often requires fitting models that are different from the hypothesized generative model. We present a novel computational framework that allows for efficient inverse prediction using a Gaussian process approximation to generative models. Our framework enables scientific learning about how the latent processes co-vary with respect to covariates while simultaneously providing predictions of missing covariates. The proposed framework is capable of efficiently exploring the high dimensional, multi-modal latent spaces that arise in the inverse problem. To demonstrate flexibility, we apply our method in a generalized linear model framework to predict latent climate states given multivariate count data. Based on cross-validation, our model has predictive skill competitive with current methods while simultaneously providing formal, statistical inference on the underlying community dynamics of the biological system previously not available.
引用
收藏
页码:2363 / 2388
页数:26
相关论文
共 50 条
  • [31] Prediction of Multivariate Time Series with Sparse Gaussian Process Echo State Network
    Han, Min
    Ren, Weijie
    Xu, Meiling
    PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 510 - 513
  • [32] Modeling electricity forward prices using the multivariate normal inverse Gaussian distribution
    Andresen, Arne
    Koekebakker, Steen
    Westgaard, Sjur
    JOURNAL OF ENERGY MARKETS, 2010, 3 (03) : 3 - 25
  • [33] Multivariate reparameterized inverse Gaussian processes with common effects for degradation-based reliability prediction
    Zhuang, Liangliang
    Xu, Ancha
    Fang, Guanqi
    Tang, Yincai
    JOURNAL OF QUALITY TECHNOLOGY, 2025, 57 (01) : 51 - 67
  • [34] MODELING AND PREDICTION OF SMART POWER SEMICONDUCTOR LIFETIME DATA USING A GAUSSIAN PROCESS PRIOR
    Plankensteiner, Kathrin
    Bluder, Olivia
    Pilz, Juergen
    PROCEEDINGS OF THE 2014 WINTER SIMULATION CONFERENCE (WSC), 2014, : 2671 - 2681
  • [35] Wind power prediction with missing data using Gaussian process regression and multiple imputation
    Liu, Tianhong
    Wei, Haikun
    Zhang, Kanjian
    APPLIED SOFT COMPUTING, 2018, 71 : 905 - 916
  • [36] Inverse lumping: Estimating compositional data from lumped information
    Schlijper, A.G.
    Drohm, J.K.
    SPE Reservoir Engineering (Society of Petroleum Engineers), 1988, 3 (03): : 1083 - 1089
  • [37] Predicting the organoleptic stability of beer from chemical data using multivariate analysis
    Guido, Luis Ferreira
    Curto, Andreia
    Boivin, Patrick
    Benismail, Nizar
    Goncalves, Cristina
    Barros, Aquiles Araujo
    EUROPEAN FOOD RESEARCH AND TECHNOLOGY, 2007, 226 (1-2) : 57 - 62
  • [38] Predicting the organoleptic stability of beer from chemical data using multivariate analysis
    Luís Ferreira Guido
    Andreia Curto
    Patrick Boivin
    Nizar Benismail
    Cristina Gonçalves
    Aquiles Araújo Barros
    European Food Research and Technology, 2007, 226 : 57 - 62
  • [39] Accelerated Degradation Test Planning Using the Inverse Gaussian Process
    Ye, Zhi-Sheng
    Chen, Liang-Peng
    Tang, Loon Ching
    Xie, Min
    IEEE TRANSACTIONS ON RELIABILITY, 2014, 63 (03) : 750 - 763
  • [40] Approximation for the Normal Inverse Gaussian Process Using Random Sums
    Pacheco-Gonzalez, Carlos G.
    STOCHASTIC ANALYSIS AND APPLICATIONS, 2009, 27 (06) : 1191 - 1200