PREDICTING PALEOCLIMATE FROM COMPOSITIONAL DATA USING MULTIVARIATE GAUSSIAN PROCESS INVERSE PREDICTION

被引:2
|
作者
Tipton, John R. [1 ]
Hooten, Mevin B. [2 ,3 ]
Nolan, Connor [4 ]
Booth, Robert K. [5 ]
McLachlan, Jason [6 ]
机构
[1] Univ Arkansas, Dept Math Sci, Fayetteville, AR 72701 USA
[2] Colorado State Univ, Dept Stat, Ft Collins, CO 80523 USA
[3] US Geol Survey, Colorado Cooperat Fish & Wildlife Res Unit, Dept Fish Wildlife & Conservat Biol, Ft Collins, CO 80523 USA
[4] Univ Arizona, Dept Geosci, Tucson, AZ 85721 USA
[5] Lehigh Univ, Earth & Environm Sci Dept, Bethlehem, PA 18015 USA
[6] Univ Notre Dame, Dept Biol, Notre Dame, IN 46556 USA
来源
ANNALS OF APPLIED STATISTICS | 2019年 / 13卷 / 04期
基金
美国国家科学基金会;
关键词
Bayesian hierarchical models; predictive validation; model comparison; ecological functional response model; BAYESIAN-INFERENCE; FOREST COMPOSITION; MODEL; PH; DIATOMS; RECONSTRUCTION; CALIBRATION; LIKELIHOOD; REGRESSION; ANALOGS;
D O I
10.1214/19-AOAS1281
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multivariate compositional count data arise in many applications including ecology, microbiology, genetics and paleoclimate. A frequent question in the analysis of multivariate compositional count data is what underlying values of a covariate(s) give rise to the observed composition. Learning the relationship between covariates and the compositional count allows for inverse prediction of unobserved covariates given compositional count observations. Gaussian processes provide a flexible framework for modeling functional responses with respect to a covariate without assuming a functional form. Many scientific disciplines use Gaussian process approximations to improve prediction and make inference on latent processes and parameters. When prediction is desired on unobserved covariates given realizations of the response variable, this is called inverse prediction. Because inverse prediction is often mathematically and computationally challenging, predicting unobserved covariates often requires fitting models that are different from the hypothesized generative model. We present a novel computational framework that allows for efficient inverse prediction using a Gaussian process approximation to generative models. Our framework enables scientific learning about how the latent processes co-vary with respect to covariates while simultaneously providing predictions of missing covariates. The proposed framework is capable of efficiently exploring the high dimensional, multi-modal latent spaces that arise in the inverse problem. To demonstrate flexibility, we apply our method in a generalized linear model framework to predict latent climate states given multivariate count data. Based on cross-validation, our model has predictive skill competitive with current methods while simultaneously providing formal, statistical inference on the underlying community dynamics of the biological system previously not available.
引用
收藏
页码:2363 / 2388
页数:26
相关论文
共 50 条
  • [1] Sliced inverse regression method for multivariate compositional data modeling
    Wang, Huiwen
    Wang, Zhichao
    Wang, Shanshan
    STATISTICAL PAPERS, 2021, 62 (01) : 361 - 393
  • [2] Sliced inverse regression method for multivariate compositional data modeling
    Huiwen Wang
    Zhichao Wang
    Shanshan Wang
    Statistical Papers, 2021, 62 : 361 - 393
  • [3] Multivariate analysis for geochemical process identification using stream sediment geochemical data: A perspective from compositional data
    Liu, Yue
    Cheng, Qiuming
    Zhou, Kefa
    Xia, Qinglin
    Wang, Xinqing
    GEOCHEMICAL JOURNAL, 2016, 50 (04) : 293 - 314
  • [4] Compositional inverse Gaussian models with applications in compositional data analysis with possible zero observations
    Liu, Pengyi
    Tian, Guo-Liang
    Yuen, Kam Chuen
    Sun, Yuan
    Zhang, Chi
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (02) : 248 - 278
  • [5] Data-based Design of Inverse Dynamics Using Gaussian Process
    Lee, Junghoon
    Oh, Sehoon
    2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS (ICM), 2019, : 449 - 454
  • [6] An Inverse Gaussian Process Model for Degradation Data
    Wang, Xiao
    Xu, Dihua
    TECHNOMETRICS, 2010, 52 (02) : 188 - 197
  • [7] A Multivariate extension of inverse Gaussian distribution derived from inverse relationship
    Minami, M
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2003, 32 (12) : 2285 - 2304
  • [8] Spatial Prediction for Multivariate Non-Gaussian Data
    Liu, Xutong
    Chen, Feng
    Lu, Yen-Cheng
    Lu, Chang-Tien
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2017, 11 (03)
  • [9] Regression analysis for multivariate process data of counts using convolved Gaussian processes
    Sofro, A'yunin
    Shi, Jian Qing
    Cao, Chunzheng
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2020, 206 : 57 - 74
  • [10] Prediction from a normal model using a generalized inverse Gaussian prior
    Thabane, L
    Haq, MS
    STATISTICAL PAPERS, 1999, 40 (02) : 175 - 184