Identifying and characterizing extrapolation in multivariate response data

被引:12
|
作者
Bartley, Meridith L. [1 ]
Hanks, Ephraim M. [1 ]
Schliep, Erin M. [2 ]
Soranno, Patricia A. [3 ]
Wagner, Tyler [4 ]
机构
[1] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[2] Univ Missouri, Dept Stat, Columbia, MO 65211 USA
[3] Michigan State Univ, Dept Fisheries & Wildlife, E Lansing, MI 48824 USA
[4] Penn State Univ, Penn Cooperat Fish & Wildlife Res Unit, US Geol Survey, University Pk, PA 16802 USA
来源
PLOS ONE | 2019年 / 14卷 / 12期
关键词
PREDICTION; PHOSPHORUS; WATERS;
D O I
10.1371/journal.pone.0225715
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Faced with limitations in data availability, funding, and time constraints, ecologists are often tasked with making predictions beyond the range of their data. In ecological studies, it is not always obvious when and where extrapolation occurs because of the multivariate nature of the data. Previous work on identifying extrapolation has focused on univariate response data, but these methods are not directly applicable to multivariate response data, which are common in ecological investigations. In this paper, we extend previous work that identified extrapolation by applying the predictive variance from the univariate setting to the multivariate case. We propose using the trace or determinant of the predictive variance matrix to obtain a scalar value measure that, when paired with a selected cutoff value, allows for delineation between prediction and extrapolation. We illustrate our approach through an analysis of jointly modeled lake nutrients and indicators of algal biomass and water clarity in over 7000 inland lakes from across the Northeast and Mid-west US. In addition, we outline novel exploratory approaches for identifying regions of covariate space where extrapolation is more likely to occur using classification and regression trees. The use of our Multivariate Predictive Variance (MVPV) measures and multiple cutoff values when exploring the validity of predictions made from multivariate statistical models can help guide ecological inferences.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Characterizing the response of PET and fMRI data using multivariate linear models
    Worsley, KJ
    Poline, JB
    Friston, KJ
    Evans, AC
    NEUROIMAGE, 1997, 6 (04) : 305 - 319
  • [2] Characterizing the multivariate physiogenomic response to environmental change
    Lotterhos, Katie E.
    MOLECULAR ECOLOGY, 2019, 28 (11) : 2711 - 2714
  • [3] IDENTIFYING MULTIPLE OUTLIERS IN MULTIVARIATE DATA
    HADI, AS
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1992, 54 (03): : 761 - 771
  • [4] Biomechanical data characterizing for multifactor and multivariate analysis
    Loslever, P
    Flahaut, JJ
    I S B S 1995 PROCEEDINGS - XIII INTERNATIONAL SYMPOSIUM FOR BIOMECHANICS IN SPORT, 1996, : 399 - 403
  • [5] AN ALGORITHM FOR IDENTIFYING STRUCTURAL MODELS OF MULTIVARIATE DATA
    KRIPPENDORFF, K
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1981, 7 (01) : 63 - 79
  • [6] Data Extrapolation in Social Sensing for Disaster Response
    Gu, Siyu
    Pan, Chenji
    Liu, Hengchang
    Li, Shen
    Hu, Shaohan
    Su, Lu
    Wang, Shiguang
    Wang, Dong
    Amin, Tanvir
    Govindan, Ramesh
    Aggarwal, Charu
    Ganti, Raghu
    Srivatsa, Mudhakar
    Barnoy, Amotz
    Terlecky, Peter
    Abdelzaher, Tarek
    2014 IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (IEEE DCOSS 2014), 2014, : 119 - 126
  • [7] Identifying and Characterizing Truck Stops from GPS Data
    Aziz, Russel
    Kedia, Manav
    Dan, Soham
    Basu, Sayantan
    Sarkar, Sudeshna
    Mitra, Sudeshna
    Mitra, Pabitra
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2016, 9728 : 168 - 182
  • [8] Multivariate extrapolation in the offshore environment
    Zachary, S
    Feld, G
    Ward, G
    Wolfram, J
    APPLIED OCEAN RESEARCH, 1998, 20 (05) : 273 - 295
  • [9] Response to "Identifying and characterizing basal cell carcinomas in persons with albinism"
    Juhasz, Margit
    Mavura, Daudi
    Kini, Lullyrita
    Levin, Melissa Kanachanapoomi
    Sharp, Andrew
    INTERNATIONAL JOURNAL OF DERMATOLOGY, 2021, 60 (01) : E1 - E1
  • [10] Framework for characterizing data and identifying anomalies in health care databases
    Savage, AM
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1999, : 374 - 378