Modelling collinear and spatially correlated data

被引:26
|
作者
Liverani, Silvia [1 ,2 ,4 ]
Lavigne, Aurore [3 ]
Blangiardo, Marta
机构
[1] Brunel Univ London, Dept Math, Uxbridge UB8 3PH, Middx, England
[2] Cambridge Inst Publ Hlth, Med Res Ctr, Biostat Unit, Forvie Site,Robinson Way,Cambridge Biomed Campus, Cambridge CB2 0SR, England
[3] Domaine Univ Pont de Bois, Univ Lille 3, UFR MIME, BP 60149, F-59653 Villeneuve Dascq, France
[4] Imperial Coll London, Dept Epidemiol & Biostat, MRC PHE Ctr Environm & Hlth, 2 Norfolk Pl, London W2 8PG, England
基金
英国自然环境研究理事会;
关键词
Profile regression; Bayesian clustering; Spatial modelling; Collinearity; Index of multiple deprivation; Pollution;
D O I
10.1016/j.sste.2016.04.003
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
In this work we present a statistical approach to distinguish and interpret the complex relationship between several predictors and a response variable at the small area level, in the presence of (i) high correlation between the predictors and (ii) spatial correlation for the response. Covariates which are highly correlated create collinearity problems when used in a standard multiple regression model. Many methods have been proposed in the literature to address this issue. A very common approach is to create an index which aggregates all the highly correlated variables of interest. For example, it is well known that there is a relationship between social deprivation measured through the Multiple Deprivation Index (IMD) and air pollution; this index is then used as a confounder in assessing the effect of air pollution on health outcomes (e.g. respiratory hospital admissions or mortality). However it would be more informative to look specifically at each domain of the IMD and at its relationship with air pollution to better understand its role as a confounder in the epidemiological analyses. In this paper we illustrate how the complex relationships between the domains of IMD and air pollution can be deconstructed and analysed using profile regression, a Bayesian non-parametric model for clustering responses and covariates simultaneously. Moreover, we include an intrinsic spatial conditional autoregressive (ICAR) term to account for the spatial correlation of the response variable. (C) 2016 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:63 / 73
页数:11
相关论文
共 50 条
  • [1] The role of simulation in modelling spatially correlated data
    Augustin, NH
    Mugglestone, MA
    Buckland, ST
    ENVIRONMETRICS, 1998, 9 (02) : 175 - 196
  • [2] Modelling spatially correlated data via mixtures:: a Bayesian approach
    Fernández, C
    Green, PJ
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 : 805 - 826
  • [3] Modelling spatially correlated survival data for individuals with multiple cancers
    Diva, Ulysses
    Banerjee, Sudipto
    Dey, Dipak K.
    STATISTICAL MODELLING, 2007, 7 (02) : 191 - 213
  • [4] Modelling of a Spatially Correlated MIMO Wireless Channel
    Varghese, B. B.
    Maharaj, B. T.
    SAIEE AFRICA RESEARCH JOURNAL, 2009, 100 (01): : 24 - 31
  • [5] Intrinsic classification of spatially correlated data
    Wallace, CS
    COMPUTER JOURNAL, 1998, 41 (08): : 602 - 611
  • [6] Normality tests for spatially correlated data
    Pardo-Igúzquiza, E
    Dowd, PA
    MATHEMATICAL GEOLOGY, 2004, 36 (06): : 659 - 681
  • [7] Clustering Spatially Correlated Functional Data
    Romano, Elvira
    Giraldo, Ramon
    Mateu, Jorge
    RECENT ADVANCES IN FUNCTIONAL DATA ANALYSIS AND RELATED TOPICS, 2011, : 277 - 282
  • [8] Normality Tests for Spatially Correlated Data
    Eulogio Pardo-Igúzquiza
    Peter A. Dowd
    Mathematical Geology, 2004, 36 : 659 - 681
  • [9] Comparisons of spatially correlated binary data
    Sim, SY
    Johnson, RA
    STATISTICS & PROBABILITY LETTERS, 1998, 39 (02) : 81 - 87
  • [10] Longitudinal analysis of spatially correlated data
    Militino, Ana F.
    Ugarte, M. Dolores
    Ibanez, Berta
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2008, 22 (Suppl 1) : S49 - S57