Scalable Bayesian modelling for smoothing disease risks in large spatial data sets using INLA

被引:18
|
作者
Orozco-Acosta, Erick [1 ,2 ,3 ]
Adin, Aritz [1 ,2 ,3 ]
Dolores Ugarte, Maria [1 ,2 ,3 ,4 ]
机构
[1] Univ Publ Navarra, Campus Arrosadia, Pamplona 31006, Spain
[2] Univ Publ Navarra, Dept Stat Comp Sci & Math, Navarra, Spain
[3] Univ Publ Navarra, Inst Adv Mat & Math InaMat, Navarra, Spain
[4] UNED, Ctr Asociado Pamplona, Madrid, Spain
关键词
High-dimensional data; Hierarchical models; Mixture models; Spatial epidemiology; MARKOV RANDOM-FIELDS; INFERENCE;
D O I
10.1016/j.spasta.2021.100496
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Several methods have been proposed in the spatial statistics literature to analyse big data sets in continuous domains. However, new methods for analysing high-dimensional areal data are still scarce. Here, we propose a scalable Bayesian modelling approach for smoothing mortality (or incidence) risks in high-dimensional data, that is, when the number of small areas is very large. The method is implemented in the R add-on package bigDM and it is based on the idea of "divide and conquer". Although this proposal could possibly be implemented using any Bayesian fitting technique, we use INLA here (integrated nested Laplace approximations) as it is now a well-known technique, computationally efficient, and easy for practitioners to handle. We analyse the proposal's empirical performance in a comprehensive simulation study that considers two model-free settings. Finally, the methodology is applied to analyse male colorectal cancer mortality in Spanish municipalities showing its benefits with regard to the standard approach in terms of goodness of fit and computational time. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条