Missing data analysis and imputation via latent Gaussian Markov random felds

被引:0
|
作者
Department of Mathematics, School of Industrial Engineering, Albacete, Universidad de Castilla-La Mancha, Spain [1 ]
不详 [2 ]
不详 [3 ]
机构
来源
SORT | / 2卷 / 217-243期
基金
美国国家卫生研究院; 英国医学研究理事会;
关键词
Health risks - Hierarchical systems - Regression analysis;
D O I
暂无
中图分类号
学科分类号
摘要
This paper recasts the problem of missing values in the covariates of a regression model as a latent Gaussian Markov random feld (GMRF) model in a fully Bayesian framework. The proposed approach is based on the defnition of the covariate imputation sub-model as a latent effect with a GMRF structure. This formulation works for continuous covariates but for categorical covariates a typical multiple imputation approach is employed. Both techniques can be easily combined for the case in which continuous and categorical variables have missing values. The resulting Bayesian hierarchical model naturally fts within the integrated nested Laplace approximation (INLA) framework, which is used for model ftting. Hence, this work flls an important gap in the INLA methodology as it allows to treat models with missing values in the covariates. As in any other fully Bayesian framework, by relying on INLA for model ftting it is possible to formulate a joint model for the data, the imputed covariates and their missingness mechanism. In this way, it is possible to tackle the more general problem of assessing the missingness mechanism by conducting a sensitivity analysis on the different alternatives to model the non-observed covariates. Finally, the proposed approach is illustrated in two examples on modeling health risk factors and disease mapping. © 2022 Institut d'Estadistica de Catalunya. All rights reserved.
引用
收藏
相关论文
共 50 条
  • [1] Missing data analysis and imputation via latent Gaussian Markov random fields
    Gomez-Rubio, Virgilio
    Cameletti, Michela
    Blangiardo, Marta
    [J]. SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2022, 46 (02) : 217 - 244
  • [2] Missing Value Imputation for Mixed Data via Gaussian Copula
    Zhao, Yuxuan
    Udell, Madeleine
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 636 - 646
  • [3] Imputation of data Missing Not at Random: Artificial generation and benchmark analysis
    Pereira, Ricardo Cardoso
    Abreu, Pedro Henriques
    Rodrigues, Pedro Pereira
    Figueiredo, Mario A. T.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [4] Multiple imputation of ordinal missing not at random data
    Hammon, Angelina
    [J]. ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2023, 107 (04) : 671 - 692
  • [5] Multiple imputation of ordinal missing not at random data
    Angelina Hammon
    [J]. AStA Advances in Statistical Analysis, 2023, 107 : 671 - 692
  • [6] Siamese Autoencoder Architecture for the Imputation of Data Missing Not at Random
    Pereira, Ricardo Cardoso
    Abreu, Pedro Henriques
    Rodrigues, Pedro Pereira
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2024, 78
  • [7] Identifiable Generative Models for Missing Not at Random Data Imputation
    Ma, Chao
    Zhang, Cheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Deep Generative Imputation Model for Missing Not At Random Data
    Chen, Jialei
    Xu, Yuanbo
    Wang, Pengyang
    Yang, Yongjian
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 316 - 325
  • [9] Multiple imputation of binary multilevel missing not at random data
    Hammon, Angelina
    Zinn, Sabine
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2020, 69 (03) : 547 - 564
  • [10] Efficient random imputation for missing data in complex surveys
    Chen, J
    Rao, JNK
    Sitter, RR
    [J]. STATISTICA SINICA, 2000, 10 (04) : 1153 - 1169