Joint regression modeling for missing categorical covariates in generalized linear models

被引:1
|
作者
Carlos Perez-Ruiz, Luis [1 ]
Escarela, Gabriel [1 ]
机构
[1] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Av San Rafael Atlixco 186, Mexico City 09340, DF, Mexico
关键词
Copula; missing data; vines; pair copula constructions; EM algorithm; multivariate distribution; DEPENDENT RANDOM-VARIABLES; PAIR-COPULA CONSTRUCTIONS; VINES;
D O I
10.1080/02664763.2018.1438376
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Missing covariates data is a common issue in generalized linear models (GLMs). A model-based procedure arising from properly specifying joint models for both the partially observed covariates and the corresponding missing indicator variables represents a sound and flexible methodology, which lends itself to maximum likelihood estimation as the likelihood function is available in computable form. In this paper, a novel model-based methodology is proposed for the regression analysis of GLMs when the partially observed covariates are categorical. Pair-copula constructions are used as graphical tools in order to facilitate the specification of the high-dimensional probability distributions of the underlying missingness components. The model parameters are estimated by maximizing the weighted loglikelihood function by using an EM algorithm. In order to compare the performance of the proposed methodology with other well-established approaches, which include complete-cases and multiple imputation, several simulation experiments of Binomial, Poisson and Normal regressions are carried out under both missing at random and non-missing at random mechanisms scenarios. The methods are illustrated by modeling data from a stage III melanoma clinical trial. The results show that the methodology is rather robust and flexible, representing a competitive alternative to traditional techniques.
引用
下载
收藏
页码:2741 / 2759
页数:19
相关论文
共 50 条
  • [1] Conditional and unconditional categorical regression models with missing covariates
    Satten, GA
    Carroll, RJ
    BIOMETRICS, 2000, 56 (02) : 384 - 388
  • [2] Linear regression models with incomplete categorical covariates
    Toutenburg, H
    Nittner, T
    COMPUTATIONAL STATISTICS, 2002, 17 (02) : 215 - 232
  • [3] Linear Regression Models with Incomplete Categorical Covariates
    Helge Toutenburg
    Thomas Nittner
    Computational Statistics, 2002, 17 : 215 - 232
  • [4] Generalized partially linear models with missing covariates
    Liang, Hua
    JOURNAL OF MULTIVARIATE ANALYSIS, 2008, 99 (05) : 880 - 895
  • [5] Diagnostic Measures for Generalized Linear Models with Missing Covariates
    Zhu, Hongtu
    Ibrahim, Joseph G.
    Shi, Xiaoyan
    SCANDINAVIAN JOURNAL OF STATISTICS, 2009, 36 (04) : 686 - 712
  • [6] Local Influence for Generalized Linear Models with Missing Covariates
    Shi, Xiaoyan
    Zhu, Hongtu
    Ibrahim, Joseph G.
    BIOMETRICS, 2009, 65 (04) : 1164 - 1174
  • [7] Imputation and variable selection in linear regression models with missing covariates
    Yang, XW
    Belin, TR
    Boscardin, WJ
    BIOMETRICS, 2005, 61 (02) : 498 - 506
  • [8] Bias correction in logistic regression with missing categorical covariates
    Das, Ujjwal
    Maiti, Tapabrata
    Pradhan, Vivek
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2010, 140 (09) : 2478 - 2485
  • [9] Non-ignorable missing covariates in generalized linear models
    Lipsitz, SR
    Ibrahim, JG
    Chen, MH
    Peterson, H
    STATISTICS IN MEDICINE, 1999, 18 (17-18) : 2435 - 2448
  • [10] Model selection of generalized partially linear models with missing covariates
    Fu, Ying-Zi
    Chen, Xue-Dong
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (01) : 126 - 138