Joint regression modeling for missing categorical covariates in generalized linear models

被引:1
|
作者
Carlos Perez-Ruiz, Luis [1 ]
Escarela, Gabriel [1 ]
机构
[1] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Av San Rafael Atlixco 186, Mexico City 09340, DF, Mexico
关键词
Copula; missing data; vines; pair copula constructions; EM algorithm; multivariate distribution; DEPENDENT RANDOM-VARIABLES; PAIR-COPULA CONSTRUCTIONS; VINES;
D O I
10.1080/02664763.2018.1438376
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Missing covariates data is a common issue in generalized linear models (GLMs). A model-based procedure arising from properly specifying joint models for both the partially observed covariates and the corresponding missing indicator variables represents a sound and flexible methodology, which lends itself to maximum likelihood estimation as the likelihood function is available in computable form. In this paper, a novel model-based methodology is proposed for the regression analysis of GLMs when the partially observed covariates are categorical. Pair-copula constructions are used as graphical tools in order to facilitate the specification of the high-dimensional probability distributions of the underlying missingness components. The model parameters are estimated by maximizing the weighted loglikelihood function by using an EM algorithm. In order to compare the performance of the proposed methodology with other well-established approaches, which include complete-cases and multiple imputation, several simulation experiments of Binomial, Poisson and Normal regressions are carried out under both missing at random and non-missing at random mechanisms scenarios. The methods are illustrated by modeling data from a stage III melanoma clinical trial. The results show that the methodology is rather robust and flexible, representing a competitive alternative to traditional techniques.
引用
收藏
页码:2741 / 2759
页数:19
相关论文
共 50 条