PATTERN-MIXTURE MODELS FOR MULTIVARIATE INCOMPLETE DATA

被引:643
|
作者
LITTLE, RJA
机构
关键词
EM ALGORITHM; IMPUTATION; MAXIMUM LIKELIHOOD; MISSING VALUES; MONOTONE MISSING DATA; MULTIPLE IMPUTATION; NONRESPONSE;
D O I
10.1080/01621459.1993.10594302
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Consider a random sample on variables X1,..., X(v) with some values of X(v) missing. Selection models specify the distribution of X1, ..., X(v) over respondents and nonrespondents to X(v), and the conditional distribution that X(v) is missing given X1,...,X(v). In contrast, pattern-mixture models specify the conditional distribution of X1,...,X(v) given that X(v) is observed or missing respectively and the marginal distribution of the binary indicator for whether or not X(v) is missing. For multivariate data with a general pattern of missing values, the literature has tended to adopt the selection-modeling approach (see for example Little and Rubin); here, pattern-mixture models are proposed for this more general problem. Pattern-mixture models are chronically underidentified; in particular for the case of univariate nonresponse mentioned above, there are no data on the distribution of X(v) given X1,...,X(v-1) in the stratum with X(v) missing. Thus the models require restrictions or prior information to identify the parameters. Complete-case restrictions tie unidentified parameters to their (identified) analogs in the stratum of complete cases. Alternative types of restriction tie unidentified parameters to parameters in other missing-value patterns or sets of such patterns. This large set of possible identifying restrictions yields a rich class of missing-data models. Unlike ignorable selection models, which generally requires iterative methods except for special missing-data patterns, some pattern-mixture models yield explicit ML estimates for general patterns. Such models are readily amenable to Bayesian methods and form a convenient basis for multiple imputation. Some previously considered noniterative estimation methods are shown to be maximum likelihood (ML) under a pattern-mixture model. For example, Buck's method for continuous data, corrected as in Beale and Little (1975), and Brown's estimators for nonrandomly missing data are ML for pattern-mixture models with particular complete-case restrictions. Available-case analyses, where the mean and variance of X(j) are computed using all cases with X(j) observed and the correlation (or covariance) of X(j) and X(k) is computed using all cases with X(j) and X(k) observed, are also close to ML for another pattern-mixture model. Asymptotic theory for this class of estimators is outlined.
引用
收藏
页码:125 / 134
页数:10
相关论文
共 50 条
  • [1] Pattern-mixture models for multivariate incomplete data with covariates
    Little, RJA
    Wang, YX
    [J]. BIOMETRICS, 1996, 52 (01) : 98 - 111
  • [2] A CLASS OF PATTERN-MIXTURE MODELS FOR NORMAL INCOMPLETE DATA
    LITTLE, RJA
    [J]. BIOMETRIKA, 1994, 81 (03) : 471 - 483
  • [3] Selection models and pattern-mixture models for incomplete data with covariates
    Michiels, B
    Molenberghs, G
    Lipsitz, SR
    [J]. BIOMETRICS, 1999, 55 (03) : 978 - 983
  • [4] Bayesian sensitivity analysis of incomplete data: bridging pattern-mixture and selection models
    Kaciroti, Niko A.
    Raghunathan, Trivellore
    [J]. STATISTICS IN MEDICINE, 2014, 33 (27) : 4841 - 4857
  • [5] Pseudo-likelihood for combined selection and pattern-mixture models for incomplete data
    Molenberghs, G
    Michiels, B
    Kenward, MG
    [J]. BIOMETRICAL JOURNAL, 1998, 40 (05) : 557 - 572
  • [6] Monotone missing data and pattern-mixture models
    Molenberghs, G
    Michiels, B
    Kenward, MG
    Diggle, PJ
    [J]. STATISTICA NEERLANDICA, 1998, 52 (02) : 153 - 161
  • [7] A pattern-mixture odds ratio model for incomplete categorical data.
    Michiels, B
    Molenberghs, G
    Lipsitz, SR
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1999, 28 (12) : 2843 - 2869
  • [8] Strategies to fit pattern-mixture models
    Thijs, H
    Molenberghs, G
    Michiels, B
    Verbeke, G
    Curran, D
    [J]. BIOSTATISTICS, 2002, 3 (02) : 245 - 265
  • [9] Pattern-Mixture Models with Incomplete Informative Cluster Size: Application to a Repeated Pregnancy Study
    Chaurasia, Ashok
    Liu, Danping
    Albert, Paul S.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2018, 67 (01) : 255 - 273
  • [10] Pattern-mixture models for analyzing normal outcome data with proxy respondents
    Shardell, Michelle
    Hicks, Gregory E.
    Miller, Ram R.
    Langenberg, Patricia
    Magaziner, Jay
    [J]. STATISTICS IN MEDICINE, 2010, 29 (14) : 1522 - 1538