PATTERN-MIXTURE MODELS FOR MULTIVARIATE INCOMPLETE DATA

Cited by: 643

Author:

LITTLE, RJA

Source:

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION | 1993 / Vol. 88 / No. 421

Keywords:

EM ALGORITHM; IMPUTATION; MAXIMUM LIKELIHOOD; MISSING VALUES; MONOTONE MISSING DATA; MULTIPLE IMPUTATION; NONRESPONSE;

DOI:

10.1080/01621459.1993.10594302

Chinese Library Classification (CLC) number:

O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];

Subject classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

Consider a random sample on variables X1, ..., X(v) with some values of X(v) missing. Selection models specify the distribution of X1, ..., X(v) over respondents and nonrespondents to X(v), and the conditional distribution that X(v) is missing given X1, ..., X(v). In contrast, pattern-mixture models specify the conditional distribution of X1, ..., X(v) given that X(v) is observed or missing, respectively, and the marginal distribution of the binary indicator for whether or not X(v) is missing. For multivariate data with a general pattern of missing values, the literature has tended to adopt the selection-modeling approach (see, for example, Little and Rubin); here, pattern-mixture models are proposed for this more general problem. Pattern-mixture models are chronically underidentified; in particular, for the case of univariate nonresponse mentioned above, there are no data on the distribution of X(v) given X1, ..., X(v-1) in the stratum with X(v) missing. Thus the models require restrictions or prior information to identify the parameters. Complete-case restrictions tie unidentified parameters to their (identified) analogs in the stratum of complete cases. Alternative types of restriction tie unidentified parameters to parameters in other missing-value patterns or sets of such patterns. This large set of possible identifying restrictions yields a rich class of missing-data models. Unlike ignorable selection models, which generally require iterative methods except for special missing-data patterns, some pattern-mixture models yield explicit ML estimates for general patterns. Such models are readily amenable to Bayesian methods and form a convenient basis for multiple imputation. Some previously considered noniterative estimation methods are shown to be maximum likelihood (ML) under a pattern-mixture model. For example, Buck's method for continuous data, corrected as in Beale and Little (1975), and Brown's estimators for nonrandomly missing data are ML for pattern-mixture models with particular complete-case restrictions. Available-case analyses, where the mean and variance of X(j) are computed using all cases with X(j) observed and the correlation (or covariance) of X(j) and X(k) is computed using all cases with X(j) and X(k) observed, are also close to ML for another pattern-mixture model. Asymptotic theory for this class of estimators is outlined.
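The available-case analysis described in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the function name `available_case_summary`, the use of `None` to mark a missing value, the n - 1 divisor for the variance, and the choice to center each correlation on the pairwise-observed cases are all assumptions made for illustration.

```python
import math


def available_case_summary(rows):
    """Available-case means, variances, and correlations.

    rows: list of equal-length lists; None marks a missing value
    (a hypothetical encoding chosen for this sketch).
    Each variable needs at least two observed values, and each pair
    needs at least two jointly observed cases with nonzero spread.
    """
    p = len(rows[0])
    means, variances = [], []
    for j in range(p):
        # Use every case in which X(j) is observed.
        obs = [r[j] for r in rows if r[j] is not None]
        m = sum(obs) / len(obs)
        means.append(m)
        # Assumption: sample variance with an n - 1 divisor.
        variances.append(sum((x - m) ** 2 for x in obs) / (len(obs) - 1))

    corr = [[1.0] * p for _ in range(p)]
    for j in range(p):
        for k in range(j + 1, p):
            # Use every case in which both X(j) and X(k) are observed.
            pairs = [(r[j], r[k]) for r in rows
                     if r[j] is not None and r[k] is not None]
            mj = sum(a for a, _ in pairs) / len(pairs)
            mk = sum(b for _, b in pairs) / len(pairs)
            sjk = sum((a - mj) * (b - mk) for a, b in pairs)
            sjj = sum((a - mj) ** 2 for a, _ in pairs)
            skk = sum((b - mk) ** 2 for _, b in pairs)
            corr[j][k] = corr[k][j] = sjk / math.sqrt(sjj * skk)
    return means, variances, corr


# Toy data: X1 is missing in the last case, X2 in the third.
data = [[1.0, 2.0], [2.0, 4.0], [3.0, None], [None, 8.0]]
means, variances, corr = available_case_summary(data)
print(means, variances, corr[0][1])
```

Because each summary draws on a different subset of cases, the resulting correlation matrix is not guaranteed to be positive semidefinite, which is one reason the connection to a likelihood-based model is of interest.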

Pages: 125-134

Page count: 10

50 records in total

[1] Pattern-mixture models for multivariate incomplete data with covariates
Little, RJA
Wang, YX
[J]. BIOMETRICS, 1996, 52 (01) : 98 - 111
[2] A CLASS OF PATTERN-MIXTURE MODELS FOR NORMAL INCOMPLETE DATA
LITTLE, RJA
[J]. BIOMETRIKA, 1994, 81 (03) : 471 - 483
[3] Selection models and pattern-mixture models for incomplete data with covariates
Michiels, B
Molenberghs, G
Lipsitz, SR
[J]. BIOMETRICS, 1999, 55 (03) : 978 - 983
[4] Bayesian sensitivity analysis of incomplete data: bridging pattern-mixture and selection models
Kaciroti, Niko A.
Raghunathan, Trivellore
[J]. STATISTICS IN MEDICINE, 2014, 33 (27) : 4841 - 4857
[5] Pseudo-likelihood for combined selection and pattern-mixture models for incomplete data
Molenberghs, G
Michiels, B
Kenward, MG
[J]. BIOMETRICAL JOURNAL, 1998, 40 (05) : 557 - 572
[6] Monotone missing data and pattern-mixture models
Molenberghs, G
Michiels, B
Kenward, MG
Diggle, PJ
[J]. STATISTICA NEERLANDICA, 1998, 52 (02) : 153 - 161
[7] A pattern-mixture odds ratio model for incomplete categorical data.
Michiels, B
Molenberghs, G
Lipsitz, SR
[J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1999, 28 (12) : 2843 - 2869
[8] Strategies to fit pattern-mixture models
Thijs, H
Molenberghs, G
Michiels, B
Verbeke, G
Curran, D
[J]. BIOSTATISTICS, 2002, 3 (02) : 245 - 265
[9] Pattern-Mixture Models with Incomplete Informative Cluster Size: Application to a Repeated Pregnancy Study
Chaurasia, Ashok
Liu, Danping
Albert, Paul S.
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2018, 67 (01) : 255 - 273
[10] Pattern-mixture models for analyzing normal outcome data with proxy respondents
Shardell, Michelle
Hicks, Gregory E.
Miller, Ram R.
Langenberg, Patricia
Magaziner, Jay
[J]. STATISTICS IN MEDICINE, 2010, 29 (14) : 1522 - 1538
