Missing Data Assumptions

被引:20
|
作者
Little, Roderick J. [1 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48105 USA
关键词
missing at random; ignorable missing data; Bayesian and frequentist inference; incomplete data; informative missingness; likelihood inference; missing-data mechanism; partially missing at random; MAXIMUM-LIKELIHOOD ESTIMATION; DROP-OUT; LONGITUDINAL DATA; IMPUTATION; REGRESSION; INFERENCE; MODELS;
D O I
10.1146/annurev-statistics-040720-031104
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
I review assumptions about the missing-data mechanisms that underlie methods for the statistical analysis of data with missing values. I describe Rubin's original definition of missing at random (MAR), its motivation and criticisms, and his sufficient conditions for ignoring the missingness mechanism for likelihood-based, Bayesian, and frequentist inference. Related definitions, including missing completely at random, always MAR, always missing completely at random, and partially MAR, are also covered. I present a formal argument for weakening Rubin's sufficient conditions for frequentist maximum likelihood inference with precision based on the observed information. Some simple examples of MAR are described, together with an example where the missingness mechanism can be ignored even though MAR does not hold. Alternative approaches to statistical inference based on the likelihood function are reviewed, along with non-likelihood frequentist approaches, including weighted generalized estimating equations. Connections with the causal inference literature are also discussed. Finally, alternatives to Rubin's MAR definition are discussed, including informative missingness, informative censoring, and coarsening at random. The intent is to provide a relatively nontechnical discussion, although some of the underlying issues are challenging and touch on fundamental questions of statistical inference.
引用
收藏
页码:89 / 107
页数:19
相关论文
共 50 条
  • [21] Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study
    van Kuijk, Sander M. J.
    Viechtbauer, Wolfgang
    Peeters, Louis L.
    Smits, Luc
    [J]. EPIDEMIOLOGY BIOSTATISTICS AND PUBLIC HEALTH, 2016, 13 (01)
  • [22] Handling Missing Covariates in Conditional Mixture Models Under Missing at Random Assumptions
    Sterba, Sonya K.
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 2014, 49 (06) : 614 - 632
  • [23] Race/ethnicity and breast cancer estrogen receptor status: impact of class, missing data, and modeling assumptions
    Nancy Krieger
    Jarvis T. Chen
    James H. Ware
    Afamia Kaddour
    [J]. Cancer Causes & Control, 2008, 19 : 1305 - 1318
  • [24] Race/ethnicity and breast cancer estrogen receptor status: impact of class, missing data, and modeling assumptions
    Krieger, Nancy
    Chen, Jarvis T.
    Ware, James H.
    Kaddour, Afamia
    [J]. CANCER CAUSES & CONTROL, 2008, 19 (10) : 1305 - 1318
  • [25] An approach to evaluating the missing data assumptions of the chain and post-stratification equating methods for the NEAT design
    Holland, Paul W.
    Sinharay, Sandip
    von Davier, Alina A.
    Han, Ning
    [J]. JOURNAL OF EDUCATIONAL MEASUREMENT, 2008, 45 (01) : 17 - 43
  • [26] Intracortical circuitry: One of psychiatry's missing assumptions
    Casanova, MF
    [J]. EUROPEAN ARCHIVES OF PSYCHIATRY AND CLINICAL NEUROSCIENCE, 2004, 254 (03) : 148 - 151
  • [27] Mining Missing Assumptions from Counter-Examples
    Plassan, Guillaume
    Morin-Allory, Katell
    Borrione, Dominique
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2019, 18 (01)
  • [28] Intracortical circuitry: One of psychiatry’s missing assumptions
    Manuel F. Casanova
    [J]. European Archives of Psychiatry and Clinical Neuroscience, 2004, 254 : 148 - 151
  • [29] Semiparametric inference of competing risks data with additive hazards and missing cause of failure under MCAR or MAR assumptions
    Bordes, Laurent
    Dauxois, Jean-Yves
    Joly, Pierre
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2014, 8 : 41 - 95
  • [30] Assumptions and analysis planning in studies with missing data in multiple variables: moving beyond the MCAR/MAR/MNAR classification
    Lee, Katherine J.
    Carlin, John B.
    Simpson, Julie A.
    Moreno-Betancur, Margarita
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2023, 52 (04) : 1268 - 1275