The effect of sample size and missingness on inference with missing data

被引:0
|
作者
Morimoto, Julian [1 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
关键词
Incomplete data; sample size and missing data mechanism; partial likelihood; asymptotic inference with missing data; MULTIPLE IMPUTATION; LIKELIHOOD;
D O I
10.1080/03610926.2022.2152287
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
When are inferences (whether Direct-Likelihood, Bayesian, or Frequentist) obtained from partial data valid? This article answers this question by offering a new asymptotic theory about inference with missing data that is more general than existing theories. It proves that as the sample size increases and the extent of missingness decreases, the average-loglikelihood function generated by partial data and that ignores the missingness mechanism will converge in probability to that which would have been generated by complete data; and if the data are Missing at Random, this convergence depends only on sample size. Thus, inferences from partial data, such as posterior modes, confidence intervals, likelihood ratios, test statistics, and indeed, all quantities or features derived from the partial-data loglikelihood function, will be consistently estimated. Additionally, the missing data mechanism has asymptotically no effect on parameter estimation and hypothesis testing if the data are Missing at Random. This adds to previous research which has only proved the consistency and asymptotic normality of the posterior mode. Practical implications are discussed, and the theory is illustrated through simulation using a previous study of International Human Rights Law.
引用
收藏
页码:3292 / 3311
页数:20
相关论文
共 50 条
  • [41] Sample Size Determination for Individual Bioequivalence Inference
    Chiang, Chieh
    Hsiao, Chin-Fu
    Liu, Jen-Pei
    PLOS ONE, 2014, 9 (10):
  • [42] Sample rotation theory with missing data
    邹国华
    冯士雍
    秦怀振
    Science China Mathematics, 2002, (01) : 42 - 63
  • [43] Bayesian nonparametric for causal inference and missing data
    Chen, Li-Pang
    BIOMETRICS, 2024, 80 (01)
  • [44] Inference of missing data in photovoltaic monitoring datasets
    Koubli, Eleni
    Palmer, Diane
    Rowley, Paul
    Gottschalg, Ralph
    IET RENEWABLE POWER GENERATION, 2016, 10 (04) : 434 - 439
  • [45] Haplotype and missing data inference in nuclear families
    Lin, S
    Chakravarti, A
    Cutler, DJ
    GENOME RESEARCH, 2004, 14 (08) : 1624 - 1632
  • [46] On sample size determination and inference in work sampling
    Tryfos, Peter
    IIE Transactions (Institute of Industrial Engineers), 1988, 20 (03): : 255 - 262
  • [47] IDENTIFICATION AND INFERENCE ON REGRESSIONS WITH MISSING COVARIATE DATA
    Aucejo, Esteban M.
    Bugni, Federico A.
    Hotz, V. Joseph
    ECONOMETRIC THEORY, 2017, 33 (01) : 196 - 241
  • [48] Inference of stochastic time series with missing data
    Lee, Sangwon
    Periwal, Vipul
    Jo, Junghyo
    PHYSICAL REVIEW E, 2021, 104 (02)
  • [49] A bayesian framework to address missing not at random data in longitudinal studies with multiple types of missingness
    Mason, Alexina
    Grieve, Richard
    Gordon, Anthony C.
    Russell, James A.
    Walker, Simon
    Paton, Nick
    Carpenter, James
    Gomes, Manuel
    TRIALS, 2017, 18
  • [50] Evaluating the Performance of Bayesian Approach for Imputing Missing Data under different Missingness Mechanism
    Sanju, Vinay
    Kumar, Vinay
    Kumari, Pavitra
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2024, 86 (02): : 713 - 723