Analyzing incomplete political science data: An alternative algorithm for multiple imputation

被引:1024
|
作者
King, G [1 ]
Honaker, J
Joseph, A
Scheve, K
机构
[1] Harvard Univ, Ctr Basic Res Social Sci, World Hlth Org, Global Programme Evidence Hlth Policy, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Basic Res Social Sci, Dept Govt, Cambridge, MA 02138 USA
[3] Yale Univ, Inst Social & Policy Studies, Dept Polit Sci, New Haven, CT 06520 USA
关键词
D O I
10.1017/S0003055401000235
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
We propose a remedy for the discrepancy between the way political scientists analyze data with missing values and the recommendations of the statistics community. Methodologists and statisticians agree that "multiple imputation" is a superior approach to the problem of missing data scattered through one's explanatory and dependent variables than the methods currently used in applied data analysis. The discrepancy occurs because the computational algorithms used to apply the best multiple imputation models have been slow, difficult to implement, impossible to run with existing commercial statistical packages, and have demanded considerable expertise. We adapt an algorithm and use it to implement a general-purpose, multiple imputation model for missing data. This algorithm is considerably faster and easier to use than the leading method recommended in the statistics literature. We also quantify the risks of current missing data practices, illustrate how to use the new procedure, and evaluate this alternative through simulated data as well as actual empirical examples. Finally, we offer easy-to-use software that implements all methods discussed.
引用
收藏
页码:49 / 69
页数:21
相关论文
共 50 条
  • [41] A Noise-Aware Multiple Imputation Algorithm for Missing Data
    Li, Fangfang
    Sun, Hui
    Gu, Yu
    Yu, Ge
    [J]. MATHEMATICS, 2023, 11 (01)
  • [42] A Novel Nonparametric Multiple Imputation Algorithm for Estimating Missing Data
    Gheyas, Iffat A.
    Smith, Leslie S.
    [J]. WORLD CONGRESS ON ENGINEERING 2009, VOLS I AND II, 2009, : 1281 - 1286
  • [43] Multivariable data imputation for the analysis of incomplete credit data
    Lan, Qiujun
    Xu, Xuqing
    Ma, Haojie
    Li, Gang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 141
  • [44] Best Fit Missing Value Imputation (BFMVI) Algorithm for Incomplete Data in the Internet of Things
    Agbo, Benjamin
    Qin, Yongrui
    Hill, Richard
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS), 2020, : 130 - 137
  • [45] Erratum to: Assessment of predictive performance in incomplete data by combining internal validation and multiple imputation
    Simone Wahl
    Anne-Laure Boulesteix
    Astrid Zierer
    Barbara Thorand
    Mark A. van de Wiel
    [J]. BMC Medical Research Methodology, 16
  • [46] Estimating the Effect of Multiple Imputation on Incomplete Longitudinal Data with Application to a Randomized Clinical Study
    Fong, Daniel Y. T.
    Rai, Shesh N.
    Lam, Karen S. L.
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2013, 23 (05) : 1004 - 1022
  • [47] Multiple imputation and test-wise deletion for causal discovery with incomplete cohort data
    Witte, Janine
    Foraita, Ronja
    Didelez, Vanessa
    [J]. STATISTICS IN MEDICINE, 2022, 41 (23) : 4716 - 4743
  • [48] Dual imputation model for incomplete longitudinal data
    Jolani, Shahab
    Frank, Laurence E.
    van Buuren, Stef
    [J]. BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2014, 67 (02): : 197 - 212
  • [49] ANALYZING INCOMPLETE COUNT DATA
    YANNAROS, N
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1993, 42 (02) : 181 - 187
  • [50] An imputation strategy for incomplete longitudinal ordinal data
    Demirtas, Hakan
    Hedeker, Donald
    [J]. STATISTICS IN MEDICINE, 2008, 27 (20) : 4086 - 4093