Analyzing incomplete political science data: An alternative algorithm for multiple imputation

被引:1024
|
作者
King, G [1 ]
Honaker, J
Joseph, A
Scheve, K
机构
[1] Harvard Univ, Ctr Basic Res Social Sci, World Hlth Org, Global Programme Evidence Hlth Policy, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Basic Res Social Sci, Dept Govt, Cambridge, MA 02138 USA
[3] Yale Univ, Inst Social & Policy Studies, Dept Polit Sci, New Haven, CT 06520 USA
关键词
D O I
10.1017/S0003055401000235
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
We propose a remedy for the discrepancy between the way political scientists analyze data with missing values and the recommendations of the statistics community. Methodologists and statisticians agree that "multiple imputation" is a superior approach to the problem of missing data scattered through one's explanatory and dependent variables than the methods currently used in applied data analysis. The discrepancy occurs because the computational algorithms used to apply the best multiple imputation models have been slow, difficult to implement, impossible to run with existing commercial statistical packages, and have demanded considerable expertise. We adapt an algorithm and use it to implement a general-purpose, multiple imputation model for missing data. This algorithm is considerably faster and easier to use than the leading method recommended in the statistics literature. We also quantify the risks of current missing data practices, illustrate how to use the new procedure, and evaluate this alternative through simulated data as well as actual empirical examples. Finally, we offer easy-to-use software that implements all methods discussed.
引用
收藏
页码:49 / 69
页数:21
相关论文
共 50 条
  • [21] Multiple imputation combined with bootstrapping for analysing incomplete cost and effect data
    Heymans, M. W.
    De Bruyne, M. C.
    Van Buuren, S.
    [J]. EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2006, 21 : 57 - 57
  • [22] Multiple imputation of incomplete multilevel data using Heckman selection models
    Munoz, Johanna
    Efthimiou, Orestis
    Audigier, Vincent
    de Jong, Valentijn M. T.
    Debray, Thomas P. A.
    [J]. STATISTICS IN MEDICINE, 2024, 43 (03) : 514 - 533
  • [23] MULTIPLE IMPUTATION OF INCOMPLETE CATEGORICAL DATA USING LATENT CLASS ANALYSIS
    Vermunt, Jeroen K.
    van Ginkel, Joost R.
    van der Ark, L. Andries
    Sijtsma, Klaas
    [J]. SOCIOLOGICAL METHODOLOGY, VOL 38, 2008, 38 : 369 - 397
  • [24] Multiple Imputation for Incomplete Traffic Accident Data Using Chained Equations
    Li, Linchao
    Zhang, Jian
    Wang, Yonggang
    Ran, Bin
    [J]. 2017 IEEE 20TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2017,
  • [25] Recursive partitioning on incomplete data using surrogate decisions and multiple imputation
    Hapfelmeier, A.
    Hothorn, T.
    Ulm, K.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) : 1552 - 1565
  • [26] Multiple imputation confidence intervals for the mean of the discrete distributions for incomplete data
    Lee, Chung-Han
    Wang, Hsiuying
    [J]. STATISTICS IN MEDICINE, 2022, 41 (07) : 1172 - 1190
  • [27] Multiple imputation and analysis for high-dimensional incomplete proteomics data
    Yin, Xiaoyan
    Levy, Daniel
    Willinger, Christine
    Adourian, Aram
    Larson, Martin G.
    [J]. STATISTICS IN MEDICINE, 2016, 35 (08) : 1315 - 1326
  • [28] Multiple imputation for an incomplete covariate that is a ratio
    Morris, Tim P.
    White, Ian R.
    Royston, Patrick
    Seaman, Shaun R.
    Wood, Angela M.
    [J]. STATISTICS IN MEDICINE, 2014, 33 (01) : 88 - 104
  • [29] Application of Multiple Imputation Method in Analyzing Data with Missing Continuous Covariates
    Tamar, S. Ghasemizadeh
    Ganjali, M.
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2008, 21 (04) : 659 - 664
  • [30] Incomplete big data imputation mining algorithm based on BP neural network
    Liu, Yutang
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 4457 - 4466