Analyzing incomplete political science data: An alternative algorithm for multiple imputation

被引:1024
|
作者
King, G [1 ]
Honaker, J
Joseph, A
Scheve, K
机构
[1] Harvard Univ, Ctr Basic Res Social Sci, World Hlth Org, Global Programme Evidence Hlth Policy, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Basic Res Social Sci, Dept Govt, Cambridge, MA 02138 USA
[3] Yale Univ, Inst Social & Policy Studies, Dept Polit Sci, New Haven, CT 06520 USA
关键词
D O I
10.1017/S0003055401000235
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
We propose a remedy for the discrepancy between the way political scientists analyze data with missing values and the recommendations of the statistics community. Methodologists and statisticians agree that "multiple imputation" is a superior approach to the problem of missing data scattered through one's explanatory and dependent variables than the methods currently used in applied data analysis. The discrepancy occurs because the computational algorithms used to apply the best multiple imputation models have been slow, difficult to implement, impossible to run with existing commercial statistical packages, and have demanded considerable expertise. We adapt an algorithm and use it to implement a general-purpose, multiple imputation model for missing data. This algorithm is considerably faster and easier to use than the leading method recommended in the statistics literature. We also quantify the risks of current missing data practices, illustrate how to use the new procedure, and evaluate this alternative through simulated data as well as actual empirical examples. Finally, we offer easy-to-use software that implements all methods discussed.
引用
收藏
页码:49 / 69
页数:21
相关论文
共 50 条
  • [11] Multiple imputation for analysis of incomplete data in distributed health data networks
    Changgee Chang
    Yi Deng
    Xiaoqian Jiang
    Qi Long
    [J]. Nature Communications, 11
  • [12] Multiple imputation for analysis of incomplete data in distributed health data networks
    Chang, Changgee
    Deng, Yi
    Jiang, Xiaoqian
    Long, Qi
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [13] Multiple Imputation by Generative Adversarial Networks for Classification with Incomplete Data
    Bao Ngoc Vi
    Dinh Tan Nguyen
    Cao Truong Tran
    Huu Phuc Ngo
    Chi Cong Nguyen
    Hai-Hong Phan
    [J]. 2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 162 - 167
  • [14] On using multiple imputation for exploratory factor analysis of incomplete data
    Nassiri, Vahid
    Lovik, Aniko
    Molenberghs, Geert
    Verbeke, Geert
    [J]. BEHAVIOR RESEARCH METHODS, 2018, 50 (02) : 501 - 517
  • [15] On using multiple imputation for exploratory factor analysis of incomplete data
    Vahid Nassiri
    Anikó Lovik
    Geert Molenberghs
    Geert Verbeke
    [J]. Behavior Research Methods, 2018, 50 : 501 - 517
  • [16] Using multiple imputation for analysis of incomplete data in clinical research
    McCleary, L
    [J]. NURSING RESEARCH, 2002, 51 (05) : 339 - 343
  • [17] Analysis of incomplete longitudinal binary data using multiple imputation
    Li, Xiaoming
    Mehrotra, Devan V.
    Barnard, John
    [J]. STATISTICS IN MEDICINE, 2006, 25 (12) : 2107 - 2124
  • [18] A comparison of multiple imputation methods for incomplete longitudinal binary data
    Yamaguchi, Yusuke
    Misumi, Toshihiro
    Maruo, Kazushi
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2018, 28 (04) : 645 - 667
  • [19] Multiple imputation of incomplete zero-inflated count data
    Kleinke, Kristian
    Reinecke, Jost
    [J]. STATISTICA NEERLANDICA, 2013, 67 (03) : 311 - 336
  • [20] Imputation Methods for Incomplete Data
    Umathe, Vaishali H.
    Chaudhary, Gauri
    [J]. 2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,