Multiple imputation for missing edge data: A predictive evaluation method with application to Add Health

被引:41
|
作者
Wang, Cheng [1 ]
Butts, Carter T. [2 ,3 ]
Hipp, John R. [2 ,4 ]
Jose, Rupa [5 ]
Lakon, Cynthia M. [6 ]
机构
[1] Univ Notre Dame, Dept Sociol, 810 Flanner Hall, Notre Dame, IN 46556 USA
[2] Univ Calif Irvine, Dept Sociol, Irvine, CA USA
[3] Univ Calif Irvine, Dept Stat, Irvine, CA USA
[4] Univ Calif Irvine, Dept Criminol Law & Soc, Irvine, CA USA
[5] Univ Calif Irvine, Dept Psychol & Social Behav, Irvine, CA USA
[6] Univ Calif Irvine, Program Publ Hlth, Irvine, CA USA
关键词
Missing edge data; ERGM-based imputation; Held-Out Predictive Evaluation (HOPE); NETWORK DATA; LONGITUDINAL NETWORK; SOCIAL NETWORKS; MODELS; INFERENCE;
D O I
10.1016/j.socnet.2015.12.003
中图分类号
Q98 [人类学];
学科分类号
030303 ;
摘要
Recent developments have made model-based imputation of network data feasible in principle, but the extant literature provides few practical examples of its use. In this paper, we consider 14 schools from the widely used In-School Survey of Add Health (Harris et al., 2009), applying an ERGM-based estimation and simulation approach to impute the network missing data for each school. Add Health's complex study design leads to multiple types of missingness, and we introduce practical techniques for handing each. We also develop a cross-validation based method - Held-Out Predictive Evaluation (HOPE) - for assessing this approach. Our results suggest that ERGM-based imputation of edge variables is a viable approach to the analysis of complex studies such as Add Health, provided that care is used in understanding and accounting for the study design. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:89 / 98
页数:10
相关论文
共 50 条
  • [41] FIGAN: A Missing Industrial Data Imputation Method Customized for Soft Sensor Application
    Yao, Zoujing
    Zhao, Chunhui
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (04) : 3712 - 3722
  • [42] Application of Sequential Regression Multivariate Imputation Method on Multivariate Normal Missing Data
    Nurzaman
    Siswantining, Titin
    Soemartojo, Saskya Mary
    Sarwinda, Devvi
    2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,
  • [43] A MISSING DATA IMPUTATION METHOD WITH DISTANCE FUNCTION
    Jea, Kuen-Fang
    Hsu, Chin-Wei
    Tang, Li-You
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2018, : 450 - 455
  • [44] Multiple Imputation for Missing Values in Homicide Incident Data: An Evaluation Using Unique Test Data
    Roberts, John M., Jr.
    Roberts, Aki
    Wadsworth, Tim
    HOMICIDE STUDIES, 2018, 22 (04) : 391 - 409
  • [46] From Predictive Methods to Missing Data Imputation: An Optimization Approach
    Bertsimas, Dimitris
    Pawlowski, Colin
    Zhuo, Ying Daisy
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18
  • [47] Multiple imputation: a mature approach to dealing with missing data
    Chevret, S.
    Seaman, S.
    Resche-Rigon, M.
    INTENSIVE CARE MEDICINE, 2015, 41 (02) : 348 - 350
  • [48] MULTIPLE IMPUTATION: A POSSIBLE SOLUTION TO THE PROBLEM OF MISSING DATA
    Sergeant, J. C.
    ANNALS OF THE RHEUMATIC DISEASES, 2016, 75 : 45 - 46
  • [49] Multiple Imputation for Missing Data Using Genetic Programming
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    GECCO'15: PROCEEDINGS OF THE 2015 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2015, : 583 - 590
  • [50] Multiple Imputation for Missing Data in Life Cycle Inventory
    Liu, Yu
    Gong, Xianzheng
    Wang, ZhiHong
    Liu, Wei
    Nie, Zuoren
    MATERIALS RESEARCH, PTS 1 AND 2, 2009, 610-613 : 21 - 27