Simulation-Based Performance Evaluation of Missing Data Handling in Network Analysis

被引:0
|
作者
Nehler, Kai Jannik [1 ,2 ]
Schultze, Martin [1 ]
机构
[1] Goethe Univ Frankfurt, Dept Psychol, Frankfurt, Germany
[2] Goethe Univ Frankfurt, Dept Psychol, Theodor W Adorno Pl 6, D-60323 Frankfurt, Germany
关键词
Network analysis; missing values; simulation study; graphical lasso regularization; EM algorithms; MAXIMUM-LIKELIHOOD; INFORMATION; MODELS;
D O I
10.1080/00273171.2023.2283638
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Network analysis has gained popularity as an approach to investigate psychological constructs. However, there are currently no guidelines for applied researchers when encountering missing values. In this simulation study, we compared the performance of a two-step EM algorithm with separated steps for missing handling and regularization, a combined direct EM algorithm, and pairwise deletion. We investigated conditions with varying network sizes, numbers of observations, missing data mechanisms, and percentages of missing values. These approaches are evaluated with regard to recovering population networks in terms of loss in the precision matrix, edge set identification and network statistics. The simulation showed adequate performance only in conditions with large samples (n >= 500) or small networks (p = 10). Comparing the missing data approaches, the direct EM appears to be more sensitive and superior in nearly all chosen conditions. The two-step EM yields better results when the ratio of n/p is very large - being less sensitive but more specific. Pairwise deletion failed to converge across numerous conditions and yielded inferior results overall. Overall, direct EM is recommended in most cases, as it is able to mitigate the impact of missing data quite well, while modifications to two-step EM could improve its performance.
引用
收藏
页码:461 / 481
页数:21
相关论文
共 50 条
  • [21] Characterization of missing values in untargeted MS-based metabolomics data and evaluation of missing data handling strategies
    Kieu Trinh Do
    Wahl, Simone
    Raffler, Johannes
    Molnos, Sophie
    Laimighofer, Michael
    Adamski, Jerzy
    Suhre, Karsten
    Strauch, Konstantin
    Peters, Annette
    Gieger, Christian
    Langenberg, Claudia
    Stewart, Isobel D.
    Theis, Fabian J.
    Grallert, Harald
    Kastenmueller, Gabi
    Krumsiek, Jan
    METABOLOMICS, 2018, 14 (10)
  • [22] Characterization of missing values in untargeted MS-based metabolomics data and evaluation of missing data handling strategies
    Kieu Trinh Do
    Simone Wahl
    Johannes Raffler
    Sophie Molnos
    Michael Laimighofer
    Jerzy Adamski
    Karsten Suhre
    Konstantin Strauch
    Annette Peters
    Christian Gieger
    Claudia Langenberg
    Isobel D. Stewart
    Fabian J. Theis
    Harald Grallert
    Gabi Kastenmüller
    Jan Krumsiek
    Metabolomics, 2018, 14
  • [23] Design of Simulation-based Network Vulnerability Analysis System
    You, Yong-Jun
    Lee, Jang-Se
    Chi, Sung-Do
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (08): : 3551 - 3559
  • [24] Handling missing continuous outcome data in a Bayesian network meta-analysis
    Azzolina, Danila
    Baldi, Ileana
    Minto, Clara
    Bottigliengo, Daniele
    Lorenzoni, Giulia
    Gregori, Dario
    EPIDEMIOLOGY BIOSTATISTICS AND PUBLIC HEALTH, 2018, 15 (04):
  • [25] Handling missing or incomplete data in a Bayesian network meta-analysis framework
    Azzolina, Danila
    Baldi, Ileana
    Berchialla, Paola
    Minto, Clara
    Gregori, Dario
    TRIALS, 2017, 18
  • [26] Simulation-Based Evaluation of Robot and Wireless Sensor Network Interaction
    Sebestyen-Pal, Gheorghe
    Rus, George-Daniel
    2012 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS, THETA 18TH EDITION, 2012, : 210 - 215
  • [27] Simulation-based safety evaluation model integrated with network schedule
    Wang, WC
    Liu, JJ
    Chou, SC
    AUTOMATION IN CONSTRUCTION, 2006, 15 (03) : 341 - 354
  • [28] Neural Network Simulation-Based Research on Highway Safety Evaluation
    Cheng Jia
    Guo Hongxia
    Li Qingyao
    Du Wen
    INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL 1, PROCEEDINGS, 2008, : 209 - +
  • [29] Neural Network Approximation of Simulation-based IDS Fitness Evaluation
    Alshahrani, Abdulmonem
    Clark, John A.
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING, CSE, 2022, : 81 - 89
  • [30] A Simulation-Based Closed Queueing Network Approximation of Semiconductor Automated Material Handling Systems
    Govind, Nirmal
    Roeder, Theresa M.
    Schruben, Lee W.
    IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2011, 24 (01) : 5 - 13