Software application profile: tpc and micd-R packages for causal discovery with incomplete cohort data

被引:0
|
作者
Andrews, Ryan M. [1 ,2 ]
Bang, Christine W. [2 ,3 ]
Didelez, Vanessa [2 ,3 ]
Witte, Janine [2 ]
Foraita, Ronja [2 ]
机构
[1] Boston Univ, Dept Epidemiol, Boston, MA USA
[2] Leibniz Inst Prevent Res & Epidemiol BIPS, Dept Biometry & Data Management, Achterstr 30, D-28359 Bremen, Germany
[3] Univ Bremen, Dept Math & Comp Sci, Bremen, Germany
基金
美国国家卫生研究院;
关键词
Causal discovery; R; cohort studies; missing data; longitudinal data; BAYESIAN NETWORKS; GRAPHICAL MODELS; INFERENCE; OBESITY;
D O I
10.1093/ije/dyae113
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Motivation The Peter Clark (PC) algorithm is a popular causal discovery method to learn causal graphs in a data-driven way. Until recently, existing PC algorithm implementations in R had important limitations regarding missing values, temporal structure or mixed measurement scales (categorical/continuous), which are all common features of cohort data. The new R packages presented here, micd and tpc, fill these gaps.Implementation micd and tpc packages are R packages.General features The micd package provides add-on functionality for dealing with missing values to the existing pcalg R package, including methods for multiple imputations relying on the Missing At Random assumption. Also, micd allows for mixed measurement scales assuming conditional Gaussianity. The tpc package efficiently exploits temporal information in a way that results in a more informative output that is less prone to statistical errors.Availability The tpc and micd packages are freely available on the Comprehensive R Archive Network (CRAN). Their source code is also available on GitHub (https://github.com/bips-hb/micd; https://github.com/bips-hb/tpc).
引用
收藏
页数:5
相关论文
共 3 条
  • [1] Multiple imputation and test-wise deletion for causal discovery with incomplete cohort data
    Witte, Janine
    Foraita, Ronja
    Didelez, Vanessa
    STATISTICS IN MEDICINE, 2022, 41 (23) : 4716 - 4743
  • [2] Software Application Profile: ShinyDataSHIELD-an R Shiny application to perform federated non-disclosive data analysis in multicohort studies
    Escriba-Montagut, Xavier
    Marcon, Yannick
    Avraam, Demetris
    Banerjee, Soumya
    Bishop, Tom R. P.
    Burton, Paul
    Gonzalez, Juan R.
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2023, 52 (01) : 315 - 320
  • [3] Software Application Profile: CaseCohortCoxSurvival-an R package for case-cohort inference for relative hazard and pure risk under the Cox model
    Etievant, Lola
    Gail, Mitchell H.
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2025, 54 (02)