MRPC: An R Package for Inference of Causal Graphs

被引:1
|
作者
Badsha, Md Bahadur [1 ,4 ]
Martin, Evan A. [2 ]
Fu, Audrey Qiuyan [1 ,3 ]
机构
[1] Univ Idaho, Inst Modeling Collaborat & Innovat, Moscow, ID 83843 USA
[2] Univ Idaho, Grad Program Bioinformat & Computat Biol, Moscow, ID 83843 USA
[3] Univ Idaho, Dept Math & Stat Sci, Inst Bioinformat & Evolutionary Studies, Moscow, ID 83843 USA
[4] Sera Prognost Inc, Salt Lake City, UT 84109 USA
关键词
causal inference; graphical models; networks; principle of Mendelian randomization; gene regulatory networks; R package; MENDELIAN RANDOMIZATION;
D O I
10.3389/fgene.2021.651812
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Understanding the causal relationships between variables is a central goal of many scientific inquiries. Causal relationships may be represented by directed edges in a graph (or equivalently, a network). In biology, for example, gene regulatory networks may be viewed as a type of causal networks, where X -> Y represents gene X regulating (i.e., being causal to) gene Y. However, existing general-purpose graph inference methods often result in a high number of false edges, whereas current causal inference methods developed for observational data in genomics can handle only limited types of causal relationships. We present MRPC (a PC algorithm with the principle of Mendelian Randomization), an R package that learns causal graphs with improved accuracy over existing methods. Our algorithm builds on the powerful PC algorithm (named after its developers Peter Spirtes and Clark Glymour), a canonical algorithm in computer science for learning directed acyclic graphs. The improvements in MRPC result in increased accuracy in identifying v-structures (i.e., X -> Y <- Z), and robustness to how the nodes are arranged in the input data. In the special case of genomic data that contain genotypes and phenotypes (e.g., gene expression) at the individual level, MRPC incorporates the principle of Mendelian randomization as constraints on edge direction to help orient the edges. MRPC allows for inference of causal graphs not only for general purposes, but also for biomedical data where multiple types of data may be input to provide evidence for causality. The R package is available on CRAN and is a free open-source software package under a GPL (>= 2) license.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] CGene: an R package for implementation of causal genetic analyses
    Lipman, Peter J.
    Lange, Christoph
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2011, 19 (12) : 1292 - 1294
  • [22] CGene: an R package for implementation of causal genetic analyses
    Peter J Lipman
    Christoph Lange
    [J]. European Journal of Human Genetics, 2011, 19 : 1292 - 1294
  • [23] Nonparametric Inference for Multivariate Data: The R Package npmv
    Ellis, Amanda R.
    Burchett, Woodrow W.
    Harrar, Solomon W.
    Bathke, Arne C.
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (04): : 1 - 18
  • [24] xcore: an R package for inference of gene expression regulators
    Maciej Migdał
    Takahiro Arakawa
    Satoshi Takizawa
    Masaaki Furuno
    Harukazu Suzuki
    Erik Arner
    Cecilia Lanny Winata
    Bogumił Kaczkowski
    [J]. BMC Bioinformatics, 24
  • [25] xcore: an R package for inference of gene expression regulators
    Migdal, Maciej
    Arakawa, Takahiro
    Takizawa, Satoshi
    Furuno, Masaaki
    Suzuki, Harukazu
    Arner, Erik
    Winata, Cecilia Lanny
    Kaczkowski, Bogumil
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)
  • [26] BayesBD: An R Package for Bayesian Inference on Image Boundaries
    Syring, Nicholas
    Li, Meng
    [J]. R JOURNAL, 2017, 9 (02): : 149 - 162
  • [27] BioRssay: an R package for analyses of bioassays and probit graphs
    Piyal Karunarathne
    Nicolas Pocquet
    Pierrick Labbé
    Pascal Milesi
    [J]. Parasites & Vectors, 15
  • [28] BioRssay: an R package for analyses of bioassays and probit graphs
    Karunarathne, Piyal
    Pocquet, Nicolas
    Labbe, Pierrick
    Milesi, Pascal
    [J]. PARASITES & VECTORS, 2022, 15 (01)
  • [29] Causal inference in cumulative risk assessment: The roles of directed acyclic graphs
    Brewer, L. Elizabeth
    Wright, J. Michael
    Rice, Glenn
    Neas, Lucas
    Teuschler, Linda
    [J]. ENVIRONMENT INTERNATIONAL, 2017, 102 : 30 - 41
  • [30] ParallelPC: An R Package for Efficient Causal Exploration in Genomic Data
    Thuc Duy Le
    Xu, Taosheng
    Liu, Lin
    Shu, Hu
    Hoang, Tao
    Li, Jiuyong
    [J]. TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING: PAKDD 2018 WORKSHOPS, 2018, 11154 : 207 - 218