Learning genetic and environmental graphical models from family data

被引:0
|
作者
Ribeiro, Adele H. [1 ]
Maria Pavan Soler, Julia [2 ]
机构
[1] Univ Sao Paulo, IME, Inst Math & Stat, Dept Comp Sci, Sao Paulo, Brazil
[2] Univ Sao Paulo, IME, Inst Math & Stat, Dept Stat, Sao Paulo, Brazil
关键词
covariance matrix decomposition; family data; polygenic mixed model; structure learning; test for zero partial correlation; QUANTITATIVE TRAITS; LINKAGE ANALYSIS; SELECTION; PHENOTYPES; COMPONENTS; POWER;
D O I
10.1002/sim.8545
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Many challenging problems in biomedical research rely on understanding how variables are associated with each other and influenced by genetic and environmental factors. Probabilistic graphical models (PGMs) are widely acknowledged as a very natural and formal language to describe relationships among variables and have been extensively used for studying complex diseases and traits. In this work, we propose methods that leverage observational Gaussian family data for learning a decomposition of undirected and directed acyclic PGMs according to the influence of genetic and environmental factors. Many structure learning algorithms are strongly based on a conditional independence test. For independent measurements of normally distributed variables, conditional independence can be tested through standard tests for zero partial correlation. In family data, the assumption of independent measurements does not hold since related individuals are correlated due to mainly genetic factors. Based on univariate polygenic linear mixed models, we propose tests that account for the familial dependence structure and allow us to assess the significance of the partial correlation due to genetic (between-family) factors and due to other factors, denoted here as environmental (within-family) factors, separately. Then, we extend standard structure learning algorithms, including the IC/PC and the really fast causal inference (RFCI) algorithms, to Gaussian family data. The algorithms learn the most likely PGM and its decomposition into two components, one explained by genetic factors and the other by environmental factors. The proposed methods are evaluated by simulation studies and applied to the Genetic Analysis Workshop 13 simulated dataset, which captures significant features of the Framingham Heart Study.
引用
收藏
页码:2403 / 2422
页数:20
相关论文
共 50 条
  • [1] Learning posisibilistic graphical models from data
    Borgelt, C
    Kruse, R
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2003, 11 (02) : 159 - 172
  • [2] Learning from imprecise data: possibilistic graphical models
    Borgelt, C
    Kruse, R
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) : 449 - 463
  • [3] Learning large-scale graphical Gaussian models from genomic data
    Schäfer, J
    Strimmer, K
    [J]. SCIENCE OF COMPLEX NETWORKS: FROM BIOLOGY TO THE INTERNET AND WWW, 2005, 776 : 263 - 276
  • [4] Structure Learning of Undirected Graphical Models for Count Data
    Hue, Nguyen Thi Kim
    Chiogna, Monica
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [5] Structure learning of undirected graphical models for count data
    Hue, Nguyen Thi Kim
    Chiogna, Monica
    [J]. Journal of Machine Learning Research, 2021, 22
  • [6] Learning Semantic Models of Data Sources Using Probabilistic Graphical Models
    Binh Vu
    Knoblock, Craig A.
    Pujara, Jay
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 1944 - 1953
  • [7] Learning Graphical Models from a Distributed Stream
    Zhang, Yu
    Tirthapura, Srikanta
    Cormode, Graham
    [J]. 2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 725 - 736
  • [8] Learning graphical models from the Glauber dynamics
    Bresler, Guy
    Gamarnik, David
    Shah, Devavrat
    [J]. 2014 52ND ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2014, : 1148 - 1155
  • [9] Learning Graphical Models From the Glauber Dynamics
    Bresler, Guy
    Gamarnik, David
    Shah, Devavrat
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (06) : 4072 - 4080
  • [10] Reproducible Learning of Gaussian Graphical Models via Graphical Lasso Multiple Data Splitting
    Kang Hu
    Danning Li
    Binghui Liu
    [J]. Acta Mathematica Sinica,English Series, 2025, (02) : 553 - 568