Causal discovery of gene regulation with incomplete data

被引:14
|
作者
Foraita, Ronja [1 ]
Friemel, Juliane [1 ,2 ]
Guenther, Kathrin [1 ]
Behrens, Thomas [3 ]
Bullerdiek, Joern [4 ]
Nimzyk, Rolf [4 ]
Ahrens, Wolfgang [1 ,5 ]
Didelez, Vanessa [1 ,5 ]
机构
[1] Leibniz Inst Prevent Res & Epidemiol BIPS, Achterstr 30, D-28359 Bremen, Germany
[2] Univ & Univ Hosp Zurich, Zurich, Switzerland
[3] Ruhr Univ Bochum, Bochum, Germany
[4] Ctr Human Genet, Bremen, Germany
[5] Univ Bremen, Bremen, Germany
关键词
Gene expression; Graphical models; Head-and-neck squamous cell carcinoma; HMGA2; gene; Human papilloma-virus; PC algorithm; Protein 53 signalling pathway; HMGA2 PROTEIN EXPRESSION; SQUAMOUS-CELL CARCINOMAS; DIRECTED ACYCLIC GRAPHS; EQUIVALENCE CLASSES; MAXIMUM-LIKELIHOOD; SEROUS CARCINOMA; HEAD; P53; INFERENCE; CANCER;
D O I
10.1111/rssa.12565
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Causal discovery algorithms aim to identify causal relations from observational data and have become a popular tool for analysing genetic regulatory systems. In this work, we applied causal discovery to obtain novel insights into the genetic regulation underlying head-and-neck squamous cell carcinoma. Some methodological challenges needed to be resolved first. The available data contained missing values, but most approaches to causal discovery require complete data. Hence, we propose a new procedure combining constraint-based causal discovery with multiple imputation. This is based on using Rubin's rules for pooling tests of conditional independence. A second challenge was that causal discovery relies on strong assumptions and can be rather unstable. To assess the robustness of our results, we supplemented our investigation with sensitivity analyses, including a non-parametric bootstrap to quantify the variability of the estimated causal structures. We applied these methods to investigate how the high mobility group AT-Hook 2 (HMGA2) gene is incorporated in the protein 53 signalling pathway playing an important role in head-and-neck squamous cell carcinoma. Our results were quite stable and found direct associations between HMGA2 and other relevant proteins, but they did not provide clear support for the claim that HMGA2 itself is a key regulator gene.
引用
收藏
页码:1747 / 1775
页数:29
相关论文
共 50 条
  • [1] Multiple imputation and test-wise deletion for causal discovery with incomplete cohort data
    Witte, Janine
    Foraita, Ronja
    Didelez, Vanessa
    STATISTICS IN MEDICINE, 2022, 41 (23) : 4716 - 4743
  • [2] Discovery of gene-regulation pathways using local causal search
    Yoo, CW
    Cooper, GF
    AMIA 2002 SYMPOSIUM, PROCEEDINGS: BIOMEDICAL INFORMATICS: ONE DISCIPLINE, 2002, : 914 - 918
  • [3] An Evaluation of Active Learning Causal Discovery Methods for Reverse-Engineering Local Causal Pathways of Gene Regulation
    Sisi Ma
    Patrick Kemmeren
    Constantin F. Aliferis
    Alexander Statnikov
    Scientific Reports, 6
  • [4] An Evaluation of Active Learning Causal Discovery Methods for Reverse-Engineering Local Causal Pathways of Gene Regulation
    Ma, Sisi
    Kemmeren, Patrick
    Aliferis, Constantin F.
    Statnikov, Alexander
    SCIENTIFIC REPORTS, 2016, 6
  • [5] Software application profile: tpc and micd-R packages for causal discovery with incomplete cohort data
    Andrews, Ryan M.
    Bang, Christine W.
    Didelez, Vanessa
    Witte, Janine
    Foraita, Ronja
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2024, 53 (05)
  • [6] Discovery of incomplete knowledge in electrocardiographic data
    Azuaje, FJ
    Dubitzky, W
    Lopes, P
    Black, ND
    Adamson, K
    Wu, X
    White, JA
    PROCEEDING OF THE THIRD INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND EXPERT SYSTEMS IN MEDICINE AND HEALTHCARE, 1998, : 286 - 294
  • [7] Bivariate Causal Discovery and Its Applications to Gene Expression and Imaging Data Analysis
    Jiao, Rong
    Lin, Nan
    Hu, Zixin
    Bennett, David A.
    Jin, Li
    Xiong, Momiao
    FRONTIERS IN GENETICS, 2018, 9
  • [8] "Super Gene Set" Causal Relationship Discovery from Functional Genomics Data
    Yue, Zongliang
    Neylon, Michael T.
    Thanh Nguyen
    Ratliff, Timothy
    Chen, Jake Y.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (06) : 1991 - 1998
  • [9] Causal discovery with Point of Sales data
    Gmeiner, Peter
    3RD INTERNATIONAL CONFERENCE ON ADVANCED RESEARCH METHODS AND ANALYTICS (CARMA 2020), 2020, : 339 - 339
  • [10] Causal discovery on high dimensional data
    Zhifeng Hao
    Hao Zhang
    Ruichu Cai
    Wen Wen
    Zhihao Li
    Applied Intelligence, 2015, 42 : 594 - 607