Causal discovery of gene regulation with incomplete data

被引:14
|
作者
Foraita, Ronja [1 ]
Friemel, Juliane [1 ,2 ]
Guenther, Kathrin [1 ]
Behrens, Thomas [3 ]
Bullerdiek, Joern [4 ]
Nimzyk, Rolf [4 ]
Ahrens, Wolfgang [1 ,5 ]
Didelez, Vanessa [1 ,5 ]
机构
[1] Leibniz Inst Prevent Res & Epidemiol BIPS, Achterstr 30, D-28359 Bremen, Germany
[2] Univ & Univ Hosp Zurich, Zurich, Switzerland
[3] Ruhr Univ Bochum, Bochum, Germany
[4] Ctr Human Genet, Bremen, Germany
[5] Univ Bremen, Bremen, Germany
关键词
Gene expression; Graphical models; Head-and-neck squamous cell carcinoma; HMGA2; gene; Human papilloma-virus; PC algorithm; Protein 53 signalling pathway; HMGA2 PROTEIN EXPRESSION; SQUAMOUS-CELL CARCINOMAS; DIRECTED ACYCLIC GRAPHS; EQUIVALENCE CLASSES; MAXIMUM-LIKELIHOOD; SEROUS CARCINOMA; HEAD; P53; INFERENCE; CANCER;
D O I
10.1111/rssa.12565
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Causal discovery algorithms aim to identify causal relations from observational data and have become a popular tool for analysing genetic regulatory systems. In this work, we applied causal discovery to obtain novel insights into the genetic regulation underlying head-and-neck squamous cell carcinoma. Some methodological challenges needed to be resolved first. The available data contained missing values, but most approaches to causal discovery require complete data. Hence, we propose a new procedure combining constraint-based causal discovery with multiple imputation. This is based on using Rubin's rules for pooling tests of conditional independence. A second challenge was that causal discovery relies on strong assumptions and can be rather unstable. To assess the robustness of our results, we supplemented our investigation with sensitivity analyses, including a non-parametric bootstrap to quantify the variability of the estimated causal structures. We applied these methods to investigate how the high mobility group AT-Hook 2 (HMGA2) gene is incorporated in the protein 53 signalling pathway playing an important role in head-and-neck squamous cell carcinoma. Our results were quite stable and found direct associations between HMGA2 and other relevant proteins, but they did not provide clear support for the claim that HMGA2 itself is a key regulator gene.
引用
收藏
页码:1747 / 1775
页数:29
相关论文
共 50 条
  • [21] Causal inference of gene regulation with subnetwork assembly from genetical genomics data
    Peng, Chien-Hua
    Jiang, Yi-Zhi
    Tai, An-Shun
    Liu, Chun-Bin
    Peng, Shih-Chi
    Liao, Chun-Ta
    Yen, Tzu-Chen
    Hsieh, Wen-Ping
    NUCLEIC ACIDS RESEARCH, 2014, 42 (05) : 2803 - 2819
  • [22] Applying Causal Discovery to Intensive Longitudinal Data
    Stevenson, Brittany L.
    Kummerfeld, Erich
    Merrill, Jennifer E.
    CAUSAL ANALYSIS WORKSHOP SERIES, VOL 160, 2021, 160 : 20 - +
  • [23] Aristotle: stratified causal discovery for omics data
    Mansouri, Mehrdad
    Khakabimamaghani, Sahand
    Chindelevitch, Leonid
    Ester, Martin
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [24] Causal discovery from medical textual data
    Mani, S
    Cooper, GF
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, : 542 - 546
  • [25] Causal Discovery on Non-Euclidean Data
    Yang, Jing
    Xie, Kai
    An, Ning
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2202 - 2211
  • [26] Causal Discovery from Heterogeneous/Nonstationary Data
    Huang, Biwei
    Zhang, Kun
    Zhang, Jiji
    Ramsey, Joseph
    Sanchez-Romero, Ruben
    Glymour, Clark
    Schoelkopf, Bernhard
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [27] Data-driven discovery of causal interactions
    Ma, Saisai
    Liu, Lin
    Li, Jiuyong
    Thuc Duy Le
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2019, 8 (03) : 285 - 297
  • [28] Causal discovery from heterogeneous/nonstationary data
    Huang, Biwei
    Zhang, Kun
    Zhang, Jiji
    Ramsey, Joseph
    Sanchez-Romero, Ruben
    Glymour, Clark
    Schölkopf, Bernhard
    Journal of Machine Learning Research, 2020, 21
  • [29] Aristotle: stratified causal discovery for omics data
    Mehrdad Mansouri
    Sahand Khakabimamaghani
    Leonid Chindelevitch
    Martin Ester
    BMC Bioinformatics, 23
  • [30] Data-driven discovery of causal interactions
    Saisai Ma
    Lin Liu
    Jiuyong Li
    Thuc Duy Le
    International Journal of Data Science and Analytics, 2019, 8 : 285 - 297