Aristotle: stratified causal discovery for omics data

被引:5
|
作者
Mansouri, Mehrdad [1 ]
Khakabimamaghani, Sahand [1 ]
Chindelevitch, Leonid [1 ]
Ester, Martin [1 ]
机构
[1] Simon Fraser Univ, Sch Comp Sci, 8888 Univ Dr, Burnaby, CA 77004 USA
基金
加拿大自然科学与工程研究理事会;
关键词
Causal discovery; Stratification; Biclustering; Quasi-experiment; CARDIOTOXICITY; SELECTION; NETWORK; LATENT;
D O I
10.1186/s12859-021-04521-w
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background There has been a simultaneous increase in demand and accessibility across genomics, transcriptomics, proteomics and metabolomics data, known as omics data. This has encouraged widespread application of omics data in life sciences, from personalized medicine to the discovery of underlying pathophysiology of diseases. Causal analysis of omics data may provide important insight into the underlying biological mechanisms. Existing causal analysis methods yield promising results when identifying potential general causes of an observed outcome based on omics data. However, they may fail to discover the causes specific to a particular stratum of individuals and missing from others. Methods To fill this gap, we introduce the problem of stratified causal discovery and propose a method, Aristotle, for solving it. Aristotle addresses the two challenges intrinsic to omics data: high dimensionality and hidden stratification. It employs existing biological knowledge and a state-of-the-art patient stratification method to tackle the above challenges and applies a quasi-experimental design method to each stratum to find stratum-specific potential causes. Results Evaluation based on synthetic data shows better performance for Aristotle in discovering true causes under different conditions compared to existing causal discovery methods. Experiments on a real dataset on Anthracycline Cardiotoxicity indicate that Aristotle's predictions are consistent with the existing literature. Moreover, Aristotle makes additional predictions that suggest further investigations.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Causal Discovery from Heterogeneous/Nonstationary Data
    Huang, Biwei
    Zhang, Kun
    Zhang, Jiji
    Ramsey, Joseph
    Sanchez-Romero, Ruben
    Glymour, Clark
    Schoelkopf, Bernhard
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [22] Data-driven discovery of causal interactions
    Ma, Saisai
    Liu, Lin
    Li, Jiuyong
    Thuc Duy Le
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2019, 8 (03) : 285 - 297
  • [23] Causal discovery from heterogeneous/nonstationary data
    Huang, Biwei
    Zhang, Kun
    Zhang, Jiji
    Ramsey, Joseph
    Sanchez-Romero, Ruben
    Glymour, Clark
    Schölkopf, Bernhard
    Journal of Machine Learning Research, 2020, 21
  • [24] Data-driven discovery of causal interactions
    Saisai Ma
    Lin Liu
    Jiuyong Li
    Thuc Duy Le
    International Journal of Data Science and Analytics, 2019, 8 : 285 - 297
  • [25] Omics Data and Their Integrative Analysis to Support Stratified Medicine in Neurodegenerative Diseases
    La Cognata, Valentina
    Morello, Giovanna
    Cavallaro, Sebastiano
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (09)
  • [26] CAUSAL EXPLANATION IN THE BIOLOGY OF ARISTOTLE
    PELLEGRIN, P
    REVUE DE METAPHYSIQUE ET DE MORALE, 1990, 95 (02): : 197 - 219
  • [27] Causal inference of molecular networks integrating multi-omics data
    Penagaricano, F.
    JOURNAL OF ANIMAL SCIENCE, 2016, 94 : 199 - 200
  • [28] Big Data in Omics and Imaging: Integrated Analysis and Causal Inference.
    Chen, Oliver Y.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (529) : 487 - 488
  • [29] Interpretable AI for inference of causal molecular relationships from omics data
    Dibaeinia, Payam
    Ojha, Abhishek
    Sinha, Saurabh
    SCIENCE ADVANCES, 2025, 11 (07):
  • [30] Causality and Causal Explanation in Aristotle
    Marre, Thomas
    REVIEW OF METAPHYSICS, 2024, 78 (01):