Aristotle: stratified causal discovery for omics data

被引:5
|
作者
Mansouri, Mehrdad [1 ]
Khakabimamaghani, Sahand [1 ]
Chindelevitch, Leonid [1 ]
Ester, Martin [1 ]
机构
[1] Simon Fraser Univ, Sch Comp Sci, 8888 Univ Dr, Burnaby, CA 77004 USA
基金
加拿大自然科学与工程研究理事会;
关键词
Causal discovery; Stratification; Biclustering; Quasi-experiment; CARDIOTOXICITY; SELECTION; NETWORK; LATENT;
D O I
10.1186/s12859-021-04521-w
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background There has been a simultaneous increase in demand and accessibility across genomics, transcriptomics, proteomics and metabolomics data, known as omics data. This has encouraged widespread application of omics data in life sciences, from personalized medicine to the discovery of underlying pathophysiology of diseases. Causal analysis of omics data may provide important insight into the underlying biological mechanisms. Existing causal analysis methods yield promising results when identifying potential general causes of an observed outcome based on omics data. However, they may fail to discover the causes specific to a particular stratum of individuals and missing from others. Methods To fill this gap, we introduce the problem of stratified causal discovery and propose a method, Aristotle, for solving it. Aristotle addresses the two challenges intrinsic to omics data: high dimensionality and hidden stratification. It employs existing biological knowledge and a state-of-the-art patient stratification method to tackle the above challenges and applies a quasi-experimental design method to each stratum to find stratum-specific potential causes. Results Evaluation based on synthetic data shows better performance for Aristotle in discovering true causes under different conditions compared to existing causal discovery methods. Experiments on a real dataset on Anthracycline Cardiotoxicity indicate that Aristotle's predictions are consistent with the existing literature. Moreover, Aristotle makes additional predictions that suggest further investigations.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Aristotle: stratified causal discovery for omics data
    Mehrdad Mansouri
    Sahand Khakabimamaghani
    Leonid Chindelevitch
    Martin Ester
    BMC Bioinformatics, 23
  • [2] From 'Omics to Multi-omics Technologies: the Discovery of Novel Causal Mediators
    Mohammadi-Shemirani, Pedrum
    Sood, Tushar
    Pare, Guillaume
    CURRENT ATHEROSCLEROSIS REPORTS, 2023, 25 (02) : 55 - 65
  • [3] From ‘Omics to Multi-omics Technologies: the Discovery of Novel Causal Mediators
    Pedrum Mohammadi-Shemirani
    Tushar Sood
    Guillaume Paré
    Current Atherosclerosis Reports, 2023, 25 : 55 - 65
  • [4] "Omics" data and levels of evidence for biomarker discovery
    Ghosh, Debashis
    Poisson, Laila M.
    GENOMICS, 2009, 93 (01) : 13 - 16
  • [5] Causal discovery with Point of Sales data
    Gmeiner, Peter
    3RD INTERNATIONAL CONFERENCE ON ADVANCED RESEARCH METHODS AND ANALYTICS (CARMA 2020), 2020, : 339 - 339
  • [6] Causal discovery on high dimensional data
    Zhifeng Hao
    Hao Zhang
    Ruichu Cai
    Wen Wen
    Zhihao Li
    Applied Intelligence, 2015, 42 : 594 - 607
  • [7] Causal Discovery from Temporal Data
    Gong, Chang
    Yao, Di
    Zhang, Chuzhe
    Li, Wenbin
    Bi, Jingping
    Du, Lun
    Wang, Jin
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5803 - 5804
  • [8] Causal Discovery in the Presence of Missing Data
    Tu, Ruibo
    Zhang, Cheng
    Ackermann, Paul
    Mohan, Karthika
    Kjellstrom, Hedvig
    Zhang, Kun
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [9] Causal Discovery with Heterogeneous Observational Data
    Zhou, Fangting
    He, Kejun
    Ni, Yang
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2383 - 2393
  • [10] Causal discovery on high dimensional data
    Hao, Zhifeng
    Zhang, Hao
    Cai, Ruichu
    Wen, Wen
    Li, Zhihao
    APPLIED INTELLIGENCE, 2015, 42 (03) : 594 - 607