A method for improving the reliability of causal inference from large-scale data in biomedicine

被引:0
|
作者
Liu, Yitao [1 ,2 ]
Lyu, Xiaoqing [1 ]
Xie, Haihua [3 ]
Yan, Xiaotong [1 ,4 ]
Wang, Bei [1 ]
Tang, Zhi [1 ]
机构
[1] Peking Univ, Inst Comp Sci & Technol, Beijing, Peoples R China
[2] NYU Shanghai, Shanghai, Peoples R China
[3] Peking Univ Founder Grp Co LTD, State Key Lab Digital Publishing Technol, Beijing, Peoples R China
[4] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
causal inference; causal relation; drug-target-disease analysis; significance level; GRAPHICAL MODELS;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Causal inference is an essential problem in the field of drug discovery. Most existing methods depend on hypothesis testing with the analysis of drug-disease or drug-target-disease relations. However, a remaining challenge is how to choose a significance level for conditional independence tests to achieve the most reliable causal relation result. In this paper, we propose a constraint-based causal discovery method to achieve more reliable causal relations. An independence test reliability (ITR) is adopted to measure the reliability of each independence test result, and an aggregated causal relation reliability (ACRR) is introduced to evaluate the global reliability of a causal discovery. A drug-target-disease dataset is established by collecting information from the literature in PubMed using data mining techniques. The result of our experiments on different datasets shows that our proposed method obtains more stable causal relations than the existing approaches.
引用
收藏
页码:693 / 696
页数:4
相关论文
共 50 条
  • [1] Scalable Algorithms for Bayesian Inference of Large-Scale Models from Large-Scale Data
    Ghattas, Omar
    Isaac, Tobin
    Petra, Noemi
    Stadler, Georg
    [J]. HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2016, 2017, 10150 : 3 - 6
  • [2] Causal relationship inference for a large-scale cellular network
    Zhou, Tong
    Wang, Ya-Li
    [J]. BIOINFORMATICS, 2010, 26 (16) : 2020 - 2028
  • [3] Large-scale local causal inference of gene regulatory relationships
    Bucur, Ioan Gabriel
    Claassen, Tom
    Heskes, Tom
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2019, 115 : 50 - 68
  • [4] Large-scale inference of human genetic data
    Rivas, M. A.
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 1064 - 1064
  • [5] Causal inference with large-scale assessments in education from a Bayesian perspective: a review and synthesis
    Kaplan D.
    [J]. Large-scale Assessments in Education, 4 (1)
  • [6] Predicting causal effects in large-scale systems from observational data
    Maathuis, Marloes H.
    Colombo, Diego
    Kalisch, Markus
    Buehlmann, Peter
    [J]. NATURE METHODS, 2010, 7 (04) : 247 - 248
  • [7] Predicting causal effects in large-scale systems from observational data
    Marloes H Maathuis
    Diego Colombo
    Markus Kalisch
    Peter Bühlmann
    [J]. Nature Methods, 2010, 7 : 247 - 248
  • [8] ZaliQL: Causal Inference from Observational Data at Scale
    Salimi, Babak
    Cole, Corey
    Ports, Dan R. K.
    Suciu, Dan
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12): : 1957 - 1960
  • [9] Bagging Statistical Network Inference from Large-Scale Gene Expression Data
    Simoes, Ricardo de Matos
    Emmert-Streib, Frank
    [J]. PLOS ONE, 2012, 7 (03):
  • [10] FLAME: A Fast Large-scale Almost Matching Exactly Approach to Causal Inference
    Wang, Tianyu
    Morucci, Marco
    Awan, M. Usaid
    Liu, Yameng
    Roy, Sudeepa
    Rudin, Cynthia
    Volfovsky, Alexander
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22