Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data

Cited by: 19
Authors
Gao, Shouguo [1 ,2 ]
Wang, Xujing [1 ,2 ]
Affiliations
[1] Univ Alabama Birmingham, Dept Phys, Birmingham, AL 35294 USA
[2] Univ Alabama Birmingham, Comprehens Diabet Ctr, Birmingham, AL 35294 USA
Source
BMC BIOINFORMATICS | 2011 / Vol. 12
Keywords
SACCHAROMYCES-CEREVISIAE; REGULATORY NETWORKS; CELL-CYCLE; YEAST; FRAMEWORK; ONTOLOGY; DESIGN;
DOI
10.1186/1471-2105-12-359
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Bayesian network (BN) modeling is a powerful approach to reconstructing genetic regulatory networks from gene expression data. However, expression data alone suffer from high noise and lack of power. Incorporating prior biological knowledge can improve performance. Because each type of prior knowledge may on its own be incomplete or limited by quality issues, integrating multiple sources of prior knowledge to exploit their consensus is desirable. Results: We introduce a new method to incorporate the quantitative information from multiple sources of prior knowledge. It first uses a naive Bayesian classifier to assess the likelihood of functional linkage between gene pairs based on prior knowledge. In this study we included co-citation in PubMed and semantic similarity in Gene Ontology annotation. A candidate network edge reservoir is then created in which the copy number of each edge is proportional to the estimated likelihood of linkage between the two corresponding genes. In network simulation, a Markov chain Monte Carlo sampling algorithm is adopted that samples from this reservoir at each iteration to generate new candidate networks. We evaluated the new algorithm using both simulated and real gene expression data, including data from a yeast cell cycle study and a mouse pancreas development/growth study. Incorporating prior knowledge led to an approximately 2-fold increase in the number of known transcription regulations recovered, without a significant change in the false positive rate. In contrast, without the prior knowledge, BN modeling is not always better than random selection, demonstrating the need in network modeling to supplement gene expression data with additional information. Conclusion: Our new development provides a statistical means to utilize the quantitative information in prior biological knowledge in the BN modeling of gene expression data, which significantly improves performance.
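The edge reservoir idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the gene names, the likelihood values, the `scale` parameter, and both function names are hypothetical, and the sketch covers only the proposal step (drawing a candidate edge), not the MCMC acceptance step, the network scoring, or any acyclicity check.

```python
import random
from collections import Counter


def build_edge_reservoir(linkage_likelihood, scale=10):
    """Build a list in which each edge appears round(scale * likelihood) times.

    `linkage_likelihood` maps a (gene, gene) pair to an estimated linkage
    likelihood in [0, 1]; `scale` is an assumed tuning constant.
    """
    reservoir = []
    for edge, likelihood in linkage_likelihood.items():
        copies = round(scale * likelihood)
        reservoir.extend([edge] * copies)
    return reservoir


def propose_edge(reservoir, rng):
    """Draw one candidate edge uniformly from the reservoir.

    Because copy numbers track likelihood, well-supported edges are
    proposed more often than weakly supported ones.
    """
    return rng.choice(reservoir)


# Hypothetical likelihoods, e.g. as a naive Bayes classifier might output.
linkage_likelihood = {
    ("A", "B"): 0.9,
    ("A", "C"): 0.3,
    ("B", "C"): 0.1,
}

reservoir = build_edge_reservoir(linkage_likelihood)
rng = random.Random(0)
draws = Counter(propose_edge(reservoir, rng) for _ in range(10_000))
```

Storing explicit copies keeps the proposal step a plain uniform draw; weighted sampling over the distinct edges would be an equivalent alternative that uses less memory.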
Pages: 13
Related papers
50 in total
  • [31] Clustering gene expression series with prior knowledge
    Bréhélin, L
    ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS, 2005, 3692 : 27 - 38
  • [32] Bayesian Network Webserver: a comprehensive tool for biological network modeling
    Ziebarth, Jesse D.
    Bhattacharya, Anindya
    Cui, Yan
    BIOINFORMATICS, 2013, 29 (21) : 2801 - 2803
  • [33] BioBayesNet: a web server for feature extraction and Bayesian network modeling of biological sequence data
    Nikolajewa, Swetlana
    Pudimat, Rainer
    Hiller, Michael
    Platzer, Matthias
    Backofen, Rolf
    NUCLEIC ACIDS RESEARCH, 2007, 35 : W688 - W693
  • [34] Dynamic Transformation of Prior Knowledge Into Bayesian Models for Data Streams
    Bach, Tran Xuan
    Anh, Nguyen Duc
    Linh, Ngo Van
    Than, Khoat
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 3742 - 3750
  • [35] Measures of Bayesian discrepancy between prior beliefs and data knowledge
    Bousquet, N.
    Celeux, G.
    SAFETY AND RELIABILITY FOR MANAGING RISK, VOLS 1-3, 2006, : 867 - 872
  • [36] Editing Bayesian Networks: A New Approach for Combining Prior Knowledge and Gene Expression Measurements for Researching Diseases
    Rubinstein, Udi
    Felder, Yifat
    Ginzbourg, Nana
    Gurevich, Michael
    Tuller, Tamir
    2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2008, : 273 - +
  • [37] Bayesian Modeling of MPSS Data: Gene Expression Analysis of Bovine Salmonella Infection
    Dhavala, Soma S.
    Datta, Sujay
    Mallick, Bani K.
    Carroll, Raymond J.
    Khare, Sangeeta
    Lawhon, Sara D.
    Adams, L. Garry
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (491) : 956 - 967
  • [38] Bayesian hierarchical modeling of means and covariances of gene expression data within families
    Roger Pique-Regi
    John Morrison
    Duncan C Thomas
    BMC Proceedings, 1 (Suppl 1)
  • [39] Bayesian biclustering of gene expression data
    Jiajun Gu
    Jun S Liu
    BMC Genomics, 9
  • [40] Bayesian biclustering of gene expression data
    Gu, Jiajun
    Liu, Jun S.
    BMC GENOMICS, 2008, 9 (Suppl 1)