The effect of prior assumptions over the weights in BayesPI with application to study protein-DNA interactions from ChIP-based high-throughput data

被引:4
|
作者
Wang, Junbai [1 ]
机构
[1] Oslo Univ Hosp, Norwegian Radium Hosp, Dept Pathol, N-0310 Oslo, Norway
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
BINDING-SITES; IN-VIVO; STRESS; REPRESSOR; GENES;
D O I
10.1186/1471-2105-11-412
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: To further understand the implementation of hyperparameters re-estimation technique in Bayesian hierarchical model, we added two more prior assumptions over the weight in BayesPI, namely Laplace prior and Cauchy prior, by using the evidence approximation method. In addition, we divided hyperparameter (regularization constants a of the model) into multiple distinct classes based on either the structure of the neural networks or the property of the weights. Results: The newly implemented BayesPI was tested on both synthetic and real ChIP-based high-throughput datasets to identify the corresponding protein binding energy matrices. The results obtained were encouraging: 1) there was a minor effect on the quality of predictions when prior assumptions over the weights were altered (e. g. the prior probability distributions to the weights and the number of classes to the hyperparameters) in BayesPI; 2) however, there was a significant impact on the computational speed when tuning the weight prior in the model: for example, BayesPI with a Laplace weight prior achieved the best performance with regard to both the computational speed and the prediction accuracy. Conclusions: From this study, we learned that it is absolutely necessary to try different prior assumptions over the weights in Bayesian hierarchical model to design an efficient learning algorithm, though the quality of the final results may not be associated with such changes. In future, the evidence approximation method can be an alternative to Monte Carlo methods for computational implementation of Bayesian hierarchical model.
引用
收藏
页数:10
相关论文
共 7 条
  • [1] The effect of prior assumptions over the weights in BayesPI with application to study protein-DNA interactions from ChIP-based high-throughput data
    Junbai Wang
    BMC Bioinformatics, 11
  • [2] ChIP-seq: Using high-throughput sequencing to discover protein-DNA interactions
    Schmidt, Dominic
    Wilson, Michael D.
    Spyrou, Christiana
    Brown, Gordon D.
    Hadfield, James
    Odom, Duncan T.
    METHODS, 2009, 48 (03) : 240 - 248
  • [3] DNB-based on-chip motif finding: A high-throughput method to profile different types of protein-DNA interactions
    Li, Zhuokun
    Wang, Xiaojue
    Xu, Dongyang
    Zhang, Dengwei
    Wang, Dan
    Dai, Xuechen
    Wang, Qi
    Li, Zhou
    Gu, Ying
    Ouyang, Wenjie
    Zhao, Shuchang
    Huang, Baoqian
    Gong, Jian
    Zhao, Jing
    Chen, Ao
    Shen, Yue
    Dong, Yuliang
    Zhang, Wenwei
    Xu, Xun
    Xu, Chongjun
    Jiang, Yuan
    SCIENCE ADVANCES, 2020, 6 (31)
  • [4] W-ChIPMotifs: a web application tool for de novo motif discovery from ChIP-based high-throughput data
    Jin, Victor X.
    Apostolos, Jeff
    Nagisetty, Naga Satya Venkateswara Ra
    Farnham, Peggy J.
    BIOINFORMATICS, 2009, 25 (23) : 3191 - 3193
  • [5] Evaluating a linear k-mer model for protein-DNA interactions using high-throughput SELEX data
    Kahara, Juhani
    Lahdesmaki, Harri
    BMC BIOINFORMATICS, 2013, 14
  • [6] Evaluating a linear k-mer model for protein-DNA interactions using high-throughput SELEX data
    Juhani Kähärä
    Harri Lähdesmäki
    BMC Bioinformatics, 14
  • [7] Building accurate sequence-to-affinity models from high-throughput in vitro protein-DNA binding data using FeatureREDUCE
    Riley, Todd R.
    Lazarovici, Allan
    Mann, Richard S.
    Bussemaker, Harmen J.
    ELIFE, 2015, 4