A Bayesian Approach to High-Throughput Biological Model Generation

被引:0
|
作者
Shi, Xinghua [1 ]
Stevens, Rick [1 ]
机构
[1] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
关键词
ESCHERICHIA-COLI; ENZYMES;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the availability of hundreds and soon thousands of complete genomes, the construction of genome-scale metabolic models for these organisms has attracted much attention. Manual work still dominates the process of model generation, however, and leads to the huge gap between the number of complete genomes and genome-scale metabolic models. The challenge in constructing genome-scale models from existing databases is that usually such a directly extracted model is incomplete and contains network holes. Network holes occur when a network is disconnected and certain metabolites cannot be produced or consumed. In order to construct a valid metabolic model, network holes need to be filled by introducing candidate reactions into the network. As a step toward the high-throughput generation of biological models, we propose a Bayesian approach to improving draft genome-scale metabolic models. A collection of 23 types of biological and topological evidence is extracted from the SEED [1), KEGG [2], and BiGG [3] databases. Based on this evidence, we create 23 individual predictors using Bayesian approaches. To combine these individual predictors and unify their predictive results, we build an ensemble of individual predictors on majority vote and four classifiers: naive Bayes classifier, Bayesian network, multilayer perceptron network and AdaBoost. A set of experiments is performed to train and test individual predictors and integrative mechanisms of single predictors and to evaluate the performance of our approach.
引用
收藏
页码:376 / 387
页数:12
相关论文
共 50 条
  • [1] BAYESIAN MULTISTUDY FACTOR ANALYSIS FOR HIGH-THROUGHPUT BIOLOGICAL DATA
    De Vito, Roberta
    Bellio, Ruggero
    Trippa, Lorenzo
    Parmigiani, Giovanni
    [J]. ANNALS OF APPLIED STATISTICS, 2021, 15 (04): : 1723 - 1741
  • [2] A reproducible approach to high-throughput biological data acquisition and integration
    Boernigen, Daniela
    Moon, Yo Sup
    Rahnavard, Gholamali
    Waldron, Levi
    McIver, Lauren
    Shafquat, Afrah
    Franzosa, Eric A.
    Miropolsky, Larissa
    Sweeney, Christopher
    Morgan, Xochitl C.
    Garrett, Wendy S.
    Huttenhower, Curtis
    [J]. PEERJ, 2015, 3
  • [3] A Bayesian method for biological pathway discovery from high-throughput experimental data
    Wang, W
    Cooper, GF
    [J]. 2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004, : 645 - 646
  • [4] Pathway analysis of high-throughput biological data within a Bayesian network framework
    Isci, Senol
    Ozturk, Cengizhan
    Jones, Jon
    Otu, Hasan H.
    [J]. BIOINFORMATICS, 2011, 27 (12) : 1667 - 1674
  • [5] An Empirical Bayesian Approach for Identifying Differential Coexpression in High-Throughput Experiments
    Dawson, John A.
    Kendziorski, Christina
    [J]. BIOMETRICS, 2012, 68 (02) : 455 - 465
  • [6] High-throughput analysis of biological activities
    Lazo, J. S.
    [J]. MOLECULAR INTERVENTIONS, 2006, 6 (04) : 192 - 192
  • [7] An approach to high-throughput genotyping
    Hall, JM
    LeDuc, CA
    Watson, AR
    Roter, AH
    [J]. GENOME RESEARCH, 1996, 6 (09): : 781 - 790
  • [8] Graph Theory Approach to High-Throughput Surface Adsorption Structure Generation
    Boes, Jacob R.
    Mamun, Osman
    Winther, Kirsten
    Bligaard, Thomas
    [J]. JOURNAL OF PHYSICAL CHEMISTRY A, 2019, 123 (11): : 2281 - 2285
  • [9] High-throughput docking for lead generation
    Abagyan, R
    Totrov, M
    [J]. CURRENT OPINION IN CHEMICAL BIOLOGY, 2001, 5 (04) : 375 - 382
  • [10] Automated high-throughput generation of droplets
    Guzowski, Jan
    Korczyk, Piotr M.
    Jakiela, Slawomir
    Garstecki, Piotr
    [J]. LAB ON A CHIP, 2011, 11 (21) : 3593 - 3595