Bayesian Active Learning for Optimization and Uncertainty Quantification in Protein Docking

被引:10
|
作者
Cao, Yue [1 ]
Shen, Yang [1 ,2 ]
机构
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, TEES AgriLife Ctr Bioinformat & Genom Syst Engn, College Stn, TX 77840 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
PREDICTION; REFINEMENT;
D O I
10.1021/acs.jctc.0c00476
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Ab initio protein docking represents a major challenge for optimizing a noisy and costly "black box"-like function in a high-dimensional space. Despite progress in this field, there is a lack of rigorous uncertainty quantification (UQ). To fill the gap, we introduce a novel algorithm, Bayesian active learning (BAL), for optimization and UQ of such black-box functions with applications to flexible protein docking. BAL directly models the posterior distribution of the global optimum (i.e., native structures) with active sampling and posterior estimation iteratively feeding each other. Furthermore, it uses complex normal modes to span a homogeneous, Euclidean conformation space suitable for high-dimensional optimization and constructs funnel-like energy models for quality estimation of encounter complexes. Over a protein-docking benchmark set and a CAPRI set including homology docking, we establish that BAL significantly improves against starting points from rigid docking and refinements by particle swarm optimization, providing a top-3 near-native prediction for one third targets. Quality assessment empowered with UQ leads to tight quality intervals with half range around 25% of the actual interface root-mean-square deviation and confidence level at 85%. BAL's estimated probability of a prediction being near-native achieves binary classification AUROC at 0.93 and area under the precision recall curve over 0.60 (compared to 0.50 and 0.14, respectively, by chance), which also improves ranking predictions. This study represents the first UQ solution for protein docking, with rigorous theoretical frameworks and comprehensive empirical assessments.
引用
收藏
页码:5334 / 5347
页数:14
相关论文
共 50 条
  • [31] Efficient Bayesian Yield Analysis and Optimization with Active Learning
    Yin, Shuo
    Jin, Xiang
    Shi, Linxu
    Wang, Kang
    Xing, Wei W.
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 1195 - 1200
  • [32] Bayesian Uncertainty Quantification with Synthetic Data
    Phan, Buu
    Khan, Samin
    Salay, Rick
    Czarnecki, Krzysztof
    COMPUTER SAFETY, RELIABILITY, AND SECURITY, SAFECOMP 2019, 2019, 11699 : 378 - 390
  • [33] On the Quantification of Model Uncertainty: A Bayesian Perspective
    David Kaplan
    Psychometrika, 2021, 86 : 215 - 238
  • [34] A Bayesian approach for quantification of model uncertainty
    Park, Inseok
    Amarchinta, Hemanth K.
    Grandhi, Ramana V.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2010, 95 (07) : 777 - 785
  • [35] Bayesian optical flow with uncertainty quantification
    Sun, Jie
    Quevedo, Fernando J.
    Bolit, Erik
    INVERSE PROBLEMS, 2018, 34 (10)
  • [36] On the Quantification of Model Uncertainty: A Bayesian Perspective
    Kaplan, David
    PSYCHOMETRIKA, 2021, 86 (01) : 215 - 238
  • [37] Uncertainty Quantification in Classifying Complex Geological Facies Using Bayesian Deep Learning
    Hossain, Touhid Mohammad
    Hermana, Maman
    Jaya, Makky Sandra
    Sakai, Hiroshi
    Abdulkadir, Said Jadid
    IEEE ACCESS, 2022, 10 : 113767 - 113777
  • [38] Uncertainty Quantification in Predicting Physical Property of Porous Medium With Bayesian Evidential Learning
    Fan, Zhenzhen
    Zuo, Chen
    Guo, Chen
    Yang, Zhifang
    Yan, Xinfei
    Li, Xiaoming
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
  • [39] Volcano-Seismic Transfer Learning and Uncertainty Quantification With Bayesian Neural Networks
    Bueno, Angel
    Benitez, Carmen
    De Angelis, Silvio
    Diaz Moreno, Alejandro
    Ibanez, Jesus M.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (02): : 892 - 902
  • [40] A Novel Bayesian Deep Learning Approach to the Downscaling of Wind Speed with Uncertainty Quantification
    Gerges, Firas
    Boufadel, Michel C.
    Bou-Zeid, Elie
    Nassif, Hani
    Wang, Jason T. L.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT III, 2022, 13282 : 55 - 66