Bayesian Active Learning for Optimization and Uncertainty Quantification in Protein Docking

被引:10
|
作者
Cao, Yue [1 ]
Shen, Yang [1 ,2 ]
机构
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, TEES AgriLife Ctr Bioinformat & Genom Syst Engn, College Stn, TX 77840 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
PREDICTION; REFINEMENT;
D O I
10.1021/acs.jctc.0c00476
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Ab initio protein docking represents a major challenge for optimizing a noisy and costly "black box"-like function in a high-dimensional space. Despite progress in this field, there is a lack of rigorous uncertainty quantification (UQ). To fill the gap, we introduce a novel algorithm, Bayesian active learning (BAL), for optimization and UQ of such black-box functions with applications to flexible protein docking. BAL directly models the posterior distribution of the global optimum (i.e., native structures) with active sampling and posterior estimation iteratively feeding each other. Furthermore, it uses complex normal modes to span a homogeneous, Euclidean conformation space suitable for high-dimensional optimization and constructs funnel-like energy models for quality estimation of encounter complexes. Over a protein-docking benchmark set and a CAPRI set including homology docking, we establish that BAL significantly improves against starting points from rigid docking and refinements by particle swarm optimization, providing a top-3 near-native prediction for one third targets. Quality assessment empowered with UQ leads to tight quality intervals with half range around 25% of the actual interface root-mean-square deviation and confidence level at 85%. BAL's estimated probability of a prediction being near-native achieves binary classification AUROC at 0.93 and area under the precision recall curve over 0.60 (compared to 0.50 and 0.14, respectively, by chance), which also improves ranking predictions. This study represents the first UQ solution for protein docking, with rigorous theoretical frameworks and comprehensive empirical assessments.
引用
收藏
页码:5334 / 5347
页数:14
相关论文
共 50 条
  • [41] Machine learning and model driven bayesian uncertainty quantification in suspended nonstructural systems
    Qin, Zhiyuan
    Naser, M. Z.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2023, 237
  • [42] A Bayesian Deep Learning Framework for RUL Prediction Incorporating Uncertainty Quantification and Calibration
    Lin, Yan-Hui
    Li, Gang-Hui
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (10) : 7274 - 7284
  • [43] BAYESIAN DEEP LEARNING FRAMEWORK FOR UNCERTAINTY QUANTIFICATION IN STOCHASTIC PARTIAL DIFFERENTIAL EQUATIONS
    Jung, Jeahan
    Shin, Hyomin
    Choi, Minseok
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2024, 46 (01): : C57 - C76
  • [44] Uncertainty-Guided Active Reinforcement Learning with Bayesian Neural Networks
    Wu, Xinyang
    El-Shamouty, Mohamed
    Nitsche, Christof
    Huber, Marco F.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5751 - 5757
  • [45] Active learning for SNAP interatomic potentials via Bayesian predictive uncertainty
    Williams, Logan
    Sargsyan, Khachik
    Rohskopf, Andrew
    Najm, Habib N.
    COMPUTATIONAL MATERIALS SCIENCE, 2024, 242
  • [46] Conservative Gaussian Process Models for Uncertainty Quantification and Bayesian Optimization in Signal Integrity Applications
    Manfredi, Paolo
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2024, 14 (07): : 1261 - 1272
  • [47] Heterogeneous uncertainty quantification using Bayesian inference for simulation-based design optimization
    Li, Mingyang
    Wang, Zequn
    STRUCTURAL SAFETY, 2020, 85
  • [48] Multi-fidelity machine-learning with uncertainty quantification and Bayesian optimization for materials design: Application to ternary random alloys
    Tran, Anh
    Tranchida, Julien
    Wildey, Tim
    Thompson, Aidan P.
    JOURNAL OF CHEMICAL PHYSICS, 2020, 153 (07):
  • [49] Multi-Objective Bayesian Optimization with Active Preference Learning
    Ozaki, Ryota
    Ishikawa, Kazuki
    Kanzaki, Youhei
    Takeno, Shion
    Takeuchi, Ichiro
    Karasuyama, Masayuki
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14490 - 14498
  • [50] Simultaneous optimization and uncertainty quantification
    Gray, Genetha A.
    Lee, Herbert K. H.
    Guenther, John
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2012, 12 (1-2) : 99 - 110