Bayesian Active Learning for Optimization and Uncertainty Quantification in Protein Docking

Cited by: 10
Authors
Cao, Yue [1]
Shen, Yang [1, 2]
Affiliations
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, TEES AgriLife Ctr Bioinformat & Genom Syst Engn, College Stn, TX 77840 USA
Funding
National Science Foundation (USA); National Institutes of Health (USA)
Keywords
PREDICTION; REFINEMENT;
DOI
10.1021/acs.jctc.0c00476
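Chinese Library Classification number
O64 [Physical chemistry (theoretical chemistry), chemical physics]
Discipline classification codes
070304; 081704
Abstract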
Ab initio protein docking represents a major challenge for optimizing a noisy and costly "black box"-like function in a high-dimensional space. Despite progress in this field, there is a lack of rigorous uncertainty quantification (UQ). To fill the gap, we introduce a novel algorithm, Bayesian active learning (BAL), for optimization and UQ of such black-box functions with applications to flexible protein docking. BAL directly models the posterior distribution of the global optimum (i.e., native structures) with active sampling and posterior estimation iteratively feeding each other. Furthermore, it uses complex normal modes to span a homogeneous, Euclidean conformation space suitable for high-dimensional optimization and constructs funnel-like energy models for quality estimation of encounter complexes. Over a protein-docking benchmark set and a CAPRI set including homology docking, we establish that BAL significantly improves upon starting points from rigid docking and refinements by particle swarm optimization, providing a top-3 near-native prediction for one third of the targets. Quality assessment empowered with UQ leads to tight quality intervals with a half-range around 25% of the actual interface root-mean-square deviation and a confidence level of 85%. BAL's estimated probability of a prediction being near-native achieves a binary classification AUROC of 0.93 and an area under the precision-recall curve over 0.60 (compared to 0.50 and 0.14, respectively, by chance), which also improves ranking predictions. This study represents the first UQ solution for protein docking, with rigorous theoretical frameworks and comprehensive empirical assessments.
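The chance-level figures quoted in the abstract (AUROC 0.50, precision-recall area 0.14) can be sanity-checked numerically. The following is a minimal sketch, not the authors' evaluation code. It relies on the general fact that an uninformative scorer has an AUROC near 0.5 and a precision-recall area near the fraction of positives, and it assumes (an inference from the 0.14 baseline, not something the record states) that about 14% of the evaluated predictions were near-native. The labels and scores are synthetic, and the function names `auroc` and `average_precision` are illustrative.

```python
import numpy as np

def auroc(labels, scores):
    """Area under the ROC curve via the Mann-Whitney rank statistic."""
    order = np.argsort(scores)
    ranks = np.empty(len(scores), dtype=float)
    ranks[order] = np.arange(1, len(scores) + 1)  # rank 1 = lowest score
    n_pos = float(labels.sum())
    n_neg = float(len(labels)) - n_pos
    rank_sum_pos = ranks[labels == 1].sum()
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)

def average_precision(labels, scores):
    """Precision-recall area, computed as the mean precision at each positive."""
    order = np.argsort(-scores)  # highest score first
    hits = labels[order]
    precision_at_k = np.cumsum(hits) / np.arange(1, len(hits) + 1)
    return precision_at_k[hits == 1].mean()

rng = np.random.default_rng(0)
n = 200_000
# Assumption: 14% of predictions are near-native (positives).
labels = (rng.random(n) < 0.14).astype(int)
# Uninformative scores, independent of the labels.
scores = rng.random(n)

chance_auroc = auroc(labels, scores)
chance_auprc = average_precision(labels, scores)
print(f"chance AUROC ~ {chance_auroc:.3f}, chance AUPRC ~ {chance_auprc:.3f}")
```

Under this assumption, the reported 0.93 and 0.60 are best read against baselines of 0.50 and 0.14: the precision-recall gain is the more demanding one because positives are rare.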
Pages: 5334-5347
Page count: 14