Bayesian Active Learning for Optimization and Uncertainty Quantification in Protein Docking

Cited by: 10
Authors
Cao, Yue [1]
Shen, Yang [1, 2]
Affiliations
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, TEES AgriLife Ctr Bioinformat & Genom Syst Engn, College Stn, TX 77840 USA
Funding
US National Science Foundation; US National Institutes of Health
Keywords
PREDICTION; REFINEMENT;
DOI
10.1021/acs.jctc.0c00476
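Chinese Library Classification (CLC) number
O64 [Physical chemistry (theoretical chemistry), chemical physics]
Discipline classification codes
070304; 081704
Abstract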
Ab initio protein docking represents a major challenge for optimizing a noisy and costly "black box"-like function in a high-dimensional space. Despite progress in this field, there is a lack of rigorous uncertainty quantification (UQ). To fill the gap, we introduce a novel algorithm, Bayesian active learning (BAL), for optimization and UQ of such black-box functions, with applications to flexible protein docking. BAL directly models the posterior distribution of the global optimum (i.e., native structures), with active sampling and posterior estimation iteratively feeding each other. Furthermore, it uses complex normal modes to span a homogeneous, Euclidean conformation space suitable for high-dimensional optimization and constructs funnel-like energy models for quality estimation of encounter complexes. Over a protein-docking benchmark set and a CAPRI set including homology docking, we establish that BAL significantly improves upon starting points from rigid docking and upon refinements by particle swarm optimization, providing a top-3 near-native prediction for one-third of the targets. Quality assessment empowered with UQ leads to tight quality intervals, with a half range around 25% of the actual interface root-mean-square deviation and a confidence level of 85%. BAL's estimated probability of a prediction being near-native achieves a binary-classification AUROC of 0.93 and an area under the precision-recall curve over 0.60 (compared to 0.50 and 0.14, respectively, by chance), which also improves the ranking of predictions. This study represents the first UQ solution for protein docking, with rigorous theoretical frameworks and comprehensive empirical assessments.
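Note on the chance baselines quoted in the abstract: a classifier that scores predictions at random has an expected AUROC of 0.50, and its expected area under the precision-recall curve equals the fraction of positives. The chance value of 0.14 therefore suggests that roughly 14% of the assessed predictions were near-native; that fraction is an inference made here, not a figure stated in the record. The following is a minimal sketch on simulated labels and random scores only, using hypothetical helper names, and it does not reproduce the paper's data or method.

```python
import random


def auroc(labels, scores):
    """AUROC via the rank-sum (Mann-Whitney) identity; assumes no tied scores."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    rank_sum = sum(rank + 1 for rank, i in enumerate(order) if labels[i] == 1)
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    return (rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def average_precision(labels, scores):
    """Average precision: mean of precision@k taken at each positive item."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits, total = 0, 0.0
    for k, i in enumerate(order, start=1):
        if labels[i] == 1:
            hits += 1
            total += hits / k
    return total / hits


rng = random.Random(0)
n = 20000
prevalence = 0.14  # ASSUMPTION: positive fraction inferred from the chance AUPRC

# Simulated ground truth (1 = near-native) and uninformative random scores.
labels = [1 if rng.random() < prevalence else 0 for _ in range(n)]
random_scores = [rng.random() for _ in range(n)]

chance_auroc = auroc(labels, random_scores)
chance_auprc = average_precision(labels, random_scores)
print(f"chance AUROC ~ {chance_auroc:.2f}, chance AUPRC ~ {chance_auprc:.2f}")
```

Against these baselines, the reported AUROC of 0.93 and AUPRC above 0.60 indicate that the estimated probabilities carry substantial ranking information.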
Pages: 5334 - 5347
Number of pages: 14
Related papers
50 in total
  • [21] Bayesian Active Learning by Soft Mean Objective Cost of Uncertainty
    Zhao, Guang
    Dougherty, Edward R.
    Yoon, Byung-Jun
    Alexander, Francis J.
    Qian, Xiaoning
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [22] Self-Correcting Bayesian Optimization through Bayesian Active Learning
    Hvarfner, Carl
    Hellsten, Erik Orm
    Hutter, Frank
    Nardi, Luigi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [23] Surrogate model uncertainty quantification for active learning reliability analysis
    Yong PANG
    Shuai ZHANG
    Pengwei LIANG
    Muchen WANG
    Zhuangzhuang GONG
    Xueguan SONG
    Ziyun KAN
    Chinese Journal of Aeronautics, 2024, 37 (12) : 55 - 70
  • [24] An active learning method for diabetic retinopathy classification with uncertainty quantification
    Ahsan, Muhammad Ahtazaz
    Qayyum, Adnan
    Razi, Adeel
    Qadir, Junaid
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (10) : 2797 - 2811
  • [25] A Comparison of Uncertainty Quantification Methods for Active Learning in Image Classification
    Hein, Alice
    Roehrl, Stefan
    Grobel, Thea
    Lengl, Manuel
    Hafez, Nawal
    Knopp, Martin
    Klenk, Christian
    Heim, Dominik
    Hayden, Oliver
    Diepold, Klaus
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022
  • [28] Learning and optimization under epistemic uncertainty with Bayesian hybrid models
    Eugene, Elvis A.
    Jones, Kyla D.
    Gao, Xian
    Wang, Jialu
    Dowling, Alexander W.
    COMPUTERS & CHEMICAL ENGINEERING, 2023, 179
  • [29] Korali: Efficient and scalable software framework for Bayesian uncertainty quantification and stochastic optimization
    Martin, Sergio M.
    Waelchli, Daniel
    Arampatzis, Georgios
    Economides, Athena E.
    Karnakov, Petr
    Koumoutsakos, Petros
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2022, 389
  • [30] Scalable Bayesian Uncertainty Quantification in Imaging Inverse Problems via Convex Optimization
    Repetti, Audrey
    Pereyra, Marcelo
    Wiaux, Yves
    SIAM JOURNAL ON IMAGING SCIENCES, 2019, 12 (01) : 87 - 118