Resolving Protein Conformational Plasticity and Substrate Binding via Machine Learning

被引:2
|
作者
Ahalawat, Navjeet [2 ]
Sahil, Mohammad [1 ]
Mondal, Jagannath [1 ]
机构
[1] Tata Inst Fundamental Res, Ctr Interdisciplinary Sci, Hyderabad 500046, India
[2] CCS Haryana Agr Univ, Coll Biotechnol, Dept Bioinformat & Computat Biol, Hisar 125004, Haryana, India
关键词
MARKOV STATE MODELS; LIGAND-BINDING; RECOGNITION; METADYNAMICS; DETERMINANTS; ENSEMBLES; DYNAMICS; KINETICS;
D O I
10.1021/acs.jctc.2c00932
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
A long-standing target in elucidating the biomolecular recognition process is the identification of binding competent conformations of the receptor protein. However, protein conformational plasticity and the stochastic nature of the recognition processes often preclude the assignment of a specific protein conformation to an individual ligand-bound pose. Here, we demonstrate that a computational framework coined as RF-TICAMD, which integrates an ensemble decision-tree-based Random Forest (RF) machine learning (ML) technique with an unsupervised dimension reduction approach time-structured independent component analysis (TICA), provides an efficient and unambiguous solution toward resolving protein conformational plasticity and the substrate binding process. In particular, we consider multimicrosecond-long molecular dynamics (MD) simulation trajectories of a ligand recognition process in solvent inaccessible cavities of archetypal proteins T4 lysozyme and cytochrome P450cam. We show that in a scenario in which clear correspondence between protein conformation and binding-competent macrostates could not be obtained via an unsupervised dimension reduction approach, an a priori decision-tree-based supervised classification of the simulated recognition trajectories via RF would help characterize key amino acid residue pairs of the protein that are deemed sensitive for ligand binding. A subsequent unsupervised dimensional reduction of the selected residue pairs via TICA would then delineate a conformational landscape of protein which is able to demarcate ligand-bound poses from unbound ones. The proposed RF-TICA-MD approach is shown to be data agnostic and found to be robust when using other ML-based classification methods such as XGBoost. As a promising spinoff of the protocol, the framework is found to be capable of identifying distal protein locations which would be allosterically important for ligand binding and would characterize their roles in recognition pathways. A Python implementation of a proposed ML workflow is available in GitHub https://github.com/navjeet0211/rf-tica-md.
引用
收藏
页码:2644 / 2657
页数:14
相关论文
共 50 条
  • [41] Conformational Plasticity of Centrin 1 from Toxoplasma gondii in Binding to the Centrosomal Protein SFI1
    Bombardi, Luca
    Favretto, Filippo
    Pedretti, Marco
    Conter, Carolina
    Dominici, Paola
    Astegno, Alessandra
    BIOMOLECULES, 2022, 12 (08)
  • [42] Protein conformational plasticity and complex ligand-binding kinetics explored by atomistic simulations and Markov models
    Nuria Plattner
    Frank Noé
    Nature Communications, 6
  • [43] Resolving Quantitative MRI Model Degeneracy with Machine Learning via Training Data Distribution Design
    Guerreri, Michele
    Epstein, Sean
    Azadbakht, Hojjat
    Zhang, Hui
    INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2023, 2023, 13939 : 3 - 14
  • [44] Predicting binding energies of astrochemically relevant molecules via machine learning
    Villadsen, T.
    Ligterink, N.F.W.
    Andersen, M.
    Astronomy and Astrophysics, 2022, 666
  • [45] Predicting binding energies of astrochemically relevant molecules via machine learning
    Villadsen, T.
    Ligterink, N. F. W.
    Andersen, M.
    ASTRONOMY & ASTROPHYSICS, 2022, 666
  • [46] An Overview on Protein Fold Classification via Machine Learning Approach
    Tian, Xiaoyu
    Chen, Daozheng
    Gao, Jun
    CURRENT PROTEOMICS, 2018, 15 (02) : 85 - 98
  • [47] Conformational plasticity of glycogenin and its maltosaccharide substrate during glycogen biogenesis
    Chaikuad, Apirat
    Froese, D. Sean
    Berridge, Georgina
    von Delft, Frank
    Oppermann, Udo
    Yue, Wyatt W.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (52) : 21028 - 21033
  • [48] Conformational Changes in Protein Binding Processes
    Voss, Bela
    Grubmueller, Helmut
    BIOPHYSICAL JOURNAL, 2014, 106 (02) : 652A - 652A
  • [49] Conformational selection in protein binding and function
    Weikl, Thomas R.
    Paul, Fabian
    PROTEIN SCIENCE, 2014, 23 (11) : 1508 - 1518
  • [50] Conformational stabilization of an engineered binding protein
    Wahlberg, Elisabet
    Härd, Torleif
    Journal of the American Chemical Society, 2006, 128 (23): : 7651 - 7660