Resolving Protein Conformational Plasticity and Substrate Binding via Machine Learning

被引:2
|
作者
Ahalawat, Navjeet [2 ]
Sahil, Mohammad [1 ]
Mondal, Jagannath [1 ]
机构
[1] Tata Inst Fundamental Res, Ctr Interdisciplinary Sci, Hyderabad 500046, India
[2] CCS Haryana Agr Univ, Coll Biotechnol, Dept Bioinformat & Computat Biol, Hisar 125004, Haryana, India
关键词
MARKOV STATE MODELS; LIGAND-BINDING; RECOGNITION; METADYNAMICS; DETERMINANTS; ENSEMBLES; DYNAMICS; KINETICS;
D O I
10.1021/acs.jctc.2c00932
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
A long-standing target in elucidating the biomolecular recognition process is the identification of binding competent conformations of the receptor protein. However, protein conformational plasticity and the stochastic nature of the recognition processes often preclude the assignment of a specific protein conformation to an individual ligand-bound pose. Here, we demonstrate that a computational framework coined as RF-TICAMD, which integrates an ensemble decision-tree-based Random Forest (RF) machine learning (ML) technique with an unsupervised dimension reduction approach time-structured independent component analysis (TICA), provides an efficient and unambiguous solution toward resolving protein conformational plasticity and the substrate binding process. In particular, we consider multimicrosecond-long molecular dynamics (MD) simulation trajectories of a ligand recognition process in solvent inaccessible cavities of archetypal proteins T4 lysozyme and cytochrome P450cam. We show that in a scenario in which clear correspondence between protein conformation and binding-competent macrostates could not be obtained via an unsupervised dimension reduction approach, an a priori decision-tree-based supervised classification of the simulated recognition trajectories via RF would help characterize key amino acid residue pairs of the protein that are deemed sensitive for ligand binding. A subsequent unsupervised dimensional reduction of the selected residue pairs via TICA would then delineate a conformational landscape of protein which is able to demarcate ligand-bound poses from unbound ones. The proposed RF-TICA-MD approach is shown to be data agnostic and found to be robust when using other ML-based classification methods such as XGBoost. As a promising spinoff of the protocol, the framework is found to be capable of identifying distal protein locations which would be allosterically important for ligand binding and would characterize their roles in recognition pathways. A Python implementation of a proposed ML workflow is available in GitHub https://github.com/navjeet0211/rf-tica-md.
引用
收藏
页码:2644 / 2657
页数:14
相关论文
共 50 条
  • [1] Direct generation of protein conformational ensembles via machine learning
    Janson, Giacomo
    Valdes-Garcia, Gilberto
    Heo, Lim
    Feig, Michael
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [2] Direct generation of protein conformational ensembles via machine learning
    Giacomo Janson
    Gilberto Valdes-Garcia
    Lim Heo
    Michael Feig
    Nature Communications, 14
  • [3] Markov State Models Reconcile Conformational Plasticity of GTPase with Its Substrate Binding Event
    Dandekar, Bhupendra R.
    Ahalawat, Navjeet
    Sinha, Suman
    Mondal, Jagannath
    JACS AU, 2023, 3 (06): : 1728 - 1741
  • [4] Redefining the protein kinase conformational space with machine learning
    Rahman, Rayees
    Ung, Peter
    Schlessinger, Avner
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257
  • [5] Redefining the Protein Kinase Conformational Space with Machine Learning
    Ung, Peter Man-Un
    Rahman, Rayees
    Schlessinger, Avner
    BIOPHYSICAL JOURNAL, 2019, 116 (03) : 58A - 59A
  • [6] Protein conformational flexibility prediction using machine learning
    Trott, Oleg
    Siggers, Ken
    Rost, Burkhard
    Palmer, Arthur G., III
    JOURNAL OF MAGNETIC RESONANCE, 2008, 192 (01) : 37 - 47
  • [7] Machine Learning Generation of Dynamic Protein Conformational Ensembles
    Zheng, Li-E
    Barethiya, Shrishti
    Nordquist, Erik
    Chen, Jianhan
    MOLECULES, 2023, 28 (10):
  • [8] Redefining the Protein Kinase Conformational Space with Machine Learning
    Ung, Peter Man-Un
    Rahman, Rayees
    Schlessinger, Avner
    CELL CHEMICAL BIOLOGY, 2018, 25 (07): : 916 - +
  • [9] Resolving large conformational change pathways through machine learning and enhanced sampling simulations
    Pang, Yui Tik
    Gumbart, James C.
    BIOPHYSICAL JOURNAL, 2022, 121 (03) : 445A - 445A
  • [10] The conformational plasticity of protein kinases
    Huse, M
    Kuriyan, J
    CELL, 2002, 109 (03) : 275 - 282