Resolving Protein Conformational Plasticity and Substrate Binding via Machine Learning

被引:2
|
作者
Ahalawat, Navjeet [2 ]
Sahil, Mohammad [1 ]
Mondal, Jagannath [1 ]
机构
[1] Tata Inst Fundamental Res, Ctr Interdisciplinary Sci, Hyderabad 500046, India
[2] CCS Haryana Agr Univ, Coll Biotechnol, Dept Bioinformat & Computat Biol, Hisar 125004, Haryana, India
关键词
MARKOV STATE MODELS; LIGAND-BINDING; RECOGNITION; METADYNAMICS; DETERMINANTS; ENSEMBLES; DYNAMICS; KINETICS;
D O I
10.1021/acs.jctc.2c00932
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
A long-standing target in elucidating the biomolecular recognition process is the identification of binding competent conformations of the receptor protein. However, protein conformational plasticity and the stochastic nature of the recognition processes often preclude the assignment of a specific protein conformation to an individual ligand-bound pose. Here, we demonstrate that a computational framework coined as RF-TICAMD, which integrates an ensemble decision-tree-based Random Forest (RF) machine learning (ML) technique with an unsupervised dimension reduction approach time-structured independent component analysis (TICA), provides an efficient and unambiguous solution toward resolving protein conformational plasticity and the substrate binding process. In particular, we consider multimicrosecond-long molecular dynamics (MD) simulation trajectories of a ligand recognition process in solvent inaccessible cavities of archetypal proteins T4 lysozyme and cytochrome P450cam. We show that in a scenario in which clear correspondence between protein conformation and binding-competent macrostates could not be obtained via an unsupervised dimension reduction approach, an a priori decision-tree-based supervised classification of the simulated recognition trajectories via RF would help characterize key amino acid residue pairs of the protein that are deemed sensitive for ligand binding. A subsequent unsupervised dimensional reduction of the selected residue pairs via TICA would then delineate a conformational landscape of protein which is able to demarcate ligand-bound poses from unbound ones. The proposed RF-TICA-MD approach is shown to be data agnostic and found to be robust when using other ML-based classification methods such as XGBoost. As a promising spinoff of the protocol, the framework is found to be capable of identifying distal protein locations which would be allosterically important for ligand binding and would characterize their roles in recognition pathways. A Python implementation of a proposed ML workflow is available in GitHub https://github.com/navjeet0211/rf-tica-md.
引用
收藏
页码:2644 / 2657
页数:14
相关论文
共 50 条
  • [21] SUBSTRATE INDUCED CONFORMATIONAL CHANGE OF GALACTOSE-BINDING PROTEIN OF E-COLI
    BOOS, W
    GORDON, A
    HALL, RE
    FEDERATION PROCEEDINGS, 1971, 30 (03) : 1062 - &
  • [22] Substrate Binding by γ-Secretase: Conformational Dynamics of the Enzyme Active Site and Substrate Recognition with an Example of the Amyloid Precursor Protein
    Nierzwicki, Lukasz Piotr
    Olewniczak, Michal
    Chodnicki, Pawel
    Czub, Jacek
    BIOPHYSICAL JOURNAL, 2019, 116 (03) : 372A - 373A
  • [23] Conformational plasticity of recombinant bovine prion protein
    V. B. Grigoriev
    S. L. Kalnov
    A. N. Pokidyshev
    S. M. Klimenko
    Doklady Biochemistry and Biophysics, 2010, 430 : 17 - 19
  • [24] Conformational plasticity of the Ebola virus matrix protein
    Radzimanowski, Jens
    Effantin, Gregory
    Weissenhorn, Winfried
    PROTEIN SCIENCE, 2014, 23 (11) : 1519 - 1527
  • [25] Conformational Plasticity and Ligand Binding of Bacterial Monoacylglycerol Lipase
    Rengachari, Srinivasan
    Aschauer, Philipp
    Schittmayer, Matthias
    Mayer, Nicole
    Gruber, Karl
    Breinbauer, Rolf
    Birner-Gruenberger, Ruth
    Dreveny, Ingrid
    Oberer, Monika
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2013, 288 (43) : 31093 - 31104
  • [26] Conformational plasticity of recombinant bovine prion protein
    Grigoriev, V. B.
    Kalnov, S. L.
    Pokidyshev, A. N.
    Klimenko, S. M.
    DOKLADY BIOCHEMISTRY AND BIOPHYSICS, 2010, 430 (01) : 17 - 19
  • [27] Functional plasticity in the substrate binding site of β-secretase
    Gorfe, AA
    Caflisch, A
    STRUCTURE, 2005, 13 (10) : 1487 - 1498
  • [28] Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble
    Audagnotto, Martina
    Czechtizky, Werngard
    De Maria, Leonardo
    Kack, Helena
    Papoian, Garegin
    Tornberg, Lars
    Tyrchan, Christian
    Ulander, Johan
    SCIENTIFIC REPORTS, 2022, 12 (01):
  • [29] Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble
    Martina Audagnotto
    Werngard Czechtizky
    Leonardo De Maria
    Helena Käck
    Garegin Papoian
    Lars Tornberg
    Christian Tyrchan
    Johan Ulander
    Scientific Reports, 12 (1)
  • [30] Machine learning methods for protein-protein binding affinity prediction in protein design
    Guo, Zhongliang
    Yamaguchi, Rui
    FRONTIERS IN BIOINFORMATICS, 2022, 2