Resolving Protein Conformational Plasticity and Substrate Binding via Machine Learning

被引:2
|
作者
Ahalawat, Navjeet [2 ]
Sahil, Mohammad [1 ]
Mondal, Jagannath [1 ]
机构
[1] Tata Inst Fundamental Res, Ctr Interdisciplinary Sci, Hyderabad 500046, India
[2] CCS Haryana Agr Univ, Coll Biotechnol, Dept Bioinformat & Computat Biol, Hisar 125004, Haryana, India
关键词
MARKOV STATE MODELS; LIGAND-BINDING; RECOGNITION; METADYNAMICS; DETERMINANTS; ENSEMBLES; DYNAMICS; KINETICS;
D O I
10.1021/acs.jctc.2c00932
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
A long-standing target in elucidating the biomolecular recognition process is the identification of binding competent conformations of the receptor protein. However, protein conformational plasticity and the stochastic nature of the recognition processes often preclude the assignment of a specific protein conformation to an individual ligand-bound pose. Here, we demonstrate that a computational framework coined as RF-TICAMD, which integrates an ensemble decision-tree-based Random Forest (RF) machine learning (ML) technique with an unsupervised dimension reduction approach time-structured independent component analysis (TICA), provides an efficient and unambiguous solution toward resolving protein conformational plasticity and the substrate binding process. In particular, we consider multimicrosecond-long molecular dynamics (MD) simulation trajectories of a ligand recognition process in solvent inaccessible cavities of archetypal proteins T4 lysozyme and cytochrome P450cam. We show that in a scenario in which clear correspondence between protein conformation and binding-competent macrostates could not be obtained via an unsupervised dimension reduction approach, an a priori decision-tree-based supervised classification of the simulated recognition trajectories via RF would help characterize key amino acid residue pairs of the protein that are deemed sensitive for ligand binding. A subsequent unsupervised dimensional reduction of the selected residue pairs via TICA would then delineate a conformational landscape of protein which is able to demarcate ligand-bound poses from unbound ones. The proposed RF-TICA-MD approach is shown to be data agnostic and found to be robust when using other ML-based classification methods such as XGBoost. As a promising spinoff of the protocol, the framework is found to be capable of identifying distal protein locations which would be allosterically important for ligand binding and would characterize their roles in recognition pathways. A Python implementation of a proposed ML workflow is available in GitHub https://github.com/navjeet0211/rf-tica-md.
引用
收藏
页码:2644 / 2657
页数:14
相关论文
共 50 条
  • [31] Protein binding site fingerprinting for activity screening in machine learning
    Bergman, Bastiaan
    Stafford, Kate
    Bernard, Denzil
    Schroedl, Stefan
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258
  • [32] Ranking Protein-Protein Binding Using Evolutionary Information and Machine Learning
    Farhoodi, Roshanak
    Akbal-Delibas, Bahar
    Haspel, Nurit
    ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 667 - 672
  • [33] Resolving Code Review Comments with Machine Learning
    Frommgen, Alexander
    Austin, Jacob
    Choy, Peter
    Ghelani, Nimesh
    Kharatyan, Lera
    Surita, Gabriela
    Khrapko, Elena
    Lamblin, Pascal
    Manzagol, Pierre-Antoine
    Revaj, Marcus
    Tabachnyk, Maxim
    Tarlow, Daniel
    Villela, Kevin
    Zheng, Daniel
    Chandra, Satish
    Maniatis, Petros
    2024 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE, ICSE-SEIP 2024, 2024, : 204 - 215
  • [34] Conformational coupling of the sialic acid TRAP transporter HiSiaQM with its substrate binding protein HiSiaP
    Martin F. Peter
    Jan A. Ruland
    Yeojin Kim
    Philipp Hendricks
    Niels Schneberger
    Jan Peter Siebrasse
    Gavin H. Thomas
    Ulrich Kubitscheck
    Gregor Hagelueken
    Nature Communications, 15
  • [35] Conformational coupling of the sialic acid TRAP transporter HiSiaQM with its substrate binding protein HiSiaP
    Peter, Martin F.
    Ruland, Jan A.
    Kim, Yeojin
    Hendricks, Philipp
    Schneberger, Niels
    Siebrasse, Jan Peter
    Thomas, Gavin H.
    Kubitscheck, Ulrich
    Hagelueken, Gregor
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [36] Protein substrate binding induces conformational changes in the chaperonin GroEL -: A suggested mechanism for unfoldase activity
    Hammarström, P
    Persson, M
    Owenius, R
    Lindgren, M
    Carlsson, U
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (30) : 22832 - 22838
  • [37] Monitoring conformational rearrangements in the substrate-binding site of a membrane transport protein by mass spectrometry
    Weinglass, A
    Whitelegge, JP
    Faull, KF
    Kaback, HR
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2004, 279 (40) : 41858 - 41865
  • [38] Conformational Plasticity of Hepatitis B Core Protein Spikes Promotes Peptide Binding Independent of the Secretion Phenotype
    Makbul, Cihan
    Khayenko, Vladimir
    Maric, Hans Michael
    Boettcher, Bettina
    MICROORGANISMS, 2021, 9 (05)
  • [39] Protein conformational plasticity and complex ligand-binding kinetics explored by atomistic simulations and Markov models
    Plattner, Nuria
    Noe, Frank
    NATURE COMMUNICATIONS, 2015, 6
  • [40] Timesaving for Conformational Analysis by Machine Learning
    Sakiyama, Hiroshi
    JOURNAL OF COMPUTER CHEMISTRY-JAPAN, 2019, 18 (03) : 150 - 151