Direct generation of protein conformational ensembles via machine learning

被引:57
|
作者
Janson, Giacomo [1 ]
Valdes-Garcia, Gilberto [1 ]
Heo, Lim [1 ]
Feig, Michael [1 ]
机构
[1] Michigan State Univ, Dept Biochem & Mol Biol, E Lansing, MI 48824 USA
基金
美国国家卫生研究院;
关键词
MOLECULAR-DYNAMICS; BIOMOLECULAR SIMULATION; DISORDERED PROTEINS; ENZYME;
D O I
10.1038/s41467-023-36443-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Computational methods to study protein structural dynamics are a powerful tool in life sciences but are computationally expensive. Here, the authors show that machine learning can be used to efficiently generate protein conformational ensembles and test their method on intrinsically disordered peptides. Dynamics and conformational sampling are essential for linking protein structure to biological function. While challenging to probe experimentally, computer simulations are widely used to describe protein dynamics, but at significant computational costs that continue to limit the systems that can be studied. Here, we demonstrate that machine learning can be trained with simulation data to directly generate physically realistic conformational ensembles of proteins without the need for any sampling and at negligible computational cost. As a proof-of-principle we train a generative adversarial network based on a transformer architecture with self-attention on coarse-grained simulations of intrinsically disordered peptides. The resulting model, idpGAN, can predict sequence-dependent coarse-grained ensembles for sequences that are not present in the training set demonstrating that transferability can be achieved beyond the limited training data. We also retrain idpGAN on atomistic simulation data to show that the approach can be extended in principle to higher-resolution conformational ensemble generation.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Direct generation of protein conformational ensembles via machine learning
    Giacomo Janson
    Gilberto Valdes-Garcia
    Lim Heo
    Michael Feig
    Nature Communications, 14
  • [2] Machine Learning Generation of Dynamic Protein Conformational Ensembles
    Zheng, Li-E
    Barethiya, Shrishti
    Nordquist, Erik
    Chen, Jianhan
    MOLECULES, 2023, 28 (10):
  • [3] Molecular Insights from Conformational Ensembles via Machine Learning
    Fleetwood, Oliver
    Kasimova, Marina A.
    Westerlund, Annie M.
    Delemotte, Lucie
    BIOPHYSICAL JOURNAL, 2020, 118 (03) : 765 - 780
  • [4] Determining protein conformational ensembles by combining machine learning and small-angle scattering
    Lidbrink, Samuel Eriksson
    Haloi, Nandan
    Howard, Rebecca J.
    Lindahl, Erik R.
    BIOPHYSICAL JOURNAL, 2024, 123 (03) : 431A - 431A
  • [5] Resolving Protein Conformational Plasticity and Substrate Binding via Machine Learning
    Ahalawat, Navjeet
    Sahil, Mohammad
    Mondal, Jagannath
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2023, 19 (09) : 2644 - 2657
  • [6] Characterizing Ensembles of Platelike Particles via Machine Learning
    Jaeggi, Anna
    Rajagopalan, Ashwin Kumar
    Morari, Manfred
    Mazzotti, Marco
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2021, 60 (01) : 473 - 483
  • [7] EnGens: a computational framework for generation and analysis of representative protein conformational ensembles
    Conev, Anja
    Rigo, Mauricio Menegatti
    Devaurs, Didier
    Fonseca, Andre Faustino
    Kalavadwala, Hussain
    de Freitas, Martiela Vaz
    Clementi, Cecilia
    Zanatta, Geancarlo
    Antunes, Dinler Amaral
    Kavraki, Lydia E.
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [8] Machine-learning-based methods to generate conformational ensembles of disordered proteins
    Taneja, Ishan
    Lasker, Keren
    BIOPHYSICAL JOURNAL, 2024, 123 (01) : 101 - 113
  • [9] Predicting Adverse Childhood Experiences via Machine Learning Ensembles
    Rao, Akash K.
    Trivedi, Gunjan Y.
    Bajpai, Anshika
    Chouhan, Gajraj Singh
    Trivedi, Riri G.
    Kumar, Anita
    Soundappan, Kathirvel
    Dutt, Varun
    Ramani, Hemalatha
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2023, 2023, : 773 - 779
  • [10] Sampling Conformational Ensembles of Highly Dynamic Proteins via Generative Deep Learning
    Ruzmetov, Talant
    Hung, Ta, I
    Jonnalagedda, Saisri Padmaja
    Chen, Si-han
    Fasihianifard, Parisa
    Guo, Zhefeng
    Bhanu, Bir
    Chang, Chia-en A.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2025, 65 (05) : 2487 - 2502