Variational embedding of protein folding simulations using Gaussian mixture variational autoencoders

Citations: 11
Authors
Ghorbani, Mahdi [1, 2]
Prasad, Samarjeet [1 ]
Klauda, Jeffery B. [2 ]
Brooks, Bernard R. [1 ]
Affiliations
[1] NHLBI, Lab Computat Biol, NIH, Bethesda, MD 20824 USA
[2] Univ Maryland, Dept Chem & Biomol Engn, College Pk, MD 20742 USA
Source
JOURNAL OF CHEMICAL PHYSICS | 2021 / Vol. 155 / Issue 19
Funding
National Science Foundation (USA);
Keywords
MARKOV STATE MODELS; MOLECULAR-DYNAMICS SIMULATIONS; TRP-CAGE; KINETICS;
DOI
10.1063/5.0069708
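Chinese Library Classification (CLC) number
O64 [Physical chemistry (theoretical chemistry), chemical physics];
Discipline classification code
070304; 081704;
Abstract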
Conformational sampling of biomolecules using molecular dynamics simulations often produces a large amount of high dimensional data that makes it difficult to interpret using conventional analysis techniques. Dimensionality reduction methods are thus required to extract useful and relevant information. Here, we devise a machine learning method, Gaussian mixture variational autoencoder (GMVAE), that can simultaneously perform dimensionality reduction and clustering of biomolecular conformations in an unsupervised way. We show that GMVAE can learn a reduced representation of the free energy landscape of protein folding with highly separated clusters that correspond to the metastable states during folding. Since GMVAE uses a mixture of Gaussians as its prior, it can directly acknowledge the multi-basin nature of the protein folding free energy landscape. To make the model end-to-end differentiable, we use a Gumbel-softmax distribution. We test the model on three long-timescale protein folding trajectories and show that GMVAE embedding resembles the folding funnel with folded states down the funnel and unfolded states outside the funnel path. Additionally, we show that the latent space of GMVAE can be used for kinetic analysis and Markov state models built on this embedding produce folding and unfolding timescales that are in close agreement with other rigorous dynamical embeddings such as time independent component analysis.
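The abstract notes that a Gumbel-softmax distribution is used to keep the model end-to-end differentiable. As an illustration only (this is not the authors' code), the following minimal sketch shows how a relaxed one-hot cluster assignment can be drawn from categorical logits. The function name `gumbel_softmax_sample`, the example logits, and the temperature of 0.5 are assumptions chosen for the example, not values from the paper.

```python
import math
import random


def gumbel_softmax_sample(logits, temperature=0.5, rng=random):
    """Draw one relaxed one-hot sample over clusters (hypothetical helper).

    logits: unnormalized log-probabilities, one per cluster.
    temperature: must be > 0; smaller values push the sample toward one-hot.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    noisy = []
    for logit in logits:
        # Clamp u away from 0 and 1 so both logarithms stay finite.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        gumbel = -math.log(-math.log(u))  # Gumbel(0, 1) noise
        noisy.append((logit + gumbel) / temperature)
    # Numerically stable softmax over the perturbed, scaled logits.
    peak = max(noisy)
    exps = [math.exp(v - peak) for v in noisy]
    total = sum(exps)
    return [e / total for e in exps]


# Illustrative logits for three clusters (assumed values).
rng = random.Random(0)
sample = gumbel_softmax_sample([2.0, 0.5, -1.0], temperature=0.5, rng=rng)
```

The returned list is a point on the probability simplex rather than a hard label, which is what allows gradients to flow through the cluster assignment in an autodiff framework. Taking the argmax of many such samples recovers draws from the categorical distribution defined by the logits.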
Pages: 11
Related papers
  • [21] SPEECH DEREVERBERATION USING VARIATIONAL AUTOENCODERS
    Baby, Deepak
    Bourlard, Herve
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5784 - 5788
  • [22] Energy disaggregation using variational autoencoders
    Langevin, Antoine
    Carbonneau, Marc-Andre
    Cheriet, Mohamed
    Gagnon, Ghyslain
    ENERGY AND BUILDINGS, 2022, 254
  • [23] Generating functional protein variants with variational autoencoders
    Hawkins-Hooker, Alex
    Depardieu, Florence
    Baur, Sebastien
    Couairon, Guillaume
    Chen, Arthur
    Bikard, David
    PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (02)
  • [24] Gaussian Process Modeling of Approximate Inference Errors for Variational Autoencoders
    Kim, Minyoung
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 244 - 253
  • [25] An optimized method for variational autoencoders based on Gaussian cloud model
    Dai, Jin
    Guo, Qiuyan
    Wang, Guoyin
    Liu, Xiao
    Zheng, Zhifang
    INFORMATION SCIENCES, 2023, 645
  • [27] Bayesian mixture variational autoencoders for multi-modal learning
    Liao, Keng-Te
    Huang, Bo-Wei
    Yang, Chih-Chun
    Lin, Shou-De
    MACHINE LEARNING, 2022, 111 (12) : 4329 - 4357
  • [28] Towards learning transferable embeddings for protein conformations using Variational Autoencoders
    Albu, Alexandra-Ioana
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 10 - 19
  • [29] Neural Variational Gaussian Mixture Topic Model
    Tang, Kun
    Huang, Heyan
    Shi, Xuewen
    Mao, Xian-Ling
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (04)
  • [30] ROBUST UNSUPERVISED AUDIO-VISUAL SPEECH ENHANCEMENT USING A MIXTURE OF VARIATIONAL AUTOENCODERS
    Sadeghi, Mostafa
    Alameda-Pineda, Xavier
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7534 - 7538