Streamlining NMR Chemical Shift Predictions for Intrinsically Disordered Proteins: Design of Ensembles with Dimensionality Reduction and Clustering

被引:0
|
作者
Bakker, Michael J. [1 ]
Gaffour, Amina [1 ]
Juhas, Martin [1 ,2 ]
Zapletal, Vojtech [1 ]
Stosek, Jakub [1 ,3 ]
Bratholm, Lars A. [4 ]
Precechtelova, Jana Pavlikova [1 ]
机构
[1] Charles Univ Prague, Fac Pharm Hradec Kralove, Hradec Kralove 50005, Czech Republic
[2] Univ Hradec Kralove, Fac Sci, Dept Chem, Hradec Kralove 50003, Czech Republic
[3] Masaryk Univ, Fac Sci, Dept Chem, Kotlarska 2, Brno 61137, Czech Republic
[4] Univ Bristol, Sch Chem, Bristol BS8 1TS, England
关键词
MOLECULAR-DYNAMICS SIMULATIONS; GAUSSIAN-TYPE BASIS; ORBITAL METHODS; FORCE-FIELD; TYROSINE-HYDROXYLASE; FUNCTIONAL THEORY; BASIS-SETS; BINDING; PHOSPHORYLATION; ACCURACY;
D O I
10.1021/acs.jcim.4c00809
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
By merging advanced dimensionality reduction (DR) and clustering algorithm (CA) techniques, our study advances the sampling procedure for predicting NMR chemical shifts (CS) in intrinsically disordered proteins (IDPs), making a significant leap forward in the field of protein analysis/modeling. We enhance NMR CS sampling by generating clustered ensembles that accurately reflect the different properties and phenomena encapsulated by the IDP trajectories. This investigation critically assessed different rapid CS predictors, both neural network (e.g., Sparta+ and ShiftX2) and database-driven (ProCS-15), and highlighted the need for more advanced quantum calculations and the subsequent need for more tractable-sized conformational ensembles. Although neural network CS predictors outperformed ProCS-15 for all atoms, all tools showed poor agreement with H-N CSs, and the neural network CS predictors were unable to capture the influence of phosphorylated residues, highly relevant for IDPs. This study also addressed the limitations of using direct clustering with collective variables, such as the widespread implementation of the GROMOS algorithm. Clustered ensembles (CEs) produced by this algorithm showed poor performance with chemical shifts compared to sequential ensembles (SEs) of similar size. Instead, we implement a multiscale DR and CA approach and explore the challenges and limitations of applying these algorithms to obtain more robust and tractable CEs. The novel feature of this investigation is the use of solvent-accessible surface area (SASA) as one of the fingerprints for DR alongside previously investigated alpha carbon distance/angles or phi/psi dihedral angles. The ensembles produced with SASA tSNE DR produced CEs better aligned with the experimental CS of between 0.17 and 0.36 r(2) (0.18-0.26 ppm) depending on the system and replicate. Furthermore, this technique produced CEs with better agreement than traditional SEs in 85.7% of all ensemble sizes. This study investigates the quality of ensembles produced based on different input features, comparing latent spaces produced by linear vs nonlinear DR techniques and a novel integrated silhouette score scanning protocol for tSNE DR.
引用
收藏
页码:6542 / 6556
页数:15
相关论文
共 35 条
  • [1] Constructing Structure Ensembles of Intrinsically Disordered Proteins from Chemical Shift Data
    Gong, Huichao
    Zhang, Sai
    Wang, Jiangdian
    Gong, Haipeng
    Zeng, Jianyang
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY (RECOMB 2015), 2015, 9029 : 108 - 121
  • [2] Constructing Structure Ensembles of Intrinsically Disordered Proteins from Chemical Shift Data
    Gong, Huichao
    Zhang, Sai
    Wang, Jiangdian
    Gong, Haipeng
    Zeng, Jianyang
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2016, 23 (05) : 300 - 310
  • [3] Improving the chemical shift dispersion of multidimensional NMR spectra of intrinsically disordered proteins
    Bermel, Wolfgang
    Bruix, Marta
    Felli, Isabella C.
    Kumar, Vasantha M. V.
    Pierattelli, Roberta
    Serrano, Soraya
    JOURNAL OF BIOMOLECULAR NMR, 2013, 55 (03) : 231 - 237
  • [4] Improving the chemical shift dispersion of multidimensional NMR spectra of intrinsically disordered proteins
    Wolfgang Bermel
    Marta Bruix
    Isabella C. Felli
    Vasantha Kumar M. V.
    Roberta Pierattelli
    Soraya Serrano
    Journal of Biomolecular NMR, 2013, 55 : 231 - 237
  • [5] NMR Spectroscopic Studies of the Conformational Ensembles of Intrinsically Disordered Proteins
    Kurzbach, Dennis
    Kontaxis, Georg
    Coudevylle, Nicolas
    Konrat, Robert
    INTRINSICALLY DISORDERED PROTEINS STUDIED BY NMR SPECTROSCOPY, 2015, 870 : 149 - 185
  • [6] Determination of statistical ensembles of intrinsically disordered proteins using NMR measurements
    Vendruscolo, Michele
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 252
  • [7] Conformational ensembles explain NMR spectra of frozen intrinsically disordered proteins
    Kragelj, Jaka
    Dumarieh, Rania
    Xiao, Yiling
    Frederick, Kendra K. K.
    PROTEIN SCIENCE, 2023, 32 (05)
  • [8] Generating NMR chemical shift assignments of intrinsically disordered proteins using carbon-detected NMR methods
    Sahu, Debashish
    Bastidas, Monique
    Showalter, Scott A.
    ANALYTICAL BIOCHEMISTRY, 2014, 449 : 17 - 25
  • [9] Deep learning for intrinsically disordered proteins: From improved predictions to deciphering conformational ensembles
    Erdos, Gabor
    Dosztanyi, Zsuzsanna
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2024, 89
  • [10] Quantum Chemical Calculations of NMR Chemical Shifts in Phosphorylated Intrinsically Disordered Proteins
    Precechtelova, Jana Pavlikova
    Mladek, Arnost
    Zapletal, Vojtech
    Hritz, Jozef
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2019, 15 (10) : 5642 - 5658