Streamlining NMR Chemical Shift Predictions for Intrinsically Disordered Proteins: Design of Ensembles with Dimensionality Reduction and Clustering

被引:0
|
作者
Bakker, Michael J. [1 ]
Gaffour, Amina [1 ]
Juhas, Martin [1 ,2 ]
Zapletal, Vojtech [1 ]
Stosek, Jakub [1 ,3 ]
Bratholm, Lars A. [4 ]
Precechtelova, Jana Pavlikova [1 ]
机构
[1] Charles Univ Prague, Fac Pharm Hradec Kralove, Hradec Kralove 50005, Czech Republic
[2] Univ Hradec Kralove, Fac Sci, Dept Chem, Hradec Kralove 50003, Czech Republic
[3] Masaryk Univ, Fac Sci, Dept Chem, Kotlarska 2, Brno 61137, Czech Republic
[4] Univ Bristol, Sch Chem, Bristol BS8 1TS, England
关键词
MOLECULAR-DYNAMICS SIMULATIONS; GAUSSIAN-TYPE BASIS; ORBITAL METHODS; FORCE-FIELD; TYROSINE-HYDROXYLASE; FUNCTIONAL THEORY; BASIS-SETS; BINDING; PHOSPHORYLATION; ACCURACY;
D O I
10.1021/acs.jcim.4c00809
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
By merging advanced dimensionality reduction (DR) and clustering algorithm (CA) techniques, our study advances the sampling procedure for predicting NMR chemical shifts (CS) in intrinsically disordered proteins (IDPs), making a significant leap forward in the field of protein analysis/modeling. We enhance NMR CS sampling by generating clustered ensembles that accurately reflect the different properties and phenomena encapsulated by the IDP trajectories. This investigation critically assessed different rapid CS predictors, both neural network (e.g., Sparta+ and ShiftX2) and database-driven (ProCS-15), and highlighted the need for more advanced quantum calculations and the subsequent need for more tractable-sized conformational ensembles. Although neural network CS predictors outperformed ProCS-15 for all atoms, all tools showed poor agreement with H-N CSs, and the neural network CS predictors were unable to capture the influence of phosphorylated residues, highly relevant for IDPs. This study also addressed the limitations of using direct clustering with collective variables, such as the widespread implementation of the GROMOS algorithm. Clustered ensembles (CEs) produced by this algorithm showed poor performance with chemical shifts compared to sequential ensembles (SEs) of similar size. Instead, we implement a multiscale DR and CA approach and explore the challenges and limitations of applying these algorithms to obtain more robust and tractable CEs. The novel feature of this investigation is the use of solvent-accessible surface area (SASA) as one of the fingerprints for DR alongside previously investigated alpha carbon distance/angles or phi/psi dihedral angles. The ensembles produced with SASA tSNE DR produced CEs better aligned with the experimental CS of between 0.17 and 0.36 r(2) (0.18-0.26 ppm) depending on the system and replicate. Furthermore, this technique produced CEs with better agreement than traditional SEs in 85.7% of all ensemble sizes. This study investigates the quality of ensembles produced based on different input features, comparing latent spaces produced by linear vs nonlinear DR techniques and a novel integrated silhouette score scanning protocol for tSNE DR.
引用
收藏
页码:6542 / 6556
页数:15
相关论文
共 35 条
  • [21] MD simulations of intrinsically disordered proteins with replica-averaged chemical shift restraints
    Fu, B.
    Camilloni, C.
    Cavalli, A.
    Vendruscolo, M.
    EUROPEAN BIOPHYSICS JOURNAL WITH BIOPHYSICS LETTERS, 2013, 42 : S188 - S188
  • [22] MD Simulations of Intrinsically Disordered Proteins with Replica-Averaged Chemical Shift Restraints
    Fu, Biao
    Kukic, Predrag
    Camilloni, Carlo
    Vendruscolo, Michele
    BIOPHYSICAL JOURNAL, 2014, 106 (02) : 481A - 481A
  • [23] Comparative analysis of NMR chemical shift predictions for proteins in the solid phase
    Seidel, Karsten
    Etzkorn, Manuel
    Schneider, Robert
    Ader, Christian
    Baldus, Marc
    SOLID STATE NUCLEAR MAGNETIC RESONANCE, 2009, 35 (04) : 235 - 242
  • [24] Using Chemical Shifts to Generate Structural Ensembles for Intrinsically Disordered Proteins with Converged Distributions of Secondary Structure
    Ytreberg, F. Marty
    Borcherds, Wade
    Wu, Hongwei
    Daughdrill, Gary W.
    BIOPHYSICAL JOURNAL, 2015, 108 (02) : 227A - 228A
  • [25] Linear discriminant analysis reveals hidden patterns in NMR chemical shifts of intrinsically disordered proteins
    Romero, Javier A.
    Putko, Paulina
    Urbanczyk, Mateusz
    Kazimierczuk, Krzysztof
    Zawadzka-Kazimierczuk, Anna
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (10)
  • [26] High-dimensionality 13C direct-detected NMR experiments for the automatic assignment of intrinsically disordered proteins
    Wolfgang Bermel
    Isabella C. Felli
    Leonardo Gonnelli
    Wiktor Koźmiński
    Alessandro Piai
    Roberta Pierattelli
    Anna Zawadzka-Kazimierczuk
    Journal of Biomolecular NMR, 2013, 57 : 353 - 361
  • [27] High-dimensionality 13C direct-detected NMR experiments for the automatic assignment of intrinsically disordered proteins
    Bermel, Wolfgang
    Felli, Isabella C.
    Gonnelli, Leonardo
    Kozminski, Wiktor
    Piai, Alessandro
    Pierattelli, Roberta
    Zawadzka-Kazimierczuk, Anna
    JOURNAL OF BIOMOLECULAR NMR, 2013, 57 (04) : 353 - 361
  • [28] NMR- based investigation of intrinsically disordered regions of modular proteins for tailored drug-design
    Tino, Angela Sofia
    Schiavina, Marco
    Quagliata, Michael
    Pierattelli, Roberta
    Papini, Anna Maria
    Felli, Isabella Caterina
    JOURNAL OF PEPTIDE SCIENCE, 2024, 30
  • [29] Using NMR Chemical Shifts to Determine Residue-Specific Secondary Structure Populations for Intrinsically Disordered Proteins
    Borcherds, Wade M.
    Daughdrill, Gary W.
    INTRINSICALLY DISORDERED PROTEINS, 2018, 611 : 101 - 136
  • [30] Incorporating 1H chemical shift determination into 13C-direct detected spectroscopy of intrinsically disordered proteins in solution
    O'Hare, Bernie
    Benesi, Alan J.
    Showalter, Scott A.
    JOURNAL OF MAGNETIC RESONANCE, 2009, 200 (02) : 354 - 358