Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins

被引:29
|
作者
Raimondi, Daniele [1 ,2 ,3 ,4 ]
Orlando, Gabriele [1 ,2 ,3 ,4 ]
Pancsa, Rita [5 ]
Khan, Taushif [1 ,3 ,4 ]
Vranken, Wim F. [1 ,3 ,4 ]
机构
[1] ULB VUB, Interuniv Inst Bioinformat Brussels, BC Bldg,6th Floor,CP 263, B-1050 Brussels, Belgium
[2] Univ Libre Bruxelles, Machine Learning Grp, Blvd Triomphe,CP 212, B-1050 Brussels, Belgium
[3] VIB, Ctr Struct Biol, Pleinlaan 2, B-1050 Brussels, Belgium
[4] Vrije Univ Brussel, Struct Biol Brussels, Pleinlaan 2, B-1050 Brussels, Belgium
[5] MRC, Lab Mol Biol, Francis Crick Ave,Cambridge Biomed Campus, Cambridge CB2 0QH, England
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
基金
比利时弗兰德研究基金会;
关键词
AMINO-ACID-SEQUENCE; SECONDARY STRUCTURE; HYDROGEN-EXCHANGE; EXTRACTING INFORMATION; PREFERRED CONFORMATION; TRYPTOPHAN SYNTHASE; ALPHA-SUBUNIT; CONTACT ORDER; MECHANISM; DYNAMICS;
D O I
10.1038/s41598-017-08366-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein folding is a complex process that can lead to disease when it fails. Especially poorly understood are the very early stages of protein folding, which are likely defined by intrinsic local interactions between amino acids close to each other in the protein sequence. We here present EFoldMine, a method that predicts, from the primary amino acid sequence of a protein, which amino acids are likely involved in early folding events. The method is based on early folding data from hydrogen deuterium exchange (HDX) data from NMR pulsed labelling experiments, and uses backbone and sidechain dynamics as well as secondary structure propensities as features. The EFoldMine predictions give insights into the folding process, as illustrated by a qualitative comparison with independent experimental observations. Furthermore, on a quantitative proteome scale, the predicted early folding residues tend to become the residues that interact the most in the folded structure, and they are often residues that display evolutionary covariation. The connection of the EFoldMine predictions with both folding pathway data and the folded protein structure suggests that the initial statistical behavior of the protein chain with respect to local structure formation has a lasting effect on its subsequent states.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Mal-Lys: prediction of lysine malonylation sites in proteins integrated sequence-based features with mRMR feature selection
    Xu, Yan
    Ding, Ya-Xin
    Ding, Jun
    Wu, Ling-Yun
    Xue, Yu
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [22] Sequence-based methods for the design of unnatural backbones that manifest the tertiary folding patterns of natural proteins
    Horne, W. Seth
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2012, 244
  • [23] Sequence-based prediction of protein domains
    Liu, JF
    Rost, B
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (12) : 3522 - 3530
  • [24] Sequence-based prediction of variants’ effects
    Nicole Rusk
    [J]. Nature Methods, 2018, 15 : 571 - 571
  • [25] Sequence-based prediction of pathological mutations
    Ferrer-Costa, C
    Orozco, M
    de la Cruz, X
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 57 (04) : 811 - 819
  • [26] Sequence-Based Prediction of Plant Allergenic Proteins: Machine Learning Classification Approach
    Nedyalkova, Miroslava
    Vasighi, Mahdi
    Azmoon, Amirreza
    Naneva, Ludmila
    Simeonov, Vasil
    [J]. ACS OMEGA, 2023, : 3698 - 3704
  • [27] Sequence-based prediction of variants' effects
    Rusk, Nicole
    [J]. NATURE METHODS, 2018, 15 (07) : 571 - 571
  • [28] Sequence-Based Prediction of Protein Solubility
    Agostini, Federico
    Vendruscolo, Michele
    Tartaglia, Gian Gaetano
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2012, 421 (2-3) : 237 - 241
  • [29] Folding initiation sites and protein folding
    Dadlez, M
    [J]. ACTA BIOCHIMICA POLONICA, 1999, 46 (03) : 487 - 508
  • [30] Fast Prediction of Protein Methylation Sites Using a Sequence-Based Feature Selection Technique
    Wei, Leyi
    Xing, Pengwei
    Shi, Gaotao
    Ji, Zhiliang
    Zou, Quan
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (04) : 1264 - 1273