Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins

被引:29
|
作者
Raimondi, Daniele [1 ,2 ,3 ,4 ]
Orlando, Gabriele [1 ,2 ,3 ,4 ]
Pancsa, Rita [5 ]
Khan, Taushif [1 ,3 ,4 ]
Vranken, Wim F. [1 ,3 ,4 ]
机构
[1] ULB VUB, Interuniv Inst Bioinformat Brussels, BC Bldg,6th Floor,CP 263, B-1050 Brussels, Belgium
[2] Univ Libre Bruxelles, Machine Learning Grp, Blvd Triomphe,CP 212, B-1050 Brussels, Belgium
[3] VIB, Ctr Struct Biol, Pleinlaan 2, B-1050 Brussels, Belgium
[4] Vrije Univ Brussel, Struct Biol Brussels, Pleinlaan 2, B-1050 Brussels, Belgium
[5] MRC, Lab Mol Biol, Francis Crick Ave,Cambridge Biomed Campus, Cambridge CB2 0QH, England
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
基金
比利时弗兰德研究基金会;
关键词
AMINO-ACID-SEQUENCE; SECONDARY STRUCTURE; HYDROGEN-EXCHANGE; EXTRACTING INFORMATION; PREFERRED CONFORMATION; TRYPTOPHAN SYNTHASE; ALPHA-SUBUNIT; CONTACT ORDER; MECHANISM; DYNAMICS;
D O I
10.1038/s41598-017-08366-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein folding is a complex process that can lead to disease when it fails. Especially poorly understood are the very early stages of protein folding, which are likely defined by intrinsic local interactions between amino acids close to each other in the protein sequence. We here present EFoldMine, a method that predicts, from the primary amino acid sequence of a protein, which amino acids are likely involved in early folding events. The method is based on early folding data from hydrogen deuterium exchange (HDX) data from NMR pulsed labelling experiments, and uses backbone and sidechain dynamics as well as secondary structure propensities as features. The EFoldMine predictions give insights into the folding process, as illustrated by a qualitative comparison with independent experimental observations. Furthermore, on a quantitative proteome scale, the predicted early folding residues tend to become the residues that interact the most in the folded structure, and they are often residues that display evolutionary covariation. The connection of the EFoldMine predictions with both folding pathway data and the folded protein structure suggests that the initial statistical behavior of the protein chain with respect to local structure formation has a lasting effect on its subsequent states.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
    Daniele Raimondi
    Gabriele Orlando
    Rita Pancsa
    Taushif Khan
    Wim F. Vranken
    [J]. Scientific Reports, 7
  • [2] Author Correction: Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins
    Daniele Raimondi
    Gabriele Orlando
    Rita Pancsa
    Taushif Khan
    Wim F. Vranken
    [J]. Scientific Reports, 9
  • [3] Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins (vol 7, 8826, 2017)
    Raimondi, Daniele
    Orlando, Gabriele
    Pancsa, Rita
    Khan, Taushif
    Vranken, Wim F.
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [4] Sequence-based feature prediction and annotation of proteins
    Agnieszka S Juncker
    Lars J Jensen
    Andrea Pierleoni
    Andreas Bernsel
    Michael L Tress
    Peer Bork
    Gunnar von Heijne
    Alfonso Valencia
    Christos A Ouzounis
    Rita Casadio
    Søren Brunak
    [J]. Genome Biology, 10
  • [5] Sequence-Based Prediction of Metamorphic Behavior in Proteins
    Chen, Nanhao
    Das, Madhurima
    LiWang, Andy
    Wang, Lee-Ping
    [J]. BIOPHYSICAL JOURNAL, 2020, 119 (07) : 1380 - 1390
  • [6] Sequence-based feature prediction and annotation of proteins
    Juncker, Agnieszka S.
    Jensen, Lars J.
    Pierleoni, Andrea
    Bernsel, Andreas
    Tress, Michael L.
    Bork, Peer
    von Heijne, Gunnar
    Valencia, Alfonso
    Ouzounis, Christos A.
    Casadio, Rita
    Brunak, Soren
    [J]. GENOME BIOLOGY, 2009, 10 (02): : 206
  • [7] Sequence-based prediction of DNA-binding sites on DNA-binding proteins
    Gou, Z.
    Hwang, S.
    Kuznetsov, B., I
    [J]. PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 268 - +
  • [8] Improved sequence-based prediction of interaction sites in α-helical transmembrane proteins by deep learning
    Sun, Jianfeng
    Frishman, Dmitrij
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 1512 - 1530
  • [9] Sequence-Based Prediction of Type III Secreted Proteins
    Arnold, Roland
    Brandmaier, Stefan
    Kleine, Frederick
    Tischler, Patrick
    Heinz, Eva
    Behrens, Sebastian
    Niinikoski, Antti
    Mewes, Hans-Werner
    Horn, Matthias
    Rattei, Thomas
    [J]. PLOS PATHOGENS, 2009, 5 (04)
  • [10] ThermoFinder: A sequence-based thermophilic proteins prediction framework
    Yu, Han
    Luo, Xiaozhou
    [J]. INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2024, 270