Exploring the Sequence-based Prediction of Folding Initiation Sites in Proteins

被引:29
|
作者
Raimondi, Daniele [1 ,2 ,3 ,4 ]
Orlando, Gabriele [1 ,2 ,3 ,4 ]
Pancsa, Rita [5 ]
Khan, Taushif [1 ,3 ,4 ]
Vranken, Wim F. [1 ,3 ,4 ]
机构
[1] ULB VUB, Interuniv Inst Bioinformat Brussels, BC Bldg,6th Floor,CP 263, B-1050 Brussels, Belgium
[2] Univ Libre Bruxelles, Machine Learning Grp, Blvd Triomphe,CP 212, B-1050 Brussels, Belgium
[3] VIB, Ctr Struct Biol, Pleinlaan 2, B-1050 Brussels, Belgium
[4] Vrije Univ Brussel, Struct Biol Brussels, Pleinlaan 2, B-1050 Brussels, Belgium
[5] MRC, Lab Mol Biol, Francis Crick Ave,Cambridge Biomed Campus, Cambridge CB2 0QH, England
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
基金
比利时弗兰德研究基金会;
关键词
AMINO-ACID-SEQUENCE; SECONDARY STRUCTURE; HYDROGEN-EXCHANGE; EXTRACTING INFORMATION; PREFERRED CONFORMATION; TRYPTOPHAN SYNTHASE; ALPHA-SUBUNIT; CONTACT ORDER; MECHANISM; DYNAMICS;
D O I
10.1038/s41598-017-08366-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein folding is a complex process that can lead to disease when it fails. Especially poorly understood are the very early stages of protein folding, which are likely defined by intrinsic local interactions between amino acids close to each other in the protein sequence. We here present EFoldMine, a method that predicts, from the primary amino acid sequence of a protein, which amino acids are likely involved in early folding events. The method is based on early folding data from hydrogen deuterium exchange (HDX) data from NMR pulsed labelling experiments, and uses backbone and sidechain dynamics as well as secondary structure propensities as features. The EFoldMine predictions give insights into the folding process, as illustrated by a qualitative comparison with independent experimental observations. Furthermore, on a quantitative proteome scale, the predicted early folding residues tend to become the residues that interact the most in the folded structure, and they are often residues that display evolutionary covariation. The connection of the EFoldMine predictions with both folding pathway data and the folded protein structure suggests that the initial statistical behavior of the protein chain with respect to local structure formation has a lasting effect on its subsequent states.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Sequence-based evaluation of promoter context for prediction of transcription start sites in Arabidopsis and rice
    Hiratsuka, Tosei
    Makita, Yuko
    Yamamoto, Yoshiharu Y.
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [32] Sequence-based evaluation of promoter context for prediction of transcription start sites in Arabidopsis and rice
    Tosei Hiratsuka
    Yuko Makita
    Yoshiharu Y. Yamamoto
    [J]. Scientific Reports, 12
  • [33] Smoothing a rugged protein folding landscape by sequence-based redesign
    Porebski, Benjamin T.
    Keleher, Shani
    Hollins, Jeffrey J.
    Nickson, Adrian A.
    Marijanovic, Emilia M.
    Borg, Natalie A.
    Costa, Mauricio G. S.
    Pearce, Mary A.
    Dai, Weiwen
    Zhu, Liguang
    Irving, James A.
    Hoke, David E.
    Kass, Itamar
    Whisstock, James C.
    Bottomley, Stephen P.
    Webb, Geoffrey I.
    McGowan, Sheena
    Buckle, Ashley M.
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [34] Sequence-Based Prediction of DNA-Binding Residues in Proteins with Conservation and Correlation Information
    Ma, Xin
    Guo, Jing
    Liu, Hong-De
    Xie, Jian-Ming
    Sun, Xiao
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (06) : 1766 - 1775
  • [35] Smoothing a rugged protein folding landscape by sequence-based redesign
    Benjamin T. Porebski
    Shani Keleher
    Jeffrey J. Hollins
    Adrian A. Nickson
    Emilia M. Marijanovic
    Natalie A. Borg
    Mauricio G. S. Costa
    Mary A. Pearce
    Weiwen Dai
    Liguang Zhu
    James A. Irving
    David E. Hoke
    Itamar Kass
    James C. Whisstock
    Stephen P. Bottomley
    Geoffrey I. Webb
    Sheena McGowan
    Ashley M. Buckle
    [J]. Scientific Reports, 6
  • [36] EBGW_OMP: A Sequence-based Method for Accurate Prediction of Outer Membrane Proteins
    Zou, Lingyun
    Ni, Qingshan
    Hu, Fuquan
    [J]. 2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2014,
  • [37] A Sequence-Based Prediction Model of Vesicular Transport Proteins Using Ensemble Deep Learning
    Le, Nguyen Quoc Khanh
    Kha, Quang Hien
    [J]. 14TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, BCB 2023, 2023,
  • [38] Sequence-Based Prediction with Feature Representation Learning and Biological Function Analysis of Channel Proteins
    Chen, Zheng
    Jiao, Shihu
    Zhao, Da
    Hesham, Abd El-Latif
    Zou, Quan
    Xu, Lei
    Sun, Mingai
    Zhang, Lijun
    [J]. FRONTIERS IN BIOSCIENCE-LANDMARK, 2022, 27 (06):
  • [39] ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning
    Jiao, Shihu
    Chen, Zheng
    Zhang, Lichao
    Zhou, Xun
    Shi, Lei
    [J]. AMINO ACIDS, 2022, 54 (05) : 799 - 809
  • [40] ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning
    Shihu Jiao
    Zheng Chen
    Lichao Zhang
    Xun Zhou
    Lei Shi
    [J]. Amino Acids, 2022, 54 : 799 - 809