Biological Features for Sequence-Based Prediction of Protein Stability Changes upon Amino Acid Substitutions

被引:2
|
作者
Teng, Shaolei [1 ]
Srivastava, Anand K. [1 ]
Wang, Liangjiang [1 ]
机构
[1] Clemson Univ, Dept Biochem & Genet, Clemson, SC 29634 USA
关键词
protein stabiligy prediction; biological feature selection; support vector machines; machine learning; RESIDUES; SCALE; HYDROPHOBICITY; ACCURACY; DNA;
D O I
10.1109/IJCBS.2009.101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein destabilization is a common mechanism by which amino acid substitutions cause human diseases. In this study, a new machine learning method has been developed for sequence-based prediction of protein stability changes upon single amino acid substitutions. Support vector machines were trained with data from experimental studies on the free energy change of protein stability upon mutations. To construct accurate classifiers, twenty biological features were examined for input vector encoding. It was shown that classifier performance varied significantly by the use of different features. The most accurate classifier was constructed using a combination of several biological features. This classifier achieved an overall accuracy of 82.24% with 75.24% sensitivity and 85.36% specificity. Predictive results at this level of accuracy may be used in human genetic studies to distinguish between deleterious and tolerant alterations in disease candidate genes.
引用
下载
收藏
页码:201 / 206
页数:6
相关论文
共 50 条
  • [21] Sequence-Based Prediction of Transmembrane Protein Crystallization Propensity
    Qizhi Zhu
    Lihua Wang
    Ruyu Dai
    Wei Zhang
    Wending Tang
    Yannan Bin
    Zeliang Wang
    Junfeng Xia
    Interdisciplinary Sciences: Computational Life Sciences, 2021, 13 : 693 - 702
  • [22] Sequence-Based Prediction of Transmembrane Protein Crystallization Propensity
    Zhu, Qizhi
    Wang, Lihua
    Dai, Ruyu
    Zhang, Wei
    Tang, Wending
    Bin, Yannan
    Wang, Zeliang
    Xia, Junfeng
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (04) : 693 - 702
  • [23] SOLpro: accurate sequence-based prediction of protein solubility
    Magnan, Christophe N.
    Randall, Arlo
    Baldi, Pierre
    BIOINFORMATICS, 2009, 25 (17) : 2200 - 2207
  • [24] Sequence-based prediction of protein binding mode landscapes
    Horvath, Attila
    Miskei, Marton
    Ambrusl, Viktor
    Vendruscolo, Michele
    Fuxreiter, Monika
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (05)
  • [25] Prediction of unfolded segments in a protein sequence based on amino acid composition
    Coeytaux, K
    Poupon, A
    BIOINFORMATICS, 2005, 21 (09) : 1891 - 1900
  • [26] Protein location prediction using atomic composition and global features of the amino acid sequence
    Cherian, Betsy Sheena
    Nair, Achuthsankar S.
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2010, 391 (04) : 1670 - 1674
  • [27] On the use of structure and sequence-based features for protein classification and retrieval
    Keith Marsolo
    Srinivasan Parthasarathy
    Knowledge and Information Systems, 2008, 14 : 59 - 80
  • [28] On the use of structure and sequence-based features for protein classification and retrieval
    Marsolo, Keith
    Parthasarathy, Srinivasan
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 394 - +
  • [29] Recent developments of sequence-based prediction of protein-protein interactions
    Murakami, Yoichi
    Mizuguchi, Kenji
    BIOPHYSICAL REVIEWS, 2022, 14 (06) : 1393 - 1411
  • [30] Prediction of collagen stability from amino acid sequence
    Persikov, AV
    Ramshaw, JAM
    Brodsky, B
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (19) : 19343 - 19349