Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations

被引:10
|
作者
Savojardo, Castrense [1 ,2 ]
Fariselli, Piero [1 ,2 ]
Martelli, Pier Luigi [2 ,3 ]
Casadio, Rita [2 ,3 ]
机构
[1] Univ Bologna, Dept Comp Sci & Engn, I-41029 Bologna, Italy
[2] Univ Bologna, Biocomp Grp, I-40126 Bologna, Italy
[3] CIRI Life Sci & Hlth Technol, Dept Biol, I-40129 Bologna, Italy
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
RECURSIVE NEURAL-NETWORKS; FEATURE VECTORS; INFORMATION; CYSTEINES; BONDS; STATE;
D O I
10.1186/1471-2105-14-S1-S10
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Recently, information derived by correlated mutations in proteins has regained relevance for predicting protein contacts. This is due to new forms of mutual information analysis that have been proven to be more suitable to highlight direct coupling between pairs of residues in protein structures and to the large number of protein chains that are currently available for statistical validation. It was previously discussed that disulfide bond topology in proteins is also constrained by correlated mutations. Results: In this paper we exploit information derived from a corrected mutual information analysis and from the inverse of the covariance matrix to address the problem of the prediction of the topology of disulfide bonds in Eukaryotes. Recently, we have shown that Support Vector Regression (SVR) can improve the prediction for the disulfide connectivity patterns. Here we show that the inclusion of the correlated mutation information increases of 5 percentage points the SVR performance (from 54% to 59%). When this approach is used in combination with a method previously developed by us and scoring at the state of art in predicting both location and topology of disulfide bonds in Eukaryotes (DisLocate), the per-protein accuracy is 38%, 2 percentage points higher than that previously obtained. Conclusions: In this paper we show that the inclusion of information derived from correlated mutations can improve the performance of the state of the art methods for predicting disulfide connectivity patterns in Eukaryotic proteins. Our analysis also provides support to the notion that improving methods to extract evolutionary information from multiple sequence alignments greatly contributes to the scoring performance of predictors suited to detect relevant features from protein chains.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations
    Castrense Savojardo
    Piero Fariselli
    Pier Luigi Martelli
    Rita Casadio
    [J]. BMC Bioinformatics, 14
  • [2] Predicting disulfide bond connectivity in proteins by correlated mutations analysis
    Rubinstein, Rotem
    Fiser, Andras
    [J]. BIOINFORMATICS, 2008, 24 (04) : 498 - 504
  • [3] Prediction of disulfide connectivity in proteins with support vector machine
    Hsuan-Liang Liu
    Shih-Chieh Chen
    [J]. JOURNAL OF THE CHINESE INSTITUTE OF CHEMICAL ENGINEERS, 2007, 38 (01): : 63 - 70
  • [4] Prediction of the Bonding State of Cysteine Residues in Proteins with Machine-Learning Methods
    Savojardo, Castrense
    Fariselli, Piero
    Martelli, Pier Luigi
    Shukla, Priyank
    Casadio, Rita
    [J]. COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2011, 6685 : 98 - 111
  • [5] Prediction of disulfide connectivity in proteins
    Fariselli, P
    Casadio, R
    [J]. BIOINFORMATICS, 2001, 17 (10) : 957 - 964
  • [6] Prediction of Hemolytic Toxicity for Saponins by Machine-Learning Methods
    Zheng, Suqing
    Wang, Yibing
    Liu, Hongmei
    Chang, Wenping
    Xu, Yong
    Lin, Fu
    [J]. CHEMICAL RESEARCH IN TOXICOLOGY, 2019, 32 (06) : 1014 - 1026
  • [7] Machine-learning methods for stream water temperature prediction
    Feigl, Moritz
    Lebiedzinski, Katharina
    Herrnegger, Mathew
    Schulz, Karsten
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2021, 25 (05) : 2951 - 2977
  • [8] Prediction of Settling Velocity of Microplastics by Multiple Machine-Learning Methods
    Leng, Zequan
    Cao, Lu
    Gao, Yun
    Hou, Yadong
    Wu, Di
    Huo, Zhongyan
    Zhao, Xizeng
    [J]. WATER, 2024, 16 (13)
  • [9] Risk estimation and risk prediction using machine-learning methods
    Kruppa, Jochen
    Ziegler, Andreas
    Koenig, Inke R.
    [J]. HUMAN GENETICS, 2012, 131 (10) : 1639 - 1654
  • [10] Risk estimation and risk prediction using machine-learning methods
    Jochen Kruppa
    Andreas Ziegler
    Inke R. König
    [J]. Human Genetics, 2012, 131 : 1639 - 1654