Kernel methods for predicting protein-protein interactions

被引:375
|
作者
Ben-Hur, A [1 ]
Noble, WS
机构
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
关键词
D O I
10.1093/bioinformatics/bti1016
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Despite advances in high-throughput methods for discovering protein-protein interactions, the interaction networks of even well-studied model organisms are sketchy at best, highlighting the continued need for computational methods to help direct experimentalists in the search for novel interactions. Results: We present a kernel method for predicting protein-protein interactions using a combination of data sources, including protein sequences, Gene Ontology annotations, local properties of the network, and homologous interactions in other species. Whereas protein kernels proposed in the literature provide a similarity between single proteins, prediction of interactions requires a kernel between pairs of proteins. We propose a pairwise kernel that converts a kernel between single proteins into a kernel between pairs of proteins, and we illustrate the kernel's effectiveness in conjunction with a support vector machine classifier. Furthermore, we obtain improved performance by combining several sequence-based kernels based on k-mer frequency, motif and domain content and by further augmenting the pairwise sequence kernel with features that are based on other sources of data. We apply our method to predict physical interactions in yeast using data from the BIND database. At a false positive rate of 1% the classifier retrieves close to 80% of a set of trusted interactions. We thus demonstrate the ability of our method to make accurate predictions despite the sizeable fraction of false positives that are known to exist in interaction databases.
引用
收藏
页码:I38 / I46
页数:9
相关论文
共 50 条
  • [21] Predicting protein-protein interactions by association mining
    Kotlyar, M
    Jurisica, I
    [J]. INFORMATION SYSTEMS FRONTIERS, 2006, 8 (01) : 37 - 46
  • [22] The interactome: Predicting the protein-protein interactions in cells
    Plewczynski, Dariusz
    Ginalski, Krzysztof
    [J]. CELLULAR & MOLECULAR BIOLOGY LETTERS, 2009, 14 (01) : 1 - 22
  • [23] Predicting Protein-Protein Interactions by Association Mining
    [J]. Information Systems Frontiers, 2006, 8 : 37 - 47
  • [24] Information assessment on predicting protein-protein interactions
    Lin, N
    Wu, BL
    Jansen, R
    Gerstein, M
    Zhao, HY
    [J]. BMC BIOINFORMATICS, 2004, 5 (1)
  • [25] ProteinPrompt: a webserver for predicting protein-protein interactions
    Canzler, Sebastian
    Fischer, Markus
    Ulbricht, David
    Ristic, Nikola
    Hildebrand, Peter W.
    Staritzbichler, Rene
    [J]. BIOINFORMATICS ADVANCES, 2022, 2 (01):
  • [26] Modeling Protein-Protein Interface Interactions as a Means for Predicting Protein-Protein Interaction Partners
    Reyes, Vicente M.
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2009, 26 (06): : 873 - 873
  • [27] Data mining methods for protein-protein interactions
    Nafar, Zahra
    Golshani, Ashkan
    [J]. 2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 2090 - +
  • [28] Computational methods of analysis of protein-protein interactions
    Salwinski, L
    Eisenberg, D
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2003, 13 (03) : 377 - 382
  • [29] Predicting protein-protein interactions in E-coli using machine learning methods
    Goyal, Kshama
    Vidyasagar, M.
    [J]. PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007, : 2190 - 2195
  • [30] Community-wide evaluation of methods for predicting the effect of mutations on protein-protein interactions
    Moretti, Rocco
    Fleishman, Sarel J.
    Agius, Rudi
    Torchala, Mieczyslaw
    Bates, Paul A.
    Kastritis, Panagiotis L.
    Rodrigues, Joao P. G. L. M.
    Trellet, Mikael
    Bonvin, Alexandre M. J. J.
    Cui, Meng
    Rooman, Marianne
    Gillis, Dimitri
    Dehouck, Yves
    Moal, Iain
    Romero-Durana, Miguel
    Perez-Cano, Laura
    Pallara, Chiara
    Jimenez, Brian
    Fernandez-Recio, Juan
    Flores, Samuel
    Pacella, Michael
    Kilambi, Krishna Praneeth
    Gray, Jeffrey J.
    Popov, Petr
    Grudinin, Sergei
    Esquivel-Rodriguez, Juan
    Kihara, Daisuke
    Zhao, Nan
    Korkin, Dmitry
    Zhu, Xiaolei
    Demerdash, Omar N. A.
    Mitchell, Julie C.
    Kanamori, Eiji
    Tsuchiya, Yuko
    Nakamura, Haruki
    Lee, Hasup
    Park, Hahnbeom
    Seok, Chaok
    Sarmiento, Jamica
    Liang, Shide
    Teraguchi, Shusuke
    Standley, Daron M.
    Shimoyama, Hiromitsu
    Terashi, Genki
    Takeda-Shitaka, Mayuko
    Iwadate, Mitsuo
    Umeyama, Hideaki
    Beglov, Dmitri
    Hall, David R.
    Kozakov, Dima
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2013, 81 (11) : 1980 - 1987