Prediction of Protein-Protein Interaction with Pairwise Kernel Support Vector Machine

Cited by: 49
Authors
Zhang, Shao-Wu [1 ,2 ]
Hao, Li-Yang [1 ]
Zhang, Ting-He [1 ]
Affiliations
[1] Northwestern Polytech Univ, Coll Automat, Xian 710072, Peoples R China
[2] Minist Educ, Key Lab Informat Fus Technol, Xian 710072, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
amino acid distance frequency; amino acid index distribution; protein-protein interaction; pairwise kernel function; support vector machine; AMINO-ACID-COMPOSITION; SUBCELLULAR LOCATION; SEQUENCES; CLASSIFICATION; INFORMATION; PARAMETERS; NETWORKS;
DOI
10.3390/ijms15023220
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification codes
071010 ; 081704 ;
Abstract
Protein-protein interactions (PPIs) play a key role in many cellular processes. Unfortunately, the experimental methods currently used to identify PPIs are both time-consuming and expensive. These obstacles could be overcome by developing computational approaches to predict PPIs. Here, we report two methods of amino acid feature extraction for representing protein sequences: (i) distance frequency with PCA dimensionality reduction (DFPCA) and (ii) amino acid index distribution (AAID). To obtain robust and reliable results for PPI prediction, a pairwise kernel function and support vector machines (SVMs) were employed so that the prediction does not depend on the order in which the feature vectors of the two proteins are concatenated. The highest prediction accuracies of AAID and DFPCA were 94% and 93.96%, respectively, in the 10-fold cross-validation (10CV) test, and the results obtained with the pairwise radial basis kernel function are considerably better than those obtained with the standard radial basis kernel function. Overall, the PPI prediction tool, termed PPI-PKSVM, which is freely available at http://159.226.118.31/PPI/index.html, promises to become useful in such areas as bio-analysis and drug development.
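The order-independence that the abstract attributes to the pairwise kernel can be illustrated with a small sketch. The abstract does not give the kernel's formula, so the code below assumes one common symmetric construction, K((a,b),(c,d)) = k(a,c)k(b,d) + k(a,d)k(b,c), with k a radial basis function; the function names, the `gamma` value, and the toy feature vectors are all hypothetical and are not taken from PPI-PKSVM.

```python
import math

def rbf(x, y, gamma=0.5):
    """Standard RBF kernel between two feature vectors (gamma is an assumed value)."""
    sq_dist = sum((xi - yi) ** 2 for xi, yi in zip(x, y))
    return math.exp(-gamma * sq_dist)

def pairwise_rbf(pair1, pair2, gamma=0.5):
    """Assumed symmetric pairwise kernel:
    K((a, b), (c, d)) = k(a, c) * k(b, d) + k(a, d) * k(b, c)."""
    a, b = pair1
    c, d = pair2
    return (rbf(a, c, gamma) * rbf(b, d, gamma)
            + rbf(a, d, gamma) * rbf(b, c, gamma))

def concat_rbf(pair1, pair2, gamma=0.5):
    """Plain RBF on concatenated feature vectors; sensitive to the order within a pair."""
    return rbf(pair1[0] + pair1[1], pair2[0] + pair2[1], gamma)

# Toy feature vectors standing in for per-protein descriptors (hypothetical values).
p, q = [0.1, 0.9], [0.8, 0.2]
r, s = [0.2, 0.7], [0.6, 0.3]

# Pairwise kernel: swapping the two proteins in a pair leaves the value unchanged.
pairwise_same_order = pairwise_rbf((p, q), (r, s))
pairwise_swapped = pairwise_rbf((q, p), (r, s))

# Concatenation-based kernel: swapping the proteins changes the value.
concat_same_order = concat_rbf((p, q), (r, s))
concat_swapped = concat_rbf((q, p), (r, s))

print(math.isclose(pairwise_same_order, pairwise_swapped))  # order does not matter
print(math.isclose(concat_same_order, concat_swapped))      # order matters
```

A kernel of this kind could in principle be passed to an SVM implementation as a precomputed Gram matrix over protein pairs, though whether PPI-PKSVM is implemented that way is not stated in this record.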
Pages: 3220 - 3233
Page count: 14
Related papers
50 in total
  • [21] Protein-protein interaction based on pairwise similarity
    Zaki, Nazar
    Lazarova-Molnar, Sanja
    El-Hajj, Wassim
    Campbell, Piers
    BMC BIOINFORMATICS, 2009, 10
  • [22] Some Remarks on Prediction of Protein-Protein Interaction with Machine Learning
    Zhang, Shao-Wu
    Wei, Ze-Gang
    MEDICINAL CHEMISTRY, 2015, 11 (03) : 254 - 264
  • [23] Predicting protein-protein binding sites by a support vector machine approach
    Ou, Rui
    Zhang, Juhua
    2007 IEEE/ICME INTERNATIONAL CONFERENCE ON COMPLEX MEDICAL ENGINEERING, VOLS 1-4, 2007, : 1621 - 1625
  • [24] A Coupled Similarity Kernel for Pairwise Support Vector Machine
    Li, Mu
    Li, Jinjiu
    Ou, Yuming
    Luo, Dan
    AGENTS AND DATA MINING INTERACTION (ADMI 2014), 2015, 9145 : 114 - 123
  • [25] Prediction of Protein Thermostability with Support Vector Machine
    Ai, Haixin
    Zhang, Jikuan
    Zhang, Li
    Deng, Fangbo
    Zhao, Jian
    Liu, Hongsheng
    8TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2014), 2014, : 63 - 68
  • [26] PPI-Detect: A Support Vector Machine Model for Sequence-Based Prediction of Protein-Protein Interactions
    Romero-Molina, Sandra
    Ruiz-Blanco, Yasser B.
    Harms, Mirja
    Muench, Jan
    Sanchez-Garcia, Elsa
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2019, 40 (11) : 1233 - 1242
  • [27] Machine learning on protein-protein interaction prediction: models, challenges and trends
    Tang, Tao
    Zhang, Xiaocai
    Liu, Yuansheng
    Peng, Hui
    Zheng, Binshuang
    Yin, Yanlin
    Zeng, Xiangxiang
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (02)
  • [28] Prediction of protein-protein interaction inhibitors by chemoinformatics and machine learning methods
    Neugebauer, Alexander
    Hartmann, Rolf W.
    Klein, Christian D.
    JOURNAL OF MEDICINAL CHEMISTRY, 2007, 50 (19) : 4665 - 4668
  • [29] A Study of Network-based Kernel Methods on Protein-Protein Interaction for Protein Functions Prediction
    Ching, Wai-Ki
    Li, Limin
    Chan, Yat-Ming
    Mamitsuka, Hiroshi
    OPTIMIZATION AND SYSTEMS BIOLOGY, 2009, 11 : 25 - +
  • [30] Human protein-protein interaction prediction
    Mark D McDowall
    Michelle S Scott
    Geoffrey J Barton
    BMC Bioinformatics, 11 (Suppl 10)