Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding

被引:115
|
作者
Huang, Yu-An [1 ]
You, Zhu-Hong [2 ]
Chen, Xing [3 ]
Chan, Keith [4 ]
Luo, Xin [4 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Guangdong, Peoples R China
[2] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
[3] Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Kowloon 999077, Hong Kong, Peoples R China
来源
BMC BIOINFORMATICS | 2016年 / 17卷
基金
美国国家科学基金会;
关键词
FACE RECOGNITION; AUTO COVARIANCE; HYPERPLANES; COMPLEXES; RESIDUES; ENSEMBLE; PROFILE;
D O I
10.1186/s12859-016-1035-4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Proteins are the important molecules which participate in virtually every aspect of cellular function within an organism in pairs. Although high-throughput technologies have generated considerable protein-protein interactions (PPIs) data for various species, the processes of experimental methods are both time-consuming and expensive. In addition, they are usually associated with high rates of both false positive and false negative results. Accordingly, a number of computational approaches have been developed to effectively and accurately predict protein interactions. However, most of these methods typically perform worse when other biological data sources (e.g., protein structure information, protein domains, or gene neighborhoods information) are not available. Therefore, it is very urgent to develop effective computational methods for prediction of PPIs solely using protein sequence information. Results: In this study, we present a novel computational model combining weighted sparse representation based classifier (WSRC) and global encoding (GE) of amino acid sequence. Two kinds of protein descriptors, composition and transition, are extracted for representing each protein sequence. On the basis of such a feature representation, novel weighted sparse representation based classifier is introduced to predict protein interaction class. When the proposed method was evaluated with the PPIs data of S. cerevisiae, Human and H. pylori, it achieved high prediction accuracies of 96.82, 97.66 and 92.83 % respectively. Extensive experiments were performed for cross-species PPIs prediction and the prediction accuracies were also very promising. Conclusions: To further evaluate the performance of the proposed method, we then compared its performance with the method based on support vector machine (SVM). The results show that the proposed method achieved a significant improvement. Thus, the proposed method is a very efficient method to predict PPIs and may be a useful supplementary tool for future proteomics studies.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding
    Yu-An Huang
    Zhu-Hong You
    Xing Chen
    Keith Chan
    Xin Luo
    [J]. BMC Bioinformatics, 17
  • [2] Sequence-Based Prediction of Protein-Protein Interactions Using Ensemble Based Classifier Combined with Global Encoding in HIV (Human Immunodeficiency Virus)
    Lestari, D.
    Musti, M. I. S.
    Bustamam, A.
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2017 (ISCPMS2017), 2018, 2023
  • [3] Using Weighted Sparse Representation Model Combined with Discrete Cosine Transformation to Predict Protein-Protein Interactions from Protein Sequence
    Huang, Yu-An
    You, Zhu-Hong
    Gao, Xin
    Wong, Leon
    Wang, Lirong
    [J]. BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [4] Recent developments of sequence-based prediction of protein-protein interactions
    Murakami, Yoichi
    Mizuguchi, Kenji
    [J]. BIOPHYSICAL REVIEWS, 2022, 14 (06) : 1393 - 1411
  • [5] Prediction of Protein-Protein Interactions with Clustered Amino Acids and Weighted Sparse Representation
    Huang, Qiaoying
    You, Zhuhong
    Zhang, Xiaofeng
    Zhou, Yong
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2015, 16 (05) : 10855 - 10869
  • [6] Sequence-based prediction of protein-protein interactions by means of codon usage
    Najafabadi, Hamed Shateri
    Salavati, Reza
    [J]. GENOME BIOLOGY, 2008, 9 (05)
  • [7] Sequence-based prediction of protein-protein interactions by means of codon usage
    Hamed Shateri Najafabadi
    Reza Salavati
    [J]. Genome Biology, 9
  • [8] FCTP-WSRC: Protein-Protein Interactions Prediction via Weighted Sparse Representation Based Classification
    Kong, Meng
    Zhang, Yusen
    Xu, Da
    Chen, Wei
    Dehmer, Matthias
    [J]. FRONTIERS IN GENETICS, 2020, 11
  • [9] Prediction of Protein-Protein Interactions Using An Effective Sequence Based Combined Method
    Goktepe, Yunus Emre
    Kodaz, Halife
    [J]. NEUROCOMPUTING, 2018, 303 : 68 - 74
  • [10] PPI-Detect: A Support Vector Machine Model for Sequence-Based Prediction of Protein-Protein Interactions
    Romero-Molina, Sandra
    Ruiz-Blanco, Yasser B.
    Harms, Mirja
    Muench, Jan
    Sanchez-Garcia, Elsa
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2019, 40 (11) : 1233 - 1242