Prediction of Protein-Protein Interactions with Clustered Amino Acids and Weighted Sparse Representation

被引:23
|
作者
Huang, Qiaoying [1 ]
You, Zhuhong [2 ]
Zhang, Xiaofeng [1 ]
Zhou, Yong [2 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Shenzhen 518055, Peoples R China
[2] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Peoples R China
基金
美国国家科学基金会;
关键词
reduced amino acid alphabet; weighted sparse representation-based classification; protein-protein interactions; EVOLUTIONARY INFORMATION; FACE RECOGNITION; CHOUS PSEAAC; IDENTIFICATION; LOCALIZATION; SEQUENCES; GENERATE; ALPHABET; MODES;
D O I
10.3390/ijms160510855
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
With the completion of the Human Genome Project, bioscience has entered into the era of the genome and proteome. Therefore, protein-protein interactions (PPIs) research is becoming more and more important. Life activities and the protein-protein interactions are inseparable, such as DNA synthesis, gene transcription activation, protein translation, etc. Though many methods based on biological experiments and machine learning have been proposed, they all spent a long time to learn and obtained an imprecise accuracy. How to efficiently and accurately predict PPIs is still a big challenge. To take up such a challenge, we developed a new predictor by incorporating the reduced amino acid alphabet (RAAA) information into the general form of pseudo-amino acid composition (PseAAC) and with the weighted sparse representation-based classification (WSRC). The remarkable advantages of introducing the reduced amino acid alphabet is being able to avoid the notorious dimensionality disaster or overfitting problem in statistical prediction. Additionally, experiments have proven that our method achieved good performance in both a low- and high-dimensional feature space. Among all of the experiments performed on the PPIs data of Saccharomyces cerevisiae, the best one achieved 90.91% accuracy, 94.17% sensitivity, 87.22% precision and a 83.43% Matthews correlation coefficient (MCC) value. In order to evaluate the prediction ability of our method, extensive experiments are performed to compare with the state-of-the-art technique, support vector machine (SVM). The achieved results show that the proposed approach is very promising for predicting PPIs, and it can be a helpful supplement for PPIs prediction.
引用
收藏
页码:10855 / 10869
页数:15
相关论文
共 50 条
  • [1] FCTP-WSRC: Protein-Protein Interactions Prediction via Weighted Sparse Representation Based Classification
    Kong, Meng
    Zhang, Yusen
    Xu, Da
    Chen, Wei
    Dehmer, Matthias
    [J]. FRONTIERS IN GENETICS, 2020, 11
  • [2] Prediction of Protein-Protein Interactions from Protein Sequences by Combining MatPCA Feature Extraction Algorithms and Weighted Sparse Representation Models
    Wang, Zheng
    Li, Yang
    You, Zhu-Hong
    Li, Li-Ping
    Zhan, Xin-Ke
    Pan, Jie
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [3] Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding
    Yu-An Huang
    Zhu-Hong You
    Xing Chen
    Keith Chan
    Xin Luo
    [J]. BMC Bioinformatics, 17
  • [4] Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding
    Huang, Yu-An
    You, Zhu-Hong
    Chen, Xing
    Chan, Keith
    Luo, Xin
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [5] Improved protein-protein interactions prediction via weighted sparse representation model combining continuous wavelet descriptor and PseAA composition
    Huang, Yu-An
    You, Zhu-Hong
    Chen, Xing
    Yan, Gui-Ying
    [J]. BMC SYSTEMS BIOLOGY, 2016, 10
  • [6] Prediction of Protein-Protein Interaction By Metasample-Based Sparse Representation
    Du, Xiuquan
    Li, Xinrui
    Zhang, Hanqian
    Zhang, Yanping
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [7] Using Weighted Sparse Representation Model Combined with Discrete Cosine Transformation to Predict Protein-Protein Interactions from Protein Sequence
    Huang, Yu-An
    You, Zhu-Hong
    Gao, Xin
    Wong, Leon
    Wang, Lirong
    [J]. BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [8] Predicting protein-protein interactions by weighted pseudo amino acid composition
    Goktepe, Yunus Emre
    Ilhan, Ilhan
    Kahramanli, Sirzat
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 15 (03) : 272 - 290
  • [9] Improving the prediction of yeast protein function using weighted protein-protein interactions
    Ahmed, Khaled S.
    Saloma, Nahed H.
    Kadah, Yasser M.
    [J]. THEORETICAL BIOLOGY AND MEDICAL MODELLING, 2011, 8
  • [10] Prediction of Protein-Protein Interactions from Sequences using a Correlation Matrix of the Physicochemical Properties of Amino Acids
    Kopoin, Charlemagne N'Diffon
    Atiampo, Armand Kodjo
    N'Guessan, Behou Gerard
    Babri, Michel
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (03): : 41 - 47