An Ensemble Classifier with Random Projection for Predicting Protein-Protein Interactions Using Sequence and Evolutionary Information

Cited by: 14
Authors
Song, Xiao-Yu [1 ]
Chen, Zhan-Heng [2 ]
Sun, Xiang-Yang [1 ]
You, Zhu-Hong [2 ]
Li, Li-Ping [2 ]
Zhao, Yang [1 ]
Affiliations
[1] Lanzhou Jiaotong Univ, Sch Elect & Informat Engn, Lanzhou 730070, Gansu, Peoples R China
[2] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Xinjiang, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2018 / Vol. 8 / Issue 01
Funding
National Science Foundation (USA);
Keywords
protein-protein interactions; position-specific scoring matrix; random projection ensemble classifier; support vector machine; AMINO-ACID-SEQUENCES; MACHINES; DATABASE; SPACES;
DOI
10.3390/app8010089
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Identifying protein-protein interactions (PPIs) is crucial for understanding various biological processes in cells. Although high-throughput techniques generate large amounts of PPI data for various species, these data cover only a small fraction of the entire PPI network. Furthermore, these approaches are costly, time-consuming, and have a high error rate. Therefore, it is necessary to design computational methods for efficiently detecting PPIs. In this study, a random projection ensemble classifier (RPEC) was explored to identify novel PPIs using evolutionary information contained in protein amino acid sequences. The evolutionary information was obtained from a position-specific scoring matrix (PSSM) generated by PSI-BLAST. A novel feature fusion scheme was then developed by combining discrete cosine transform (DCT), fast Fourier transform (FFT), and singular value decomposition (SVD). Finally, using the random projection ensemble classifier, the performance of the presented approach was evaluated on Yeast, Human, and H. pylori PPI datasets with 5-fold cross-validation. Our approach achieved prediction accuracies of 95.64%, 96.59%, and 87.62%, respectively, outperforming other existing methods. Overall, the approach is promising and provides a practical and effective method for predicting novel PPIs.
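The ensemble step named in the abstract can be illustrated with a minimal sketch. This is not the authors' RPEC implementation: it assumes a nearest-centroid base learner and plain majority voting, since this record does not state the base classifier or any projection-selection rule. The names `RandomProjectionEnsemble`, `n_members`, and `dim_out` are hypothetical, and the toy input vectors merely stand in for fused DCT/FFT/SVD descriptors of a protein pair.

```python
import random


def random_projection(dim_in, dim_out, rng):
    """Gaussian random matrix (dim_out x dim_in), entries ~ N(0, 1/dim_out)."""
    scale = dim_out ** -0.5
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(dim_in)]
            for _ in range(dim_out)]


def project(matrix, x):
    """Map feature vector x into the lower-dimensional random subspace."""
    return [sum(w * v for w, v in zip(row, x)) for row in matrix]


def centroid(points):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def sq_dist(a, b):
    """Squared Euclidean distance."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


class RandomProjectionEnsemble:
    """Majority vote over nearest-centroid classifiers, each trained on a
    different random projection of the data (assumed base learner)."""

    def __init__(self, n_members=25, dim_out=3, seed=0):
        self.n_members = n_members
        self.dim_out = dim_out
        self.rng = random.Random(seed)
        self.members = []

    def fit(self, X, y):
        dim_in = len(X[0])
        self.members = []
        for _ in range(self.n_members):
            R = random_projection(dim_in, self.dim_out, self.rng)
            Z = [project(R, x) for x in X]
            c0 = centroid([z for z, label in zip(Z, y) if label == 0])
            c1 = centroid([z for z, label in zip(Z, y) if label == 1])
            self.members.append((R, c0, c1))
        return self

    def predict_one(self, x):
        votes = 0
        for R, c0, c1 in self.members:
            z = project(R, x)
            if sq_dist(z, c1) < sq_dist(z, c0):
                votes += 1
        # Label 1 (interacting) only if a strict majority of members agree.
        return 1 if 2 * votes > len(self.members) else 0

    def predict(self, X):
        return [self.predict_one(x) for x in X]


# Toy usage: two well-separated synthetic classes in 10 dimensions.
data_rng = random.Random(1)
X_train = ([[data_rng.gauss(0.0, 0.1) for _ in range(10)] for _ in range(20)]
           + [[5.0 + data_rng.gauss(0.0, 0.1) for _ in range(10)] for _ in range(20)])
y_train = [0] * 20 + [1] * 20
model = RandomProjectionEnsemble().fit(X_train, y_train)
print(model.predict([[0.0] * 10, [5.0] * 10]))
```

Each member sees the data through a different low-dimensional projection, so individual errors tend to be decorrelated and the vote is more stable than any single projected classifier. In practice the accuracy would be estimated with 5-fold cross-validation, as the abstract describes, rather than on hand-picked points.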
Pages: 15
Related papers
50 in total
  • [1] An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information
    You, Zhu-Hong
    Huang, Wen-Zhun
    Zhang, Shanwen
    Huang, Yu-An
    Yu, Chang-Qing
    Li, Li-Ping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 809 - 817
  • [2] Predicting Protein-Protein Interactions via Random Ferns with Evolutionary Matrix Representation
    Li, Yang
    Wang, Zheng
    You, Zhu-Hong
    Li, Li-Ping
    Hu, Xuegang
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [3] Predicting protein-protein interactions by a supervised learning classifier
    Huang, Y
    Frishman, D
    Muchnik, I
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2004, 28 (04) : 291 - 301
  • [4] Predicting Protein-Protein Interactions based on ensemble classifiers
    Zhou, Zheng-Rong
    Song, Xiao-Feng
    Wang, Ming-Hao
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (06) : 1464 - 1467
  • [5] An Ensemble Classifier to Predict Protein-Protein Interactions by Combining PSSM-based Evolutionary Information with Local Binary Pattern Model
    Li, Yang
    Li, Li-Ping
    Wang, Lei
    Yu, Chang-Qing
    Wang, Zheng
    You, Zhu-Hong
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (14)
  • [6] Predicting Protein-Protein Interactions Using Sequence and Network Information via Variational Graph Autoencoder
    Luo, Xin
    Wang, Liwei
    Hu, Pengwei
    Hu, Lun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (05) : 3182 - 3194
  • [7] Efficient prediction of protein-protein interactions using sequence information
    Guarracino, Mario R.
    Nebbia, Adriano
    Manna, Valeria
    Chinchuluun, Altannar
    Pardalos, Panos M.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2010), 2010 : 677 - 682
  • [8] Information assessment on predicting protein-protein interactions
    Nan Lin
    Baolin Wu
    Ronald Jansen
    Mark Gerstein
    Hongyu Zhao
    BMC Bioinformatics, 5
  • [9] Information assessment on predicting protein-protein interactions
    Lin, N
    Wu, BL
    Jansen, R
    Gerstein, M
    Zhao, HY
    BMC BIOINFORMATICS, 2004, 5 (1)
  • [10] An Ensemble Classifier with Random Projection for Predicting Multi-label Protein Subcellular Localization
    Wan, Shibiao
    Mak, Man-Wai
    Zhang, Bai
    Wang, Yue
    Kung, Sun-Yuan
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013