PSPEL: In Silico Prediction of Self-Interacting Proteins from Amino Acids Sequences Using Ensemble Learning

被引:43
|
作者
Li, Jian-Qiang [1 ]
You, Zhu-Hong [2 ]
Li, Xiao [2 ]
Ming, Zhong [1 ]
Chen, Xing [3 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Guangdong, Peoples R China
[2] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[3] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Self-interacting proteins; ensemble classifier; low rank; protein sequence; MAP; DATABASE; PROTEOME; FOREST;
D O I
10.1109/TCBB.2017.2649529
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Self interacting proteins (SIPs) play an important role in various aspects of the structural and functional organization of the cell. Detecting SIPs is one of the most important issues in current molecular biology. Although a large number of SIPs data has been generated by experimental methods, wet laboratory approaches are both time-consuming and costly. In addition, they yield high false negative and positive rates. Thus, there is a great need for in silico methods to predict SIPs accurately and efficiently. In this study, a new sequence-based method is proposed to predict SIPs. The evolutionary information contained in Position-Specific Scoring Matrix (PSSM) is extracted from of protein with known sequence. Then, features are fed to an ensemble classifier to distinguish the self-interacting and non-self-interacting proteins. When performed on Saccharomyces cerevisiae and Human SIPs data sets, the proposed method can achieve high accuracies of 86.86 and 91.30 percent, respectively. Our method also shows a good performance when compared with the SVM classifier and previous methods. Consequently, the proposed method can be considered to be a novel promising tool to predict SIPs.
引用
收藏
页码:1165 / 1172
页数:8
相关论文
共 50 条
  • [21] Accurate prediction of essential proteins using ensemble machine learning
    鲁德志
    吴淏
    侯俞彤
    吴云成
    刘媛媛
    王金武
    Chinese Physics B, 2025, 34 (01) : 112 - 119
  • [22] Accurate prediction of essential proteins using ensemble machine learning
    Lu, Dezhi
    Wu, Hao
    Hou, Yutong
    Wu, Yuncheng
    Liu, Yuanyuan
    Wang, Jinwu
    CHINESE PHYSICS B, 2025, 34 (01)
  • [23] In vitro selection of self-interacting transmembrane segments - Membrane proteins approached from a different perspective
    Langosch, D
    Lindner, E
    Gurezka, R
    IUBMB LIFE, 2002, 54 (03) : 109 - 113
  • [24] Prediction of local structural stabilities of proteins from their amino acid sequences
    Tartaglia, Gian Gaetano
    Cavalli, Andrea
    Vendruscolo, Michele
    STRUCTURE, 2007, 15 (02) : 139 - 143
  • [25] Accurate Prediction of Human Essential Proteins Using Ensemble Deep Learning
    Li, Yiming
    Zeng, Min
    Wu, Yifan
    Li, Yaohang
    Li, Min
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3263 - 3271
  • [26] Prediction of Disordered Regions in Proteins Using Physicochemical Properties of Amino Acids
    Gok, Murat
    Kocal, Osman Hilmi
    Genc, Sevdanur
    INTERNATIONAL JOURNAL OF PEPTIDE RESEARCH AND THERAPEUTICS, 2016, 22 (01) : 31 - 36
  • [27] Prediction of Disordered Regions in Proteins Using Physicochemical Properties of Amino Acids
    Murat Gök
    Osman Hilmi Koçal
    Sevdanur Genç
    International Journal of Peptide Research and Therapeutics, 2016, 22 : 31 - 36
  • [28] Global Vectors Representation of Protein Sequences and Its Application for Predicting Self-Interacting Proteins with Multi-Grained Cascade Forest Model
    Chen, Zhan-Heng
    You, Zhu-Hong
    Zhang, Wen-Bo
    Wang, Yan-Bin
    Cheng, Li
    Alghazzawi, Daniyal
    GENES, 2019, 10 (11)
  • [29] PREDICTION OF CONFORMATION OF MEMBRANE PROTEINS FROM SEQUENCE OF AMINO-ACIDS
    GREEN, NM
    FLANAGAN, MT
    BIOCHEMICAL JOURNAL, 1976, 153 (03) : 729 - 732
  • [30] Prediction of RNA-binding amino acids from protein and RNA sequences
    Choi, Sungwook
    Han, Kyungsook
    BMC BIOINFORMATICS, 2011, 12