pLoc-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC

被引:117
|
作者
Cheng, Xiang [1 ,2 ]
Xiao, Xuan [1 ,2 ]
Chou, Kuo-Chen [2 ,3 ,4 ]
机构
[1] Jingdezhen Ceram Inst, Comp Dept, Jingdezhen, Peoples R China
[2] Gordon Life Sci Inst, Boston, MA 02478 USA
[3] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu 610054, Sichuan, Peoples R China
[4] King Abdulaziz Univ, Fac Comp & Informat Technol Rabigh, Jeddah, Saudi Arabia
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Multi-label system; Chou's general PseAAC; Gene ontology; ML-GKR; Chou's metrics; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; MULTI-LABEL CLASSIFIER; LYSINE SUCCINYLATION SITES; ENSEMBLE CLASSIFIER; LOCATION PREDICTION; VIRUS PROTEINS; WEB SERVER; PHYSICOCHEMICAL PROPERTIES; DIPEPTIDE COMPOSITION;
D O I
10.1016/j.ygeno.2017.10.002
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Information of the proteins' subcellular localization is crucially important for revealing their biological functions in a cell, the basic unit of life. With the avalanche of protein sequences generated in the postgenomic age, it is highly desired to develop computational tools for timely identifying their subcellular locations based on the sequence information alone. The current study is focused on the Gram-negative bacterial proteins. Although considerable efforts have been made in protein subcellular prediction, the problem is far from being solved yet. This is because mounting evidences have indicated that many Gram-negative bacterial proteins exist in two or more location sites. Unfortunately, most existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions important for both basic research and drug design. In this study, by using the multi-label theory, we developed a new predictor called "pLoc-mGneg" for predicting the subcellular localization of Gram-negative bacterial proteins with both single and multiple locations. Rigorous cross-validation on a high quality benchmark dataset indicated that the proposed predictor is remarkably superior to "iLoc-Gneg", the state-of-the-art predictor for the same purpose. For the convenience of most experimental scientists, a user-friendly web-server for the novel predictor has been established at http://www.jci-bioinfo.cn/pLoc-mGneg/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.
引用
收藏
页码:231 / 239
页数:9
相关论文
共 28 条
  • [21] pLoc-mVirus: Predict subcellular localization of multi-location virus proteins via incorporating the optimal GO information into general PseAAC (vol 628, pg 315, 2017)
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    GENE, 2018, 644 : 156 - 156
  • [22] pLoc_bal-mVirus: Predict Subcellular Localization of Multi-Label Virus Proteins by Chou's General PseAAC and IHTS Treatment to Balance Training Dataset
    Xiao, Xuan
    Cheng, Xiang
    Chen, Genqiang
    Mao, Qi
    Chou, Kuo-Chen
    MEDICINAL CHEMISTRY, 2019, 15 (05) : 496 - 509
  • [23] Multi-location gram-positive and gram-negative bacterial protein subcellular localization using gene ontology and multi-label classifier ensemble
    Wang, Xiao
    Zhang, Jun
    Li, Guo-Zheng
    BMC BIOINFORMATICS, 2015, 16
  • [24] Multi-location gram-positive and gram-negative bacterial protein subcellular localization using gene ontology and multi-label classifier ensemble
    Xiao Wang
    Jun Zhang
    Guo-Zheng Li
    BMC Bioinformatics, 16
  • [25] A Multi-Label Classifier for Predicting the Subcellular Localization of Gram-Negative Bacterial Proteins with Both Single and Multiple Sites
    Xiao, Xuan
    Wu, Zhi-Cheng
    Chou, Kuo-Chen
    PLOS ONE, 2011, 6 (06):
  • [26] Gneg-mPLoc: A top-down strategy to enhance the quality of predicting subcellular localization of Gram-negative bacterial proteins
    Shen, Hong-Bin
    Chou, Kuo-Chen
    JOURNAL OF THEORETICAL BIOLOGY, 2010, 264 (02) : 326 - 333
  • [27] Identify Gram-negative bacterial secreted protein types by incorporating different modes of PSSM into Chou's general PseAAC via Kullback-Leibler divergence
    Liang, Yunyun
    Zhang, Shengli
    JOURNAL OF THEORETICAL BIOLOGY, 2018, 454 : 22 - 29
  • [28] Prediction subcellular localization of Gram-negative bacterial proteins by support vector machine using wavelet denoising and Chou's pseudo amino acid composition
    Yu, Bin
    Li, Shan
    Chen, Cheng
    Xu, Jiameng
    Qiu, Wenying
    Wu, Xue
    Chen, Ruixin
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 167 : 102 - 112