pLoc-mPlant: predict subcellular localization of multi-location plant proteins by incorporating the optimal GO information into general PseAAC

被引:171
|
作者
Cheng, Xiang [1 ]
Xiao, Xuan [1 ,2 ]
Chou, Kuo-Chen [2 ,3 ]
机构
[1] Jingdezhen Ceram Inst, Comp Dept, Jingdezhen, Peoples R China
[2] Gordon Life Sci Inst, Boston, MA 02478 USA
[3] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu 610054, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
AMINO-ACID-COMPOSITION; MULTI-LABEL CLASSIFIER; ENZYME SUBFAMILY CLASSES; SUPPORT VECTOR MACHINE; ENSEMBLE CLASSIFIER; ANTIMICROBIAL PEPTIDES; DIPEPTIDE COMPOSITION; LEARNING CLASSIFIER; LOCATION PREDICTION; MEMBRANE-PROTEINS;
D O I
10.1039/c7mb00267j
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
One of the fundamental goals in cellular biochemistry is to identify the functions of proteins in the context of compartments that organize them in the cellular environment. To realize this, it is indispensable to develop an automated method for fast and accurate identification of the subcellular locations of uncharacterized proteins. The current study is focused on plant protein subcellular location prediction based on the sequence information alone. Although considerable efforts have been made in this regard, the problem is far from being solved yet. Most of the existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions. This kind of multiplex protein is particularly important for both basic research and drug design. Using the multi-label theory, we present a new predictor called "pLoc-mPlant" by extracting the optimal GO (Gene Ontology) information into the Chou's general PseAAC (Pseudo Amino Acid Composition). Rigorous cross-validation on the same stringent benchmark dataset indicated that the proposed pLoc-mPlant predictor is remarkably superior to iLoc-Plant, the state-of-the-art method for predicting plant protein subcellular localization. To maximize the convenience of most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc-mPlant/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.
引用
收藏
页码:1722 / 1727
页数:6
相关论文
共 24 条
  • [1] pLoc-mVirus: Predict subcellular localization of multi-location virus proteins via incorporating the optimal GO information into general PseAAC
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENE, 2017, 628 : 315 - 321
  • [2] pLoc-mVirus: Predict subcellular localization of multi-location virus proteins via incorporating the optimal GO information into general PseAAC (vol 628, pg 315, 2017)
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENE, 2018, 644 : 156 - 156
  • [3] pLoc-mVirus: Predict subcellular localization of multi-location virus proteins via incorporating the optimal GO information into general PseAAC (vol 628, pg 315, 2017)
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENE, 2018, 646 : 234 - 234
  • [4] pLoc-mHum: predict subcellular localization of multi-location human proteins via general PseAAC to winnow out the crucial GO information
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. BIOINFORMATICS, 2018, 34 (09) : 1448 - 1456
  • [5] pLoc_bal-mPlant: Predict Subcellular Localization of Plant Proteins by General PseAAC and Balancing Training Dataset
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. CURRENT PHARMACEUTICAL DESIGN, 2018, 24 (34) : 4013 - 4022
  • [6] pLoc-mEuk: Predict subcellular localization of multi-label eukaryotic proteins by extracting the key GO information into general PseAAC
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENOMICS, 2018, 110 (01) : 50 - 58
  • [7] pLoc_bal-mEuk: Predict Subcellular Localization of Eukaryotic Proteins by General PseAAC and Quasi-balancing Training Dataset
    Chou, Kuo-Chen
    Cheng, Xiang
    Xiao, Xuan
    [J]. MEDICINAL CHEMISTRY, 2019, 15 (05) : 472 - 485
  • [8] pLoc_bal-mAnimal: predict subcellular localization of animal proteins by balancing training dataset and PseAAC
    Cheng, Xiang
    Lin, Wei-Zhong
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. BIOINFORMATICS, 2019, 35 (03) : 398 - 406
  • [9] pLoc_bal-mVirus: Predict Subcellular Localization of Multi-Label Virus Proteins by Chou's General PseAAC and IHTS Treatment to Balance Training Dataset
    Xiao, Xuan
    Cheng, Xiang
    Chen, Genqiang
    Mao, Qi
    Chou, Kuo-Chen
    [J]. MEDICINAL CHEMISTRY, 2019, 15 (05) : 496 - 509
  • [10] pLoc-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENOMICS, 2018, 110 (04) : 231 - 239