iLoc-Plant: a multi-label classifier for predicting the subcellular localization of plant proteins with both single and multiple sites

被引:202
|
作者
Wu, Zhi-Cheng [1 ,2 ]
Xiao, Xuan [1 ,2 ]
Chou, Kuo-Chen [2 ]
机构
[1] Jing De Zhen Ceram Inst, Dept Comp, Jing De Zhen 333046, Peoples R China
[2] Gordon Life Sci Inst, San Diego, CA 92130 USA
基金
中国国家自然科学基金;
关键词
AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINES; GRAM-NEGATIVE BACTERIA; TERMINAL TARGETING SEQUENCES; IMPROVED HYBRID APPROACH; APOPTOSIS PROTEINS; LOCATION PREDICTION; GENE ONTOLOGY; SORTING SIGNALS; REPRESENTATION;
D O I
10.1039/c1mb05232b
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Predicting protein subcellular localization is a challenging problem, particularly when query proteins may simultaneously exist at, or move between, two or more different subcellular location sites. Most of the existing methods can only be used to deal with the single-location proteins. Actually, multiple-location proteins should not be ignored because they usually bear some special functions worthy of our notice. By introducing the "multi-labeled learning" approach, a new predictor, called iLoc-Plant, has been developed that can be used to deal with the systems containing both single-and multiple-location plant proteins. As a demonstration, the jackknife cross-validation was performed with iLoc-Plant on a benchmark dataset of plant proteins classified into the following 12 location sites: (1) cell membrane, (2) cell wall, (3) chloroplast, (4) cytoplasm, (5) endoplasmic reticulum, (6) extracellular, (7) Golgi apparatus, (8) mitochondrion, (9) nucleus, (10) peroxisome, (11) plastid, and (12) vacuole, where some proteins belong to two or three locations but none has >= 25% pairwise sequence identity to any other in a same subset. The overall success rate thus obtained by iLoc-Plant was 71%, which is remarkably higher than those achieved by any existing predictors that also have the capacity to deal with such a stringent and complicated plant protein system. As a user-friendly web-server, iLoc-Plant is freely accessible to the public at the web-site http://icpr.jci.edu.cn/bioinfo/iLoc-Plant or http://www.jci-bioinfo.cn/iLoc-Plant. Moreover, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results without the need to follow the complicated mathematic equations presented in this paper for its integrity. It is anticipated that iLoc-Plant may become a useful bioinformatics tool for Molecular Cell Biology, Proteomics, Systems Biology, and Drug Development.
引用
收藏
页码:3287 / 3297
页数:11
相关论文
共 50 条
  • [1] iLoc-Virus: A multi-label learning classifier for identifying the subcellular localization of virus proteins with both single and multiple sites
    Xiao, Xuan
    Wu, Zhi-Cheng
    Chou, Kuo-Chen
    JOURNAL OF THEORETICAL BIOLOGY, 2011, 284 (01) : 42 - 51
  • [2] A Multi-Label Classifier for Predicting the Subcellular Localization of Gram-Negative Bacterial Proteins with Both Single and Multiple Sites
    Xiao, Xuan
    Wu, Zhi-Cheng
    Chou, Kuo-Chen
    PLOS ONE, 2011, 6 (06):
  • [3] iLoc-Animal: a multi-label learning classifier for predicting subcellular localization of animal proteins
    Lin, Wei-Zhong
    Fang, Jian-An
    Xiao, Xuan
    Chou, Kuo-Chen
    MOLECULAR BIOSYSTEMS, 2013, 9 (04) : 634 - 644
  • [4] iLoc-Euk: A Multi-Label Classifier for Predicting the Subcellular Localization of Singleplex and Multiplex Eukaryotic Proteins
    Chou, Kuo-Chen
    Wu, Zhi-Cheng
    Xiao, Xuan
    PLOS ONE, 2011, 6 (03):
  • [5] ML-rRBF-ECOC: A Multi-Label Learning Classifier for Predicting Protein Subcellular Localization with Both Single and Multiple Sites
    Han, Guo-Sheng
    Yu, Zu-Guo
    CURRENT PROTEOMICS, 2019, 16 (05) : 359 - 365
  • [6] Imbalanced Multi-Modal Multi-Label Learning for Subcellular Localization Prediction of Human Proteins with Both Single and Multiple Sites
    He, Jianjun
    Gu, Hong
    Liu, Wenqi
    PLOS ONE, 2012, 7 (06):
  • [7] An Ensemble Classifier with Random Projection for Predicting Multi-label Protein Subcellular Localization
    Wan, Shibiao
    Mak, Man-Wai
    Zhang, Bai
    Wang, Yue
    Kung, Sun-Yuan
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [8] Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins
    Shibiao Wan
    Man-Wai Mak
    Sun-Yuan Kung
    BMC Bioinformatics, 17
  • [9] Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins
    Wan, Shibiao
    Mak, Man-Wai
    Kung, Sun-Yuan
    BMC BIOINFORMATICS, 2016, 17
  • [10] iLoc-Hum: using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites
    Chou, Kuo-Chen
    Wu, Zhi-Cheng
    Xiao, Xuan
    MOLECULAR BIOSYSTEMS, 2012, 8 (02) : 629 - 641