pLoc-mVirus: Predict subcellular localization of multi-location virus proteins via incorporating the optimal GO information into general PseAAC

被引:135
|
作者
Cheng, Xiang [1 ]
Xiao, Xuan [1 ,3 ]
Chou, Kuo-Chen [2 ,3 ]
机构
[1] Jingdezhen Ceram Inst, Comp Dept, Jingdezhen, Peoples R China
[2] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu 610054, Sichuan, Peoples R China
[3] Gordon Life Sci Inst, Boston, MA 02478 USA
基金
中国国家自然科学基金;
关键词
Multi-label system; GO; PseAAC; ML-GKR; Chou's metrics; AMINO-ACID-COMPOSITION; MULTI-LABEL CLASSIFIER; SEQUENCE-BASED PREDICTOR; ENSEMBLE CLASSIFIER; LOCATION PREDICTION; WEB SERVER; DIPEPTIDE COMPOSITION; LEARNING CLASSIFIER; TOPOLOGICAL INDEXES; MEMBRANE-PROTEINS;
D O I
10.1016/j.gene.2017.07.036
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Knowledge of subcellular locations of proteins is crucially important for in-depth understanding their functions in a cell. With the explosive growth of protein sequences generated in the postgenomic age, it is highly demanded to develop computational tools for timely annotating their subcellular locations based on the sequence information alone. The current study is focused on virus proteins. Although considerable efforts have been made in this regard, the problem is far from being solved yet. Most existing methods can be used to deal with single location proteins only. Actually, proteins with multi-locations may have some special biological functions. This kind of multiplex proteins is particularly important for both basic research and drug design. Using the multi-label theory, we present a new predictor called "pLoc-mVirus" by extracting the optimal GO (Gene Ontology) information into the general PseAAC (Pseudo Amino Acid Composition). Rigorous cross-validation on a same stringent benchmark dataset indicated that the proposed pLoc-mVirus predictor is remarkably superior to iLoc-Virus, the state-of-the-art method in predicting virus protein subcellular localization. To maximize the convenience of most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc-mVirus/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.
引用
收藏
页码:315 / 321
页数:7
相关论文
共 13 条
  • [1] pLoc-mVirus: Predict subcellular localization of multi-location virus proteins via incorporating the optimal GO information into general PseAAC (vol 628, pg 315, 2017)
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENE, 2018, 644 : 156 - 156
  • [2] pLoc-mVirus: Predict subcellular localization of multi-location virus proteins via incorporating the optimal GO information into general PseAAC (vol 628, pg 315, 2017)
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENE, 2018, 646 : 234 - 234
  • [3] pLoc-mPlant: predict subcellular localization of multi-location plant proteins by incorporating the optimal GO information into general PseAAC
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. MOLECULAR BIOSYSTEMS, 2017, 13 (09) : 1722 - 1727
  • [4] pLoc-mHum: predict subcellular localization of multi-location human proteins via general PseAAC to winnow out the crucial GO information
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. BIOINFORMATICS, 2018, 34 (09) : 1448 - 1456
  • [5] pLoc-mEuk: Predict subcellular localization of multi-label eukaryotic proteins by extracting the key GO information into general PseAAC
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENOMICS, 2018, 110 (01) : 50 - 58
  • [6] pLoc_bal-mVirus: Predict Subcellular Localization of Multi-Label Virus Proteins by Chou's General PseAAC and IHTS Treatment to Balance Training Dataset
    Xiao, Xuan
    Cheng, Xiang
    Chen, Genqiang
    Mao, Qi
    Chou, Kuo-Chen
    [J]. MEDICINAL CHEMISTRY, 2019, 15 (05) : 496 - 509
  • [7] pLoc_bal-mPlant: Predict Subcellular Localization of Plant Proteins by General PseAAC and Balancing Training Dataset
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. CURRENT PHARMACEUTICAL DESIGN, 2018, 24 (34) : 4013 - 4022
  • [8] pLoc-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. GENOMICS, 2018, 110 (04) : 231 - 239
  • [9] pLoc_bal-mEuk: Predict Subcellular Localization of Eukaryotic Proteins by General PseAAC and Quasi-balancing Training Dataset
    Chou, Kuo-Chen
    Cheng, Xiang
    Xiao, Xuan
    [J]. MEDICINAL CHEMISTRY, 2019, 15 (05) : 472 - 485
  • [10] pLoc_bal-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by quasi-balancing training dataset and general PseAAC
    Cheng, Xiang
    Xiao, Xuan
    Chou, Kuo-Chen
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2018, 458 : 92 - 102