Imbalanced Multi-Modal Multi-Label Learning for Subcellular Localization Prediction of Human Proteins with Both Single and Multiple Sites

被引:30
|
作者
He, Jianjun [1 ]
Gu, Hong [1 ]
Liu, Wenqi [1 ]
机构
[1] Dalian Univ Technol, Sch Control Sci & Engn, Dalian, Liaoning, Peoples R China
来源
PLOS ONE | 2012年 / 7卷 / 06期
关键词
AMINO-ACID-COMPOSITION; PROTEASE CLEAVAGE SITES; LOCATION PREDICTION; FUNCTIONAL DOMAIN; WEB SERVER; FUSION CLASSIFIER; EUK-MPLOC; VIRUS; SCALE; ENSEMBLE;
D O I
10.1371/journal.pone.0037155
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
It is well known that an important step toward understanding the functions of a protein is to determine its subcellular location. Although numerous prediction algorithms have been developed, most of them typically focused on the proteins with only one location. In recent years, researchers have begun to pay attention to the subcellular localization prediction of the proteins with multiple sites. However, almost all the existing approaches have failed to take into account the correlations among the locations caused by the proteins with multiple sites, which may be the important information for improving the prediction accuracy of the proteins with multiple sites. In this paper, a new algorithm which can effectively exploit the correlations among the locations is proposed by using Gaussian process model. Besides, the algorithm also can realize optimal linear combination of various feature extraction technologies and could be robust to the imbalanced data set. Experimental results on a human protein data set show that the proposed algorithm is valid and can achieve better performance than the existing approaches.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] iLoc-Virus: A multi-label learning classifier for identifying the subcellular localization of virus proteins with both single and multiple sites
    Xiao, Xuan
    Wu, Zhi-Cheng
    Chou, Kuo-Chen
    JOURNAL OF THEORETICAL BIOLOGY, 2011, 284 (01) : 42 - 51
  • [2] iLoc-Plant: a multi-label classifier for predicting the subcellular localization of plant proteins with both single and multiple sites
    Wu, Zhi-Cheng
    Xiao, Xuan
    Chou, Kuo-Chen
    MOLECULAR BIOSYSTEMS, 2011, 7 (12) : 3287 - 3297
  • [3] A Multi-Label Classifier for Predicting the Subcellular Localization of Gram-Negative Bacterial Proteins with Both Single and Multiple Sites
    Xiao, Xuan
    Wu, Zhi-Cheng
    Chou, Kuo-Chen
    PLOS ONE, 2011, 6 (06):
  • [4] Network Based Subcellular Localization Prediction for Multi-Label Proteins
    Mondal, Ananda Mohan
    Lin, Jhih-rong
    Hu, Jianjun
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 473 - 480
  • [5] ML-rRBF-ECOC: A Multi-Label Learning Classifier for Predicting Protein Subcellular Localization with Both Single and Multiple Sites
    Han, Guo-Sheng
    Yu, Zu-Guo
    CURRENT PROTEOMICS, 2019, 16 (05) : 359 - 365
  • [6] Collaboration based multi-modal multi-label learning
    Zhang, Yi
    Zhu, Yinlong
    Zhang, Zhecheng
    Wang, Chongjung
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14204 - 14217
  • [7] Collaboration based multi-modal multi-label learning
    Yi Zhang
    Yinlong Zhu
    Zhecheng Zhang
    Chongjung Wang
    Applied Intelligence, 2022, 52 : 14204 - 14217
  • [8] Multi-scale deep learning for the imbalanced multi-label protein subcellular localization prediction based on immunohistochemistry images
    Wang, Fengsheng
    Wei, Leyi
    BIOINFORMATICS, 2022, 38 (09) : 2602 - 2611
  • [9] Rethinking Modal-oriented Label Correlations for Multi-modal Multi-label Learning
    Zhang, Yi
    Shen, Jundong
    Zhang, Zhecheng
    Zhang, Lei
    Wang, Chongjun
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [10] Multi-Label Multi-Kernel Transfer Learning for Human Protein Subcellular Localization
    Mei, Suyu
    PLOS ONE, 2012, 7 (06):