Protein Subcellular Localization Based on Evolutionary Information and Segmented Distribution

被引:0
|
作者
Jin, Danyu [1 ]
Zhu, Ping [1 ]
机构
[1] Jiangnan Univ, Sch Sci, Wuxi 214122, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The prediction of protein subcellular localization not only is important for the study of protein structure and function but also can facilitate the design and development of new drugs. In recent years, feature extraction methods based on protein evolution information have attracted much attention and made good progress. Based on the protein position-specific score matrix (PSSM) obtained by PSI-BLAST, PSSM-GSD method is proposed according to the data distribution characteristics. In order to reflect the protein sequence information as much as possible, AAO method, PSSM-AAO method, and PSSM-GSD method are fused together. Then, conditional entropy-based classifier chain algorithm and support vector machine are used to locate multilabel proteins. Finally, we test Gpos-mPLoc and Gneg-mPLoc datasets, considering the severe imbalance of data, and select SMOTE algorithm to expand a few sample; the experiment shows that the AAO + PSSM* method in the paper achieved 83.1% and 86.8% overall accuracy, respectively. After experimental comparison of different methods, AAO + PSSM* has good performance and can effectively predict protein subcellular location.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Prediction of protein subcellular localization based on primary sequence data
    Özarar, M
    Atalay, V
    Atalay, RÇ
    [J]. PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 118 - 120
  • [32] Multisite protein subcellular localization prediction based on entropy density
    Zhao, Qing
    Wang, Dong
    Chen, Yuehui
    Qu, Xumi
    [J]. BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26 : S2003 - S2009
  • [33] Prediction of protein subcellular localization based on primary sequence data
    Özarar, M
    Atalay, V
    Atalay, RÇ
    [J]. COMPUTER AND INFORMATION SCIENCES - ISCIS 2003, 2003, 2869 : 611 - 618
  • [34] Deep Forest-based Prediction of Protein Subcellular Localization
    Zhao, Lingling
    Wang, Junjie
    Nabil, Mahieddine Mohammed
    Zhang, Jun
    [J]. CURRENT GENE THERAPY, 2018, 18 (05) : 268 - 274
  • [35] Gene ontology based transfer learning for protein subcellular localization
    Suyu Mei
    Wang Fei
    Shuigeng Zhou
    [J]. BMC Bioinformatics, 12
  • [36] Prediction of subcellular protein localization based on functional domain composition
    Jia, Peilin
    Qian, Ziliang
    Zeng, ZhenBin
    Cai, Yudong
    Li, Yixue
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2007, 357 (02) : 366 - 370
  • [37] Automated Protein Subcellular Localization Based on Local Invariant Features
    Li, Chao
    Wang, Xue-hong
    Zheng, Li
    Huang, Ji-feng
    [J]. PROTEIN JOURNAL, 2013, 32 (03): : 230 - 237
  • [38] Automated Protein Subcellular Localization Based on Local Invariant Features
    Chao Li
    Xue-hong Wang
    Li Zheng
    Ji-feng Huang
    [J]. The Protein Journal, 2013, 32 : 230 - 237
  • [39] Tissue distribution and subcellular localization of ζ-sarcoglycan
    Kutzick, C
    Herrmann, R
    Neumann, B
    Kawaguchi, N
    Straub, V
    Zaik, M
    Wewer, U
    Voit, T
    [J]. NEUROMUSCULAR DISORDERS, 2002, 12 (7-8) : 733 - 733
  • [40] SUBCELLULAR-LOCALIZATION AND TISSUE DISTRIBUTION OF THE 110 KDA HDL RECEPTOR PROTEIN
    HOKLAND, BM
    MENDEZ, AJ
    ORAM, JF
    [J]. ARTERIOSCLEROSIS, 1990, 10 (05): : A767 - A768