Protein Subcellular Localization Based on Evolutionary Information and Segmented Distribution

被引:0
|
作者
Jin, Danyu [1 ]
Zhu, Ping [1 ]
机构
[1] Jiangnan Univ, Sch Sci, Wuxi 214122, Jiangsu, Peoples R China
关键词
PREDICTION;
D O I
10.1155/2021/8629776
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The prediction of protein subcellular localization not only is important for the study of protein structure and function but also can facilitate the design and development of new drugs. In recent years, feature extraction methods based on protein evolution information have attracted much attention and made good progress. Based on the protein position-specific score matrix (PSSM) obtained by PSI-BLAST, PSSM-GSD method is proposed according to the data distribution characteristics. In order to reflect the protein sequence information as much as possible, AAO method, PSSM-AAO method, and PSSM-GSD method are fused together. Then, conditional entropy-based classifier chain algorithm and support vector machine are used to locate multilabel proteins. Finally, we test Gpos-mPLoc and Gneg-mPLoc datasets, considering the severe imbalance of data, and select SMOTE algorithm to expand a few sample; the experiment shows that the AAO + PSSM* method in the paper achieved 83.1% and 86.8% overall accuracy, respectively. After experimental comparison of different methods, AAO + PSSM* has good performance and can effectively predict protein subcellular location.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Protein Subcellular Localization Based on Evolutionary Information and Segmented Distribution
    Jin, Danyu
    Zhu, Ping
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [2] Prediction of Apoptosis Protein’s Subcellular Localization by Fusing Two Different Descriptors Based on Evolutionary Information
    Yunyun Liang
    Shengli Zhang
    [J]. Acta Biotheoretica, 2018, 66 : 61 - 78
  • [3] Prediction of Apoptosis Protein's Subcellular Localization by Fusing Two Different Descriptors Based on Evolutionary Information
    Liang, Yunyun
    Zhang, Shengli
    [J]. ACTA BIOTHEORETICA, 2018, 66 (01) : 61 - 78
  • [4] Improving prediction of protein subcellular localization using evolutionary information and sequence-order information
    Wang, Minghui
    Li, Ao
    Xie, Dan
    Fan, Zhewen
    Jiang, Zhaohui
    Feng, Huanqing
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 4434 - 4436
  • [5] A new hybrid approach to predict subcellular localization by incorporating protein evolutionary conservation information
    Zhang, ShaoWu
    Zhang, YunLong
    Li, JunHui
    Yang, HuiFeng
    Cheng, YongMei
    Zhou, GuoPing
    [J]. LIFE SYSTEM MODELING AND SIMULATION, PROCEEDINGS, 2007, 4689 : 172 - +
  • [6] Subcellular localization prediction of apoptosis proteins based on evolutionary information and support vector machine
    Xiang, Qilin
    Liao, Bo
    Li, Xianhong
    Xu, Huimin
    Chen, Jing
    Shi, Zhuoxing
    Dai, Qi
    Yao, Yuhua
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 78 : 41 - 46
  • [7] Identification of protein subcellular localization via integrating evolutionary and physicochemical information into Chou's general PseAAC
    Shen, Yinan
    Tang, Jijun
    Guo, Fei
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2019, 462 : 230 - 239
  • [8] Detection of Protein Subcellular Localization based on a Full Syntactic Parser and Semantic Information
    Kim, Mi-Young
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 4, PROCEEDINGS, 2008, : 407 - 411
  • [9] Predicting protein subcellular localization based on information content of gene ontology terms
    Zhang, Shu-Bo
    Tang, Qiang-Rong
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2016, 65 : 1 - 7
  • [10] Prediction of protein subcellular localization with a novel method: Sequence-segmented PseAAC
    Zhang, Shao-Wu
    Yang, Hui-Fang
    Li, Qi-Peng
    Cheng, Yong-Mei
    Pan, Quan
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 4024 - 4028