Prediction of mitochondrial proteins of malaria parasite using bi-profile Bayes feature extraction

被引:35
|
作者
Jia, Cangzhi [1 ]
Liu, Tian [2 ]
Chang, Alan K. [2 ]
Zhai, Yingying [1 ]
机构
[1] Northeastern Univ, Dept Math, Shenyang 110004, Peoples R China
[2] Dalian Univ Technol, Dept Biosci & Biotechnol, Dalian 116024, Peoples R China
基金
美国国家科学基金会;
关键词
Malaria; Mitochondrial protein; Subcellular location; Bi-profile Bayes; Support vector machine; SUBCELLULAR-LOCALIZATION; CIS/TRANS ISOMERIZATION; DISULFIDE CONNECTIVITY; SEQUENCE; ACCURACY;
D O I
10.1016/j.biochi.2011.01.013
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Mitochondrial proteins of Plasmodium falciparum are considered as attractive targets for anti-malarial drugs, but the experimental identification of these proteins is a difficult and time-consuming task. Computational prediction of mitochondrial proteins offers an alternative approach. However, the commonly used subcellular location prediction methods are unsuited for P. falciparum mitochondrial proteins whereas the organism and organelle-specific methods were constructed on the basis of a rather small dataset. In this study, a novel dataset termed PfM233, which included 108 mitochondrial and 125 non-mitochondrial proteins with sequence similarity below 25%, was established and the methods for predicting mitochondrial proteins of P. falciparum were described. Both bi-profile Bayes and split amino acid composition were applied to extract the features from the N- and C-terminal sequences of these proteins, which were then used to construct two SVM based classifiers (PfMP-N25 and PfMP-30). Using PfM233 as the dataset, PfMP-N25 and PfMP-30 achieved accuracies (MCCs) of 90.13% (0.80) and 90.99% (0.82). When tested with the commonly used 40 mitochondrial proteins in PfM175 and the 108 mitochondrial proteins in PfM233, these two methods obviously outperformed the existing general, organelle-specific and organism and organelle-specific methods. (C) 2011 Elsevier Masson SAS. All rights reserved.
引用
收藏
页码:778 / 782
页数:5
相关论文
共 50 条
  • [41] Systematic analysis of human lysine acetylation proteins and accurate prediction of human lysine acetylation through bi-relative adapted binomial score Bayes feature representation
    Shao, Jianlin
    Xu, Dong
    Hu, Landian
    Kwan, Yiu-Wa
    Wang, Yifei
    Kong, Xiangyin
    Ngai, Sai-Ming
    [J]. MOLECULAR BIOSYSTEMS, 2012, 8 (11) : 2964 - 2973
  • [42] Feature extraction and classification of proteomics data using stationary wavelet transform and naive Bayes classifier
    Liu Dan
    Huang Yuan-yuan
    Ma Chen-xiang
    [J]. 2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010), 2010,
  • [43] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Omar W. Ahmed
    Rami Qahwaji
    Tufan Colak
    Paul A. Higgins
    Peter T. Gallagher
    D. Shaun Bloomfield
    [J]. Solar Physics, 2013, 283 : 157 - 175
  • [44] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Ahmed, Omar W.
    Qahwaji, Rami
    Colak, Tufan
    Higgins, Paul A.
    Gallagher, Peter T.
    Bloomfield, D. Shaun
    [J]. SOLAR PHYSICS, 2013, 283 (01) : 157 - 175
  • [45] Computational Prediction of Lysine Pupylation Sites in Prokaryotic Proteins Using Position Specific Scoring Matrix into Bigram for Feature Extraction
    Singh, Vineet
    Sharma, Alok
    Chandra, Abel
    Dehzangi, Abdollah
    Shigemizu, Daichi
    Tsunoda, Tatsuhiko
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 488 - 500
  • [46] Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA
    Lei Du
    Qingfang Meng
    Yuehui Chen
    Peng Wu
    [J]. BMC Bioinformatics, 21
  • [47] Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA
    Du, Lei
    Meng, Qingfang
    Chen, Yuehui
    Wu, Peng
    [J]. BMC BIOINFORMATICS, 2020, 21 (01)
  • [48] Prediction of protein homo-oligomer types by pseudo amino acid composition: Approached with an improved feature extraction and Naive Bayes Feature Fusion
    Zhang, SW
    Pan, Q
    Zhang, HC
    Shao, ZC
    Shi, JY
    [J]. AMINO ACIDS, 2006, 30 (04) : 461 - 468
  • [49] Prediction of protein homo-oligomer types by pseudo amino acid composition: Approached with an improved feature extraction and Naive Bayes Feature Fusion
    S.-W. Zhang
    Q. Pan
    H.-C. Zhang
    Z.-C. Shao
    J.-Y. Shi
    [J]. Amino Acids, 2006, 30 : 461 - 468
  • [50] Identification of Proteins of Tobacco Mosaic Virus by Using a Method of Feature Extraction
    Chen, Yu-Miao
    Zu, Xin-Ping
    Li, Dan
    [J]. FRONTIERS IN GENETICS, 2020, 11