New Feature Vector for Apoptosis Protein Subcellular Localization Prediction

被引:0
|
作者
Govindan, Geetha [1 ]
Nair, Achuthsankar S. [2 ]
机构
[1] Univ Kerala, Ctr Bioinformat, Thiruvananthapuram 695034, Kerala, India
[2] Univ Kerala, Interuniv Ctr Excellence Bioinformat, Thiruvananthapuram 695034, Kerala, India
关键词
protein subcellular localization; naive bayes classification; correlation-based feature selection; di-peptides; AMINO-ACID-COMPOSITION; FUNCTIONAL DOMAIN COMPOSITION; LOCATION PREDICTION; MACHINES; SEQUENCE;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
It is widely recognized that the information for determining the final subcellular localization of proteins is found in their amino acid sequences. In this work we present new features extracted from the full length protein sequence to incorporate more biological information. Features are based on the occurrence frequency of di-peptides - traditional, higher order. Naive Bayes classification along with correlation-based feature selection method is proposed to predict the subcellular location of apoptosis protein sequences. Our system makes predictions with an accuracy of 83% using Naive Bayes classification alone and 86% using Naive Bayes classification with correlation-based feature selection. This result shows that the new feature vector is promising, and helps in increasing the prediction accuracy.
引用
收藏
页码:294 / +
页数:3
相关论文
共 50 条
  • [1] A novel representation for apoptosis protein subcellular localization prediction using support vector machine
    Zhang, Li
    Liao, Bo
    Li, Dachao
    Zhu, Wen
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2009, 259 (02) : 361 - 365
  • [2] Prediction of Subcellular Localization for Apoptosis Protein: Approached with a Novel Representation and Support Vector Machine
    Jian, Guiqian
    Zhang, Yusen
    Qian, Panpan
    [J]. MATCH-COMMUNICATIONS IN MATHEMATICAL AND IN COMPUTER CHEMISTRY, 2012, 67 (03) : 867 - 878
  • [3] Feature subset selection for protein subcellular localization prediction
    Institute of Automation, National University of Defense Technology, Changsha 410073, Hunan, China
    [J]. Lect. Notes Comput. Sci., 2006, (433-443):
  • [4] Feature Extraction Techniques for Protein Subcellular Localization Prediction
    Gao, Qing-Bin
    Jin, Zhi-Chao
    Wu, Cheng
    Sun, Ya-Lin
    He, Jia
    He, Xiang
    [J]. CURRENT BIOINFORMATICS, 2009, 4 (02) : 120 - 128
  • [5] Feature subset selection for protein subcellular localization prediction
    Gao, Qing-Bin
    Wang, Zheng-Zhi
    [J]. COMPUTATIONAL INTELLIGENCE AND BIOINFORMATICS, PT 3, PROCEEDINGS, 2006, 4115 : 433 - 443
  • [6] Prediction of protein subcellular multisite localization using a new feature extraction method
    Wang, L. Y.
    Wang, D.
    Chen, Y. H.
    [J]. GENETICS AND MOLECULAR RESEARCH, 2016, 15 (03):
  • [7] Support vector machine approach for protein subcellular localization prediction
    Hua, SJ
    Sun, ZR
    [J]. BIOINFORMATICS, 2001, 17 (08) : 721 - 728
  • [8] A Compact Hybrid Feature Vector for an Accurate Prediction of Protein Subcellular Location
    Yang, Ang
    Liao, Bo
    Peng, Ting
    Li, Renfa
    Cao, Zhi
    [J]. JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2013, 10 (03) : 679 - 683
  • [9] Feature Fusion Based SVM Classifier for Protein Subcellular Localization Prediction
    Rahman, Julia
    Mondal, Md. Nazrul Islam
    Ben Islam, Md. Khaled
    Al Mehedi, Md.
    [J]. JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2016, 13 (01) : 288
  • [10] Subcellular localization prediction with new protein encoding schemes
    Ogul, Hasan
    Mumcuoglu, Erkan U.
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (02) : 227 - 232