A Feature Selection Method for Prediction Essential Protein

被引:30
|
作者
Zhong, Jiancheng [1 ,2 ]
Wang, Jianxin [1 ]
Peng, Wei [3 ]
Zhang, Zhen [1 ]
Li, Min [1 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Peoples R China
[2] Hunan Normal Univ, Coll Polytech, Changsha 410083, Peoples R China
[3] Kunming Univ Sci & Technol, Ctr Comp, Kunming 650093, Peoples R China
基金
中国国家自然科学基金;
关键词
essential protein; feature selection; Protein-Protein Interaction (PPI); machine learning; centrality algorithm; ESSENTIAL GENES; SACCHAROMYCES-CEREVISIAE; IDENTIFICATION; CENTRALITY; NETWORKS; LOCALIZATION; INTEGRATION; ORTHOLOGY; IDENTIFY; DATABASE;
D O I
10.1109/TST.2015.7297748
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Essential proteins are vital to the survival of a cell. There are various features related to the essentiality of proteins, such as biological and topological features. Many computational methods have been developed to identify essential proteins by using these features. However, it is still a big challenge to design an effective method that is able to select suitable features and integrate them to predict essential proteins. In this work, we first collect 26 features, and use SVM-RFE to select some of them to create a feature space for predicting essential proteins, and then remove the features that share the biological meaning with other features in the feature space according to their Pearson Correlation Coefficients (PCC). The experiments are carried out on S. cerevisiae data. Six features are determined as the best subset of features. To assess the prediction performance of our method, we further compare it with some machine learning methods, such as SVM, Naive Bayes, Bayes Network, and NBTree when inputting the different number of features. The results show that those methods using the 6 features outperform that using other features, which confirms the effectiveness of our feature selection method for essential protein prediction.
引用
收藏
页码:491 / 499
页数:9
相关论文
共 50 条
  • [1] A Feature Selection Method for Prediction Essential Protein
    Jiancheng Zhong
    Jianxin Wang
    Wei Peng
    Zhen Zhang
    Min Li
    Tsinghua Science and Technology, 2015, 20 (05) : 491 - 499
  • [2] FrankSum: New feature selection method for protein function prediction
    Al-Shahib, A
    Breitling, R
    Gilbert, D
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (04) : 259 - 275
  • [3] A Feature and Algorithm Selection Method for Improving the Prediction of Protein Structural Class
    Ni, Qianwu
    Chen, Lei
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2017, 20 (07) : 612 - 621
  • [4] Feature Selection for Protein Dihedral Angle Prediction
    Aydin, Zafer
    Kaynar, Oguz
    Gormez, Yasin
    2017 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2017, : 48 - 52
  • [5] An Ensemble Feature Selection Method for Prediction of CKD
    Manonmani, M.
    Balakrishnan, Sarojini
    2020 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2020), 2020, : 667 - 672
  • [6] Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method
    Gao, Jianzhao
    Tao, Xue-Wen
    Zhao, Jia
    Feng, Yuan-Ming
    Cai, Yu-Dong
    Zhang, Ning
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2017, 20 (07) : 629 - 637
  • [7] Method for prediction of protein-protein interactions in yeast using genomics/proteomics information and feature selection
    Urquiza, J. M.
    Rojas, I.
    Pomares, H.
    Herrera, L. J.
    Ortega, J.
    Prieto, A.
    NEUROCOMPUTING, 2011, 74 (16) : 2683 - 2690
  • [8] Method for Prediction of Protein-Protein Interactions in Yeast Using Genomics/Proteomics Information and Feature Selection
    Urquiza, J. M.
    Rojas, I.
    Pomares, H.
    Florido, J. P.
    Rubio, G.
    Herrera, L. J.
    Calvo, J. C.
    Ortega, J.
    BIO-INSPIRED SYSTEMS: COMPUTATIONAL AND AMBIENT INTELLIGENCE, PT 1, 2009, 5517 : 853 - 860
  • [9] Feature subset selection for protein subcellular localization prediction
    Institute of Automation, National University of Defense Technology, Changsha 410073, Hunan, China
    Lect. Notes Comput. Sci., 2006, (433-443):
  • [10] LncRNA–protein interaction prediction with reweighted feature selection
    Guohao Lv
    Yingchun Xia
    Zhao Qi
    Zihao Zhao
    Lianggui Tang
    Cheng Chen
    Shuai Yang
    Qingyong Wang
    Lichuan Gu
    BMC Bioinformatics, 24