A Feature Selection Method for Prediction Essential Protein

被引:30
|
作者
Zhong, Jiancheng [1 ,2 ]
Wang, Jianxin [1 ]
Peng, Wei [3 ]
Zhang, Zhen [1 ]
Li, Min [1 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Peoples R China
[2] Hunan Normal Univ, Coll Polytech, Changsha 410083, Peoples R China
[3] Kunming Univ Sci & Technol, Ctr Comp, Kunming 650093, Peoples R China
基金
中国国家自然科学基金;
关键词
essential protein; feature selection; Protein-Protein Interaction (PPI); machine learning; centrality algorithm; ESSENTIAL GENES; SACCHAROMYCES-CEREVISIAE; IDENTIFICATION; CENTRALITY; NETWORKS; LOCALIZATION; INTEGRATION; ORTHOLOGY; IDENTIFY; DATABASE;
D O I
10.1109/TST.2015.7297748
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Essential proteins are vital to the survival of a cell. There are various features related to the essentiality of proteins, such as biological and topological features. Many computational methods have been developed to identify essential proteins by using these features. However, it is still a big challenge to design an effective method that is able to select suitable features and integrate them to predict essential proteins. In this work, we first collect 26 features, and use SVM-RFE to select some of them to create a feature space for predicting essential proteins, and then remove the features that share the biological meaning with other features in the feature space according to their Pearson Correlation Coefficients (PCC). The experiments are carried out on S. cerevisiae data. Six features are determined as the best subset of features. To assess the prediction performance of our method, we further compare it with some machine learning methods, such as SVM, Naive Bayes, Bayes Network, and NBTree when inputting the different number of features. The results show that those methods using the 6 features outperform that using other features, which confirms the effectiveness of our feature selection method for essential protein prediction.
引用
收藏
页码:491 / 499
页数:9
相关论文
共 50 条
  • [21] Prediction of Protein Cleavage Site with Feature Selection by Random Forest
    Li, Bi-Qing
    Cai, Yu-Dong
    Feng, Kai-Yan
    Zhao, Gui-Jun
    PLOS ONE, 2012, 7 (09):
  • [22] Integrative approaches to the prediction of protein functions based on the feature selection
    Seokha Ko
    Hyunju Lee
    BMC Bioinformatics, 10
  • [23] LncRNA-protein interaction prediction with reweighted feature selection
    Lv, Guohao
    Xia, Yingchun
    Qi, Zhao
    Zhao, Zihao
    Tang, Lianggui
    Chen, Cheng
    Yang, Shuai
    Wang, Qingyong
    Gu, Lichuan
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [24] Comparative Study on Feature Selection in Protein Structure and Function Prediction
    Yi, Wenjing
    Sun, Ao
    Liu, Manman
    Liu, Xiaoqing
    Zhang, Wei
    Dai, Qi
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [25] Prediction of Protein Structural Classes Based on Feature Selection Technique
    Ding, Hui
    Lin, Hao
    Chen, Wei
    Li, Zi-Qiang
    Guo, Feng-Biao
    Huang, Jian
    Rao, Nini
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2014, 6 (03) : 235 - 240
  • [26] Feature selection for data driven prediction of protein model quality
    Montuori, Alfonso
    Pugliese, Luisa
    Raimondo, Giovanni
    Pasero, Eros
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 3561 - +
  • [27] Prediction of Protein-protein Interactions Based on Feature Selection and Data Balancing
    Liu, Liang
    Lu, Wen-Cong
    Cai, Yu-Dong
    Feng, Kai-Yan
    Peng, Chunrong
    Zhu, Yubei
    PROTEIN AND PEPTIDE LETTERS, 2013, 20 (03): : 336 - 345
  • [28] Prediction of protein N-formylation and comparison with N-acetylation based on a feature selection method
    Zhou, You
    Huang, Tao
    Huang, Guohua
    Zhang, Ning
    Kong, XiangYin
    Cai, Yu-Dong
    NEUROCOMPUTING, 2016, 217 : 53 - 62
  • [29] A Novel Feature Selection Method for Software Fault Prediction Model
    Cui, Can
    Liu, Bin
    Li, Guoqi
    2019 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS 2019) - R & M IN THE SECOND MACHINE AGE - THE CHALLENGE OF CYBER PHYSICAL SYSTEMS, 2019,
  • [30] Prediction for Rational Synthesis Based on Weighted Feature Selection Method
    Qi, Miao
    Li, Jinsong
    Wang, Jianzhong
    Lu, Yinghua
    Kong, Jun
    MOLECULAR INFORMATICS, 2013, 32 (9-10) : 765 - 774