Identification of Bacteriophage Virion Proteins Using Multinomial Naive Bayes with g-Gap Feature Tree

Cited by: 28
Authors
Pan, Yanyuan [1]
Gao, Hui [1]
Lin, Hao [2]
Liu, Zhen [1]
Tang, Lixia [2]
Li, Songtao [1]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Ctr Informat Biol, Chengdu 610054, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Ctr Informat Biol, Key Lab Neuroinformat, Minist Educ, Chengdu 610054, Sichuan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
bacteriophage virion proteins; g-gap peptides; ANOVA; Multinomial Naive Bayes; FEATURE-SELECTION; SITES; PREDICTION; DATABASE; RNA; DNA;
DOI
10.3390/ijms19061779
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Bacteriophages, which are tremendously important to the ecology and evolution of bacteria, play a key role in the development of genetic engineering. Bacteriophage virion proteins are essential components of infectious viral particles and are responsible for several biological functions. The correct identification of bacteriophage virion proteins is of great importance for understanding both life at the molecular level and genetic evolution. However, few computational methods are available for identifying bacteriophage virion proteins. In this paper, we propose a new method to predict bacteriophage virion proteins using a Multinomial Naive Bayes classification model based on discrete features generated from the g-gap feature tree. The accuracy of the proposed model reaches 98.37%, with an MCC of 96.27%, in 10-fold cross-validation. This result suggests that the proposed method can be a useful approach for identifying bacteriophage virion proteins from sequence information. For the convenience of experimental scientists, a freely accessible web server (PhagePred) that implements the proposed predictor is available.
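
The general idea of pairing g-gap peptide counts with a Multinomial Naive Bayes classifier can be illustrated with a minimal Python sketch. It is not the authors' implementation: the g-gap feature tree and the ANOVA-based feature selection named in the keywords are omitted, only g-gap dipeptide counts are used, and the function name `g_gap_counts`, the small `MultinomialNB` class, and the toy sequences are all hypothetical, chosen only for illustration.

```python
import math
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered residue pairs, in a fixed order, so every sequence
# maps to a count vector of the same length.
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def g_gap_counts(seq, g):
    """Count residue pairs (seq[i], seq[i + g + 1]), i.e. pairs with g residues between them."""
    counts = Counter(seq[i] + seq[i + g + 1] for i in range(len(seq) - g - 1))
    return [counts.get(d, 0) for d in DIPEPTIDES]


class MultinomialNB:
    """Plain multinomial Naive Bayes with additive (Laplace) smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, X, y):
        self.classes_ = sorted(set(y))
        self.log_prior_ = {}
        self.log_prob_ = {}
        for c in self.classes_:
            rows = [x for x, label in zip(X, y) if label == c]
            self.log_prior_[c] = math.log(len(rows) / len(X))
            # Smoothed per-feature totals for this class.
            totals = [sum(col) + self.alpha for col in zip(*rows)]
            norm = sum(totals)
            self.log_prob_[c] = [math.log(t / norm) for t in totals]
        return self

    def predict(self, X):
        def score(x, c):
            # Log prior plus count-weighted log feature probabilities.
            return self.log_prior_[c] + sum(
                n * lp for n, lp in zip(x, self.log_prob_[c])
            )

        return [max(self.classes_, key=lambda c: score(x, c)) for x in X]


# Toy data (made up for illustration): label 1 = "virion", 0 = "non-virion".
train = [
    ("AKAKAKAKAK", 1),
    ("KAKAKAKAKA", 1),
    ("GLGLGLGLGL", 0),
    ("LGLGLGLGLG", 0),
]
g = 0  # g = 0 corresponds to adjacent dipeptides
X_train = [g_gap_counts(seq, g) for seq, _ in train]
y_train = [label for _, label in train]

model = MultinomialNB().fit(X_train, y_train)
predictions = model.predict([g_gap_counts(s, g) for s in ["AKAKAK", "GLGLGL"]])
print(predictions)
```

In a realistic setting the count vectors for several values of g would be combined and filtered by a feature-selection step before training, and performance would be estimated by cross-validation, as the abstract reports.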
Pages: 12