Feature selection for optimizing the Naive Bayes algorithm

被引:0
|
作者
Winarti, Titin [1 ]
Vydia, Vensy [1 ]
机构
[1] Univ Semarang, Semarang, Indonesia
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Naive Bayes is a data-mining method used in the classification of text-based documents. The advantage of this method is simple algorithms with low calculation complexity. However, Naive Bayes has a weakness where the independence of the Naive Bayes feature cannot always be applied so that it will affect the accuracy of calculations. Naive Bayes therefore needs to be optimized by giving scale using a gain ratio. Weighting with Naive Bayes raises problems in calculating the probability of each document, where many features that do not represent the tested class appear so that there is a misclassification. so weighting with Naive Bayes is still not optimal. This article proposes the optimization of Naive Bayes through using the weighting gain ratio, which is a method of selecting features in the case of text classification. The results of this study indicated that the Naive Bayes optimization method using feature selection and weighting gain ratio produces an accuracy of 94%.
引用
收藏
页码:47 / 51
页数:5
相关论文
共 50 条
  • [31] Optimizing weighted lazy learning and Naive Bayes classification using differential evolution algorithm
    Bai, Yu
    Bain, Michael
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (6) : 3005 - 3024
  • [32] Optimizing weighted lazy learning and Naive Bayes classification using differential evolution algorithm
    Yu Bai
    Michael Bain
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 3005 - 3024
  • [33] Variable selection for Naive Bayes classification
    Blanquero, Rafael
    Carrizosa, Emilio
    Ramirez-Cobo, Pepa
    Remedios Sillero-Denamiel, M.
    COMPUTERS & OPERATIONS RESEARCH, 2021, 135
  • [34] Feature Selection for Chemical Compound Extraction using Wrapper Approach with Naive Bayes Classifier
    Alshaikhdeeb, Basel
    Ahmad, Kamsuriah
    PROCEEDINGS OF THE 2017 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI'17), 2017,
  • [35] An Improved Feature Selection Based on Naive Bayes with Kernel Density Estimator for Opinion Mining
    Raja Rajeswari Sethuraman
    John Sanjeev Kumar Athisayam
    Arabian Journal for Science and Engineering, 2021, 46 : 4059 - 4071
  • [36] A Method for Avoiding Bias from Feature Selection with Application to Naive Bayes Classification Models
    Li, Longhai
    Zhang, Jianguo
    Neal, Radford M.
    BAYESIAN ANALYSIS, 2008, 3 (01): : 171 - 196
  • [37] Robust Method of Sparse Feature Selection for Multi-Label Classification with Naive Bayes
    Ruta, Dymitr
    FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2014, 2014, 2 : 375 - 380
  • [38] An Improved Feature Selection Based on Naive Bayes with Kernel Density Estimator for Opinion Mining
    Sethuraman, Raja Rajeswari
    Athisayam, John Sanjeev Kumar
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (04) : 4059 - 4071
  • [39] Deep Feature Weighting Based on Genetic Algorithm and Naive Bayes for Twitter Sentiment Analysis
    Cahya, Reiza Adi
    Adimanggala, Dinda
    Supianto, Ahmad Afif
    PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET 2019), 2019, : 326 - 331
  • [40] Feature weighting for naive Bayes using multi objective artificial bee colony algorithm
    Chaudhuri, Abhilasha
    Sahu, Tirath Prasad
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2021, 24 (01) : 74 - 88