Feature selection for optimizing the Naive Bayes algorithm

被引:0
|
作者
Winarti, Titin [1 ]
Vydia, Vensy [1 ]
机构
[1] Univ Semarang, Semarang, Indonesia
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Naive Bayes is a data-mining method used in the classification of text-based documents. The advantage of this method is simple algorithms with low calculation complexity. However, Naive Bayes has a weakness where the independence of the Naive Bayes feature cannot always be applied so that it will affect the accuracy of calculations. Naive Bayes therefore needs to be optimized by giving scale using a gain ratio. Weighting with Naive Bayes raises problems in calculating the probability of each document, where many features that do not represent the tested class appear so that there is a misclassification. so weighting with Naive Bayes is still not optimal. This article proposes the optimization of Naive Bayes through using the weighting gain ratio, which is a method of selecting features in the case of text classification. The results of this study indicated that the Naive Bayes optimization method using feature selection and weighting gain ratio produces an accuracy of 94%.
引用
收藏
页码:47 / 51
页数:5
相关论文
共 50 条
  • [1] Text Classification Based on Naive Bayes Algorithm with Feature Selection
    Chen, Zhenguo
    Shi, Guang
    Wang, Xiaoju
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (10): : 4255 - 4260
  • [2] Naive Bayes-Guided Bat Algorithm for Feature Selection
    Taha, Ahmed Majid
    Mustapha, Aida
    Chen, Soong-Der
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [3] Naive Feature Selection: Sparsity in Naive Bayes
    Askari, Armin
    d'Aspremont, Alex
    El Ghaoui, Laurent
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1813 - 1821
  • [4] Feature selection for text classification with Naive Bayes
    Chen, Jingnian
    Huang, Houkuan
    Tian, Shengfeng
    Qu, Youli
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 5432 - 5435
  • [5] Learning naive Bayes for probability estimation by feature selection
    Jiang, Liangxiao
    Zhang, Harry
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4013 : 503 - 514
  • [6] Feature selection for unbalanced class distribution and Naive Bayes
    Mladenic, D
    Grobelnik, M
    MACHINE LEARNING, PROCEEDINGS, 1999, : 258 - 267
  • [7] Naive Feature Selection: A Nearly Tight Convex Relaxation for Sparse Naive Bayes
    Askari, Armin
    d'Aspremont, Alexandre
    El Ghaoui, Laurent
    MATHEMATICS OF OPERATIONS RESEARCH, 2024, 49 (01) : 278 - 296
  • [8] Feature Selection Model using Naive Bayes ML Algorithm for WSN Intrusion Detection System
    Jeevaraj, Deepa
    Vijayan, T.
    Karthik, B.
    Sriram, M.
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (02) : 179 - 185
  • [9] Comparison of Naive Bayes and Decision Tree on Feature Selection Using Genetic Algorithm for Classification Problem
    Rahmadani, S.
    Dongoran, A.
    Zarlis, M.
    Zakarias
    2ND INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2017, 2018, 978
  • [10] HYBRID FEATURE SELECTION APPROACH USING BACTERIAL FORAGING ALGORITHM GUIDED BY NAIVE BAYES CLASSIFICATION
    Mittal, Divya
    Bala, Manju
    2017 8TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2017,