Feature selection for optimizing the Naive Bayes algorithm

被引:0
|
作者
Winarti, Titin [1 ]
Vydia, Vensy [1 ]
机构
[1] Univ Semarang, Semarang, Indonesia
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Naive Bayes is a data-mining method used in the classification of text-based documents. The advantage of this method is simple algorithms with low calculation complexity. However, Naive Bayes has a weakness where the independence of the Naive Bayes feature cannot always be applied so that it will affect the accuracy of calculations. Naive Bayes therefore needs to be optimized by giving scale using a gain ratio. Weighting with Naive Bayes raises problems in calculating the probability of each document, where many features that do not represent the tested class appear so that there is a misclassification. so weighting with Naive Bayes is still not optimal. This article proposes the optimization of Naive Bayes through using the weighting gain ratio, which is a method of selecting features in the case of text classification. The results of this study indicated that the Naive Bayes optimization method using feature selection and weighting gain ratio produces an accuracy of 94%.
引用
收藏
页码:47 / 51
页数:5
相关论文
共 50 条
  • [21] Feature weighted naive Bayes algorithm for information retrieval of enterprise systems
    Wang, Li
    Ji, Ping
    Qi, Jing
    Shan, Siqing
    Bi, Zhuming
    Deng, Weiguo
    Zhang, Naijing
    ENTERPRISE INFORMATION SYSTEMS, 2014, 8 (01) : 107 - 120
  • [22] Principal Feature Selection Impact for Internet Traffic Classification Using Naive Bayes
    Paramita, Adi Suryaputra
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL SYSTEMS, TECHNOLOGY AND INFORMATION 2015 (ICESTI 2015), 2016, 365 : 475 - 480
  • [23] Discrimination-based feature selection for multinomial naive Bayes text classification
    Zhu, Jingbo
    Wang, Huizhen
    Zhang, Xijuan
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 149 - +
  • [24] Predicting peptide bond conformation using feature selection and the Naive Bayes approach
    Exarchos, Kostas P.
    Exarchos, Themis P.
    Papaloukas, Costas
    Troganis, Anastassios N.
    Fotiadis, Dimitrios I.
    2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 5009 - +
  • [25] Speeding up incremental wrapper feature subset selection with Naive Bayes classifier
    Bermejo, Pablo
    Gamez, Jose A.
    Puerta, Jose M.
    KNOWLEDGE-BASED SYSTEMS, 2014, 55 : 140 - 147
  • [26] An Efficient Feature Selection using Parallel Cuckoo Search and Naive Bayes classifier
    Sujana, T. Sai
    Rao, N. Madhu Sudana
    Reddy, Raja Sekar
    2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 167 - 172
  • [27] Iterative Feature Selection using Information Gain & Naive Bayes for Document Classification
    Rahman, Chowdhury Mofizur
    Afroze, Lameya
    Refath, Naznin Sultana
    Shawon, Nafin
    2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [28] A Naive Bayes approach for URL classification with supervised feature selection and rejection framework
    Rajalakshmi, R.
    Aravindan, Chandrabose
    COMPUTATIONAL INTELLIGENCE, 2018, 34 (01) : 363 - 396
  • [29] Feature Selection Based on Sampling and C4.5 Algorithm to Improve the Quality of Text Classification Using Naive Bayes
    Molano, Viviana
    Cobos, Carlos
    Mendoza, Martha
    Herrera-Viedma, Enrique
    Manic, Milos
    HUMAN-INSPIRED COMPUTING AND ITS APPLICATIONS, PT I, 2014, 8856 : 80 - 91
  • [30] Weakening Feature Independence of Naive Bayes Using Feature Weighting and Selection on Imbalanced Customer Review Data
    Cahya, Reiza Adi
    Bachtiar, Fitra A.
    2019 5TH INTERNATIONAL CONFERENCE ON SCIENCE ININFORMATION TECHNOLOGY (ICSITECH): EMBRACING INDUSTRY 4.0 - TOWARDS INNOVATION IN CYBER PHYSICAL SYSTEM, 2019, : 182 - 187