Feature selection for optimizing the Naive Bayes algorithm

被引:0
|
作者
Winarti, Titin [1 ]
Vydia, Vensy [1 ]
机构
[1] Univ Semarang, Semarang, Indonesia
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Naive Bayes is a data-mining method used in the classification of text-based documents. The advantage of this method is simple algorithms with low calculation complexity. However, Naive Bayes has a weakness where the independence of the Naive Bayes feature cannot always be applied so that it will affect the accuracy of calculations. Naive Bayes therefore needs to be optimized by giving scale using a gain ratio. Weighting with Naive Bayes raises problems in calculating the probability of each document, where many features that do not represent the tested class appear so that there is a misclassification. so weighting with Naive Bayes is still not optimal. This article proposes the optimization of Naive Bayes through using the weighting gain ratio, which is a method of selecting features in the case of text classification. The results of this study indicated that the Naive Bayes optimization method using feature selection and weighting gain ratio produces an accuracy of 94%.
引用
收藏
页码:47 / 51
页数:5
相关论文
共 50 条
  • [41] A Feature-Adjusted Naive Bayes Agorithm
    Kang Shi-Ze
    Ma Hong
    Huang Rui-Yang
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 507 - 510
  • [42] Relevance-diversity algorithm for feature selection and modified Bayes for prediction
    Shaheen, M.
    Naheed, N.
    Ahsan, A.
    ALEXANDRIA ENGINEERING JOURNAL, 2023, 66 : 329 - 342
  • [43] A novel selective naive Bayes algorithm
    Chen, Shenglei
    Webb, Geoffrey I.
    Liu, Linyuan
    Ma, Xin
    KNOWLEDGE-BASED SYSTEMS, 2020, 192
  • [44] Stacking algorithm based on naive Bayes
    Huang, Chen
    Zhou, Yuting
    Yang, Xuemei
    Liu, Shiqi
    Yin, Junping
    2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
  • [45] Optimizing MapReduce Partitioner Using Naive Bayes Classifier
    Chen, Lei
    Lu, Wei
    Wang, Liqiang
    Bao, Ergude
    Xing, Weiwei
    Yang, Yong
    Yuan, Victor
    2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 812 - 819
  • [46] Relationship between Naive Bayes Error and Max-Dependency Criterion in Feature Selection Problems
    Sedaghat, Nafiseh
    Fathy, Mahmood
    Modarressi, Mohammad-Hossein
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE 2013), 2013, : 262 - 266
  • [47] Feature Selection for Text and Image Data Using Differential Evolution with SVM and Naive Bayes Classifiers
    Dixit, Abhishek
    Mani, Ashish
    Bansal, Rohit
    ENGINEERING JOURNAL-THAILAND, 2020, 24 (05): : 161 - 172
  • [48] Intrusion Detection Model Using Chi Square Feature Selection and Modified Naive Bayes Classifier
    Thaseen, I. Sumaiya
    Kumar, Ch. Aswani
    PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON BIG DATA AND CLOUD COMPUTING CHALLENGES (ISBCC - 16'), 2016, 49 : 81 - 91
  • [49] LASSO-based feature selection and naive Bayes classifier for crime prediction and its type
    Nitta, Gnaneswara Rao
    Rao, B. Yogeshwara
    Sravani, T.
    Ramakrishiah, N.
    BalaAnand, M.
    SERVICE ORIENTED COMPUTING AND APPLICATIONS, 2019, 13 (03) : 187 - 197
  • [50] Feature Selection Based on a Genetic Algorithm for Optimizing Weaning Success
    Rosati, Samanta
    Scotto, Andrea
    Fanelli, Vito
    Balestra, Gabriella
    CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 566 - 570