Naive Bayes Classification Algorithm Based on Optimized Training Data

被引:5
|
作者
Zhu, Xiaodan [1 ]
Su, Jinsong [1 ]
Wu, Qingfeng [1 ]
Dong, Huailin [1 ]
机构
[1] Xiamen Univ, Software Sch, Xiamen, Peoples R China
关键词
optimized training data; effectiveness; Naive Bayes;
D O I
10.4028/www.scientific.net/AMR.490-495.460
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Naive Bayes classification algorithm is an effective simple classification algorithm. Most researches in traditional Naive Bayes classification focus on the improvement of the classification algorithm, ignoring the selection of training data which has a great effect on the performance of classifier. And so a method is proposed to optimize the selection of training data in this paper. Adopting this method, the noisy instances in training data are eliminated by user-defined effectiveness threshold, improving the performance of classifier. Experimental results on large-scale data show that our approach significantly outperforms the baseline classifier.
引用
收藏
页码:460 / 464
页数:5
相关论文
共 50 条
  • [41] A Study of the Naive Bayes Classification Based on the Laplacian Matrix
    Jiang, Lei
    Yuan, Peng
    Zhang, Qiongbing
    Liu, Qi
    IAENG International Journal of Computer Science, 2020, 47 (04) : 1 - 10
  • [42] Research of Classification System based on Naive Bayes and MetaClass
    Ren, Bin
    Cheng, Lianglun
    ICIC 2009: SECOND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTING SCIENCE, VOL 3, PROCEEDINGS, 2009, : 154 - 156
  • [43] Research on text classification mining based on Naive Bayes
    Liu, LZ
    Zhang, CL
    Chen, JJ
    ISTM/2005: 6TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-9, CONFERENCE PROCEEDINGS, 2005, : 8521 - 8524
  • [44] Research on Archives Text Classification Based on Naive Bayes
    Liu, Peixin
    Yu, Hongzhi
    Xu, Tao
    Lan, Chuanqo
    PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 187 - 190
  • [45] An ensemble of the distance-based and Naive Bayes classifiers for the online classification with data reduction
    Jedrzejowicz, Joanna
    Jedrzejowicz, Piotr
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 32 (02) : 1289 - 1296
  • [46] Improving naive bayes for classification
    Jiang L.
    Cai Z.
    Wang D.
    International Journal of Computers and Applications, 2010, 32 (03) : 328 - 332
  • [47] Generator Fault Classification Method Based on Multi-Source Information Fusion Naive Bayes Classification Algorithm
    Wang, Yi
    Huang, Yuhao
    Yang, Kai
    Chen, Zhihan
    Luo, Cheng
    ENERGIES, 2022, 15 (24)
  • [48] Parallel naive Bayes algorithm for large-scale Chinese text classification based on spark
    Liu Peng
    Zhao Hui-han
    Teng Jia-yu
    Yang Yan-yan
    Liu Ya-feng
    Zhu Zong-wei
    JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2019, 26 (01) : 1 - 12
  • [49] Text Classification on Mahout with Naive-Bayes Machine Learning Algorithm
    Salur, Mehmet Umut
    Tokat, Sezai
    Aydilek, Ibrahim Berkan
    2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,
  • [50] An Improved Naive Bayes Text Classification Algorithm In Chinese Information Processing
    Yuan, Lingling
    THIRD INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2010), 2010, : 267 - 269