A New Instance-weighting Naive Bayes Text Classifiers

被引:0
|
作者
Wu, Yongcheng [1 ]
机构
[1] Jingchu Univ Technol, Comp Engn Sch, Jingmen, Peoples R China
关键词
naive bayes; instance-weighting; text classifier; multinomial naive bayes; CNB; OVA;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is shown in recent research that naive Bayes text classifiers have achieved noticeable classification performance despite its strong assumption of conditional independence among features. In order to weaken this unrealistic assumption and improve the classification accuracy, there are generally three methods: structures manipulating, features manipulating, and instances manipulating. Instances manipulating can be further divided into instance-weighting and instance-selecting. In this paper, we propose a new instance-weighting approach to naive Bayes text classifier. In this new approach, the training dataset is firstly divided into several subsets according to their class value. Then every training instance in a subset is weighted according to the distance between it and the mean of the training subset. The experimental results on 15 text document datasets show that in terms of the accuracy of classification, our method performs better than three existing naive Bayes text classifiers.
引用
收藏
页码:198 / 202
页数:5
相关论文
共 50 条
  • [41] USING INSTANCE CLONING TO IMPROVE NAIVE BAYES FOR RANKING
    Jiang, Liangxiao
    Wang, Dianhong
    Zhang, Harry
    Cai, Zhihua
    Huang, Bo
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2008, 22 (06) : 1121 - 1140
  • [42] An Ensemble of Naive Bayes Classifiers for Uncertain Categorical Data
    de Holanda Maia, Marcelo Rodrigues
    Plastino, Alexandre
    Freitas, Alex A.
    [J]. 2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1222 - 1227
  • [43] SSV criterion based discretization for naive Bayes classifiers
    Grabczewski, K
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2004, 2004, 3070 : 574 - 579
  • [44] Augmenting naive Bayes classifiers with statistical language models
    Peng, FC
    Schuurmans, D
    Wang, SJ
    [J]. INFORMATION RETRIEVAL, 2004, 7 (3-4): : 317 - 345
  • [45] Naive Bayes classifiers that perform well with continuous variables
    Bouckaert, RR
    [J]. AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 1089 - 1094
  • [46] A Novel Naive Bayes Voting Strategy for Combining Classifiers
    De Stefano, C.
    Fontanella, F.
    di Freca, A. Scotto
    [J]. 13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 467 - 472
  • [47] On why discretization works for naive-Bayes classifiers
    Yang, Y
    Webb, GI
    [J]. AI 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2003, 2903 : 440 - 452
  • [48] Naive Bayes classifiers learned from incomplete data
    Chen, Jingnian
    Huang, Houkuan
    Tian, Fengzhan
    Qiao, Zhufeng
    [J]. Jisuanji Gongcheng/Computer Engineering, 2006, 32 (17): : 86 - 88
  • [49] Tool wear monitoring using naive Bayes classifiers
    Karandikar, Jaydeep
    McLeay, Tom
    Turner, Sam
    Schmitz, Tony
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2015, 77 (9-12): : 1613 - 1626
  • [50] Heterodimeric protein complex identification by naive Bayes classifiers
    Maruyama, Osamu
    [J]. BMC BIOINFORMATICS, 2013, 14