DISCRIMINATIVELY WEIGHTED NAIVE BAYES AND ITS APPLICATION IN TEXT CLASSIFICATION

被引:53
|
作者
Jiang, Liangxiao [1 ]
Wang, Dianghong [2 ]
Cai, Zhihua [1 ]
机构
[1] China Univ Geosci, Dept Comp Sci, Wuhan 430074, Hubei, Peoples R China
[2] China Univ Geosci, Dept Elect Engn, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Naive Bayes; discriminatively weighted naive Bayes; instance weighting; discriminative instance weighting; discriminative learning; ROC CURVE; AREA;
D O I
10.1142/S0218213011004770
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many approaches are proposed to improve naive Bayes by weakening its conditional independence assumption. In this paper, we work on the approach of instance weighting and propose an improved naive Bayes algorithm by discriminative instance weighting. We called it Discriminatively Weighted Naive Bayes. In each iteration of it, different training instances are discriminatively assigned different weights according to the estimated conditional probability loss. The experimental results based on a large number of UCI data sets validate its effectiveness in terms of the classification accuracy and AUC. Besides,the experimental results on the running time show that our Discriminatively Weighted Naive Bayes performs almost as efficiently as the state-of-the-art Discriminative Frequency Estimate learning method, and significantly more efficient than Boosted Naive Bayes. At last, we apply the idea of discriminatively weighted learning in our algorithm to some state-of-the-art naive Bayes text classifiers, such as multinomial naive Bayes, complement naive Bayes and the one-versus-all-but-one model, and have achieved remarkable improvements.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Feature subset selection using naive Bayes for text classification
    Feng, Guozhong
    Guo, Jianhua
    Jing, Bing-Yi
    Sun, Tieli
    [J]. PATTERN RECOGNITION LETTERS, 2015, 65 : 109 - 115
  • [32] Laplace Naive Bayes classifier in the classification of text in machine learning
    Kalcheva, Neli
    Nikolov, Nedyalko
    [J]. PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON BIOMEDICAL INNOVATIONS AND APPLICATIONS (BIA 2020), 2020, : 18 - 20
  • [33] A Chinese text classification system based on Naive Bayes algorithm
    Cui, Wei
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRONIC, INFORMATION AND COMPUTER ENGINEERING, 2016, 44
  • [34] Fast Text Classification with Naive Bayes Method on Apache Spark
    Ogul, Iskender Ulgen
    Ozcan, Caner
    Hakdagli, Ozlem
    [J]. 2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [35] Improved Naive Bayes with optimal correlation factor for text classification
    Jiangning Chen
    Zhibo Dai
    Juntao Duan
    Heinrich Matzinger
    Ionel Popescu
    [J]. SN Applied Sciences, 2019, 1
  • [36] A Double Weighted Naive Bayes for Multi-label Classification
    Yan, Xuesong
    Li, Wei
    Wu, Qinghua
    Sheng, Victor S.
    [J]. COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, (ISICA 2015), 2016, 575 : 382 - 389
  • [37] ANALYSIS OF RELATIONSHIP BETWEEN RENYI ENTROPY AND MARGINAL BAYES ERROR AND ITS APPLICATION TO WEIGHTED NAIVE BAYES CLASSIFIERS
    Endo, Tomomi
    Omura, Kazuhiro
    Kudo, Mineichi
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2014, 28 (07)
  • [38] Integrating associative rule-based classification with Naive Bayes for text classification
    Hadi, Wa'el
    Al-Radaideh, Qasem A.
    Alhawari, Samer
    [J]. APPLIED SOFT COMPUTING, 2018, 69 : 344 - 356
  • [39] Collaboratively weighted naive Bayes
    Huan Zhang
    Liangxiao Jiang
    Chaoqun Li
    [J]. Knowledge and Information Systems, 2021, 63 : 3159 - 3182
  • [40] Naive Bayes text classifier
    Zhang, Haiyi
    Li, Di
    [J]. GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711