Bayesian Naive Bayes classifiers to text classification

Cited by: 157
Author
Xu, Shuo [1]
Affiliation
[1] Inst Sci & Tech Informat China, Res Ctr Informat Sci Theory & Methodol, 15 Fuxing Rd, Beijing 100038, Peoples R China
Funding
National Science Foundation (United States);
Keywords
Bayesian Naive Bayes classifier; event model; Naive Bayes classifier; text classification; DECISION;
DOI
10.1177/0165551516677946
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Text classification is the task of assigning predefined categories to natural language documents, and it can provide conceptual views of document collections. The Naive Bayes (NB) classifier is a family of simple probabilistic classifiers based on the common assumption that all features are independent of each other given the category variable, and it is often used as a baseline in text classification. However, classical NB classifiers with multinomial, Bernoulli and Gaussian event models are not fully Bayesian. This study proposes three Bayesian counterparts, and it turns out that the classical NB classifier with the Bernoulli event model is equivalent to its Bayesian counterpart. Finally, experimental results on the 20 Newsgroups and WebKB data sets show that the Bayesian NB classifier with the multinomial event model performs similarly to its classical counterpart, whereas the Bayesian NB classifier with the Gaussian event model performs clearly better than its classical counterpart.
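For orientation only, the classical multinomial event model mentioned in the abstract can be sketched as below. This is a minimal illustration of a standard NB baseline with additive smoothing, not the Bayesian variant proposed in the paper. The function names, the smoothing parameter `alpha`, and the toy documents are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict

def train_multinomial_nb(docs, labels, alpha=1.0):
    """Fit class log-priors and smoothed per-class word log-probabilities.

    docs: list of token lists; labels: one class label per document.
    alpha: additive smoothing constant (hypothetical default of 1.0).
    """
    vocab = sorted({w for d in docs for w in d})
    class_docs = Counter(labels)
    word_counts = defaultdict(Counter)
    for d, y in zip(docs, labels):
        word_counts[y].update(d)
    log_prior = {c: math.log(n / len(docs)) for c, n in class_docs.items()}
    log_lik = {}
    for c in class_docs:
        total = sum(word_counts[c].values()) + alpha * len(vocab)
        log_lik[c] = {w: math.log((word_counts[c][w] + alpha) / total)
                      for w in vocab}
    return log_prior, log_lik

def predict(doc, log_prior, log_lik):
    """Return the class with the highest log-posterior score.

    Words are treated as conditionally independent given the class,
    so their log-probabilities are summed. Unseen words are skipped.
    """
    scores = {}
    for c in log_prior:
        scores[c] = log_prior[c] + sum(
            log_lik[c][w] for w in doc if w in log_lik[c])
    return max(scores, key=scores.get)

# Toy corpus, invented purely for illustration.
docs = [["ball", "goal", "team"], ["vote", "law", "senate"],
        ["team", "coach", "goal"], ["law", "court", "vote"]]
labels = ["sport", "politics", "sport", "politics"]
log_prior, log_lik = train_multinomial_nb(docs, labels)
print(predict(["goal", "team", "coach"], log_prior, log_lik))
```

The sum of per-word log-probabilities in `predict` is where the conditional independence assumption enters: each word contributes to the class score independently of the others.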
Pages: 48-59
Page count: 12
Related papers
50 in total
  • [21] Some effective techniques for naive Bayes text classification
    Kim, Sang-Bum
    Han, Kyoung-Soo
    Rim, Hae-Chang
    Myaeng, Sung Hyon
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (11) : 1457 - 1466
  • [22] An improved FloatBoost algorithm for Naive Bayes text classification
    Liu, XM
    Yin, JW
    Dong, JX
    Ghafoor, MA
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 162 - 171
  • [23] Research on text classification mining based on Naive Bayes
    Liu, LZ
    Zhang, CL
    Chen, JJ
    [J]. ISTM/2005: 6TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-9, CONFERENCE PROCEEDINGS, 2005, : 8521 - 8524
  • [24] Directional naive Bayes classifiers
    Pedro L. López-Cruz
    Concha Bielza
    Pedro Larrañaga
    [J]. Pattern Analysis and Applications, 2015, 18 : 225 - 246
  • [25] Research on Archives Text Classification Based on Naive Bayes
    Liu, Peixin
    Yu, Hongzhi
    Xu, Tao
    Lan, Chuanqo
    [J]. PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 187 - 190
  • [26] Modifying Naive Bayes Classifier for Multinomial Text Classification
    Sharma, Neha
    Singh, Manoj
    [J]. 2016 INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2016,
  • [27] Techniques for improving the performance of naive Bayes for text classification
    Schneider, KM
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 682 - 693
  • [28] Directional naive Bayes classifiers
    Lopez-Cruz, Pedro L.
    Bielza, Concha
    Larranaga, Pedro
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2015, 18 (02) : 225 - 246
  • [29] On pairwise naive Bayes classifiers
    Sulzmann, Jan-Nikolas
    Fuernkranz, Johannes
    Huellermeier, Eyke
    [J]. MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 371 - +
  • [30] Landscapes of Naive Bayes classifiers
    Hoare, Zoe
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (01) : 59 - 72