Augmenting naive Bayes classifiers with statistical language models

被引:128
|
作者
Peng, FC
Schuurmans, D
Wang, SJ
机构
[1] Univ Massachusetts, Dept Comp Sci, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
[3] Univ Waterloo, Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
来源
INFORMATION RETRIEVAL | 2004年 / 7卷 / 3-4期
关键词
naive Bayes; text classification; n-gram language models; smoothing;
D O I
10.1023/B:INRT.0000011209.19643.e2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We augment naive Bayes models with statistical n-gram language models to address short-comings of the standard naive Bayes text classifier. The result is a generalized naive Bayes classifier which allows for a local Markov dependence among observations; a model we refer to as the Chain Augmented Naive Bayes (CAN) Bayes classifier. CAN models have two advantages over standard naive Bayes classifiers. First, they relax some of the independence assumptions of naive Bayes - allowing a local Markov chain dependence in the observed variables - while still permitting efficient inference and learning. Second, they permit straightforward application of sophisticated smoothing techniques from statistical language modeling, which allows one to obtain better parameter estimates than the standard Laplace smoothing used in naive Bayes classification. In this paper, we introduce CAN models and apply them to various text classification problems. To demonstrate the language independent and task independent nature of these classifiers, we present experimental results on several text classification problems - authorship attribution, text genre classification, and topic detection - in several languages - Greek, English, Japanese and Chinese. We then systematically study the key factors in the CAN model that can influence the classification performance, and analyze the strengths and weaknesses of the model.
引用
收藏
页码:317 / 345
页数:29
相关论文
共 50 条
  • [1] Augmenting Naive Bayes Classifiers with Statistical Language Models
    Fuchun Peng
    Dale Schuurmans
    Shaojun Wang
    [J]. Information Retrieval, 2004, 7 : 317 - 345
  • [2] Investigating the Statistical Assumptions of Naive Bayes Classifiers
    Kelly, Anthony
    Johnson, Marc Anthony
    [J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [3] Directional naive Bayes classifiers
    Pedro L. López-Cruz
    Concha Bielza
    Pedro Larrañaga
    [J]. Pattern Analysis and Applications, 2015, 18 : 225 - 246
  • [4] Directional naive Bayes classifiers
    Lopez-Cruz, Pedro L.
    Bielza, Concha
    Larranaga, Pedro
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2015, 18 (02) : 225 - 246
  • [5] On pairwise naive Bayes classifiers
    Sulzmann, Jan-Nikolas
    Fuernkranz, Johannes
    Huellermeier, Eyke
    [J]. MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 371 - +
  • [6] Landscapes of Naive Bayes classifiers
    Hoare, Zoe
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (01) : 59 - 72
  • [7] Incremental augmented naive Bayes classifiers
    Alcobé, JR
    [J]. ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 539 - 543
  • [8] Evolving extended naive bayes classifiers
    Klawonn, Frank
    Angelov, Plamen
    [J]. ICDM 2006: Sixth IEEE International Conference on Data Mining, Workshops, 2006, : 643 - 647
  • [9] Error Estimation on Hybrid Naive Bayes Classifiers
    Chen, Yilan
    Wang, Huanbao
    [J]. ADVANCES IN COMPUTATIONAL ENVIRONMENT SCIENCE, 2012, 142 : 239 - 246
  • [10] Learning Naive Bayes Classifiers with Incomplete Data
    Leng, Cuiping
    Wang, Shuangcheng
    Wang, Hui
    [J]. 2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL IV, PROCEEDINGS, 2009, : 350 - +