Multinomial naive Bayes for text categorization revisited

Cited by: 0
Authors
Kibriya, AM [1 ]
Frank, E [1 ]
Pfahringer, B [1 ]
Holmes, G [1 ]
Affiliation
[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents empirical results for several versions of the multinomial naive Bayes classifier on four text categorization problems, and a way of improving it using locally weighted learning. More specifically, it compares standard multinomial naive Bayes to the recently proposed transformed weight-normalized complement naive Bayes classifier (TWCNB) [1], and shows that some of the modifications included in TWCNB may not be necessary to achieve optimum performance on some datasets. However, it does show that TFIDF conversion and document length normalization are important. It also shows that support vector machines can, in fact, sometimes very significantly outperform both methods. Finally, it shows how the performance of multinomial naive Bayes can be improved using locally weighted learning. However, the overall conclusion of our paper is that support vector machines are still the method of choice if the aim is to maximize accuracy.
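The two preprocessing steps the abstract singles out as important, TFIDF conversion and document length normalization, can be illustrated with a minimal sketch. The abstract does not give the formulas, so the ones used here are assumptions: a log(1 + tf) term-frequency transform, idf = log(N / df), L2 length normalization, and Laplace smoothing with a hypothetical `alpha` parameter. The function names and the toy corpus are also made up for illustration. TWCNB's complement and weight-normalization steps, locally weighted learning, and support vector machines are not covered.

```python
import math
from collections import Counter, defaultdict


def fit_idf(docs):
    """Compute idf(t) = log(N / df(t)) from tokenized training documents."""
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}


def transform(doc, idf):
    """Map a token list to L2-normalized log(1 + tf) * idf weights.

    Terms that were not seen by fit_idf are dropped.
    """
    counts = Counter(term for term in doc if term in idf)
    weights = {t: math.log(1 + c) * idf[t] for t, c in counts.items()}
    # Guard against an all-zero vector (e.g. only terms with idf == 0).
    norm = math.sqrt(sum(w * w for w in weights.values())) or 1.0
    return {t: w / norm for t, w in weights.items()}


def train_mnb(vectors, labels, vocab, alpha=1.0):
    """Fit multinomial naive Bayes on fractional (transformed) counts."""
    term_mass = defaultdict(lambda: defaultdict(float))
    for vec, label in zip(vectors, labels):
        for term, weight in vec.items():
            term_mass[label][term] += weight

    class_counts = Counter(labels)
    log_prior = {c: math.log(k / len(labels)) for c, k in class_counts.items()}

    log_cond = {}
    for c in class_counts:
        denom = sum(term_mass[c].values()) + alpha * len(vocab)
        log_cond[c] = {
            t: math.log((term_mass[c].get(t, 0.0) + alpha) / denom)
            for t in vocab
        }
    return log_prior, log_cond


def predict(vec, log_prior, log_cond):
    """Return the class with the highest log prior plus weighted log likelihood."""
    scores = {
        c: log_prior[c] + sum(w * log_cond[c][t] for t, w in vec.items())
        for c in log_prior
    }
    return max(scores, key=scores.get)


# Toy corpus (hypothetical data, for illustration only).
train_docs = [
    ["ball", "goal", "team"],
    ["team", "match", "goal"],
    ["vote", "election", "party"],
    ["party", "vote", "law"],
]
train_labels = ["sport", "sport", "politics", "politics"]

idf = fit_idf(train_docs)
vectors = [transform(doc, idf) for doc in train_docs]
model = train_mnb(vectors, train_labels, vocab=set(idf))

print(predict(transform(["goal", "team", "team"], idf), *model))
```

Because every document is scaled to unit length before training, long documents do not dominate the per-class term totals, which is the motivation usually given for length normalization in this setting.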
Pages: 488-499
Page count: 12
Related papers
50 in total
  • [21] Multinomial Naive Bayes for real-time gender recognition
    Vergara, Diego
    Hernandez, Sergio
    Jorquera, Felipe
    [J]. 2016 XXI SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND ARTIFICIAL VISION (STSIVA), 2016,
  • [22] Indoor Localization Using Improved Multinomial Naive Bayes Technique
    Ul Haq, Muhammad Aziz
    Kamboh, Hammid Mehmood Allahdita
    Akram, Usman
    Sohail, Amer
    Iram, Hifsa
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL AFRO-EUROPEAN CONFERENCE FOR INDUSTRIAL ADVANCEMENT-AECIA 2016, 2018, 565 : 321 - 329
  • [23] Large Margin Multinomial Mixture Model for Text Categorization
    Pan, Zhen-Yu
    Jiang, Hui
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1566 - +
  • [24] Multinomial Naive Bayes using similarity based conditional probability
    Santhi, B.
    Brindha, G. R.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (02) : 1431 - 1441
  • [25] Modified Multinomial Naive Bayes Algorithm for Heart Disease Prediction
    Marikani, T.
    Shyamala, K.
    [J]. INTELLIGENT COMMUNICATION TECHNOLOGIES AND VIRTUAL MOBILE NETWORKS, ICICV 2019, 2020, 33 : 294 - 300
  • [26] Adapting naive Bayes tree for text classification
    Shasha Wang
    Liangxiao Jiang
    Chaoqun Li
    [J]. Knowledge and Information Systems, 2015, 44 : 77 - 89
  • [27] Adapting Hidden Naive Bayes for Text Classification
    Gan, Shengfeng
    Shao, Shiqi
    Chen, Long
    Yu, Liangjun
    Jiang, Liangxiao
    [J]. MATHEMATICS, 2021, 9 (19)
  • [28] Bayesian Naive Bayes classifiers to text classification
    Xu, Shuo
    [J]. JOURNAL OF INFORMATION SCIENCE, 2018, 44 (01) : 48 - 59
  • [29] Feature selection for text classification with Naive Bayes
    Chen, Jingnian
    Huang, Houkuan
    Tian, Shengfeng
    Qu, Youli
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 5432 - 5435
  • [30] Naive Bayes for text classification with unbalanced classes
    Frank, Eibe
    Bouckaert, Remco R.
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2006, PROCEEDINGS, 2006, 4213 : 503 - 510