Techniques for improving the performance of naive Bayes for text classification

被引:0
|
作者
Schneider, KM [1 ]
机构
[1] Univ Passau, Dept Gen Linguist, D-94032 Passau, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Naive Bayes is often used in text classification applications and experiments because of its simplicity and effectiveness. However, its performance is often degraded because it does not model text well, and by inappropriate feature selection and the lack of reliable confidence scores. We address these problems and show that they can be solved by some simple corrections. We demonstrate that our simple modifications are able to improve the performance of Naive B ayes for text classification significantly.
引用
收藏
页码:682 / 693
页数:12
相关论文
共 50 条
  • [31] Integrating associative rule-based classification with Naive Bayes for text classification
    Hadi, Wa'el
    Al-Radaideh, Qasem A.
    Alhawari, Samer
    [J]. APPLIED SOFT COMPUTING, 2018, 69 : 344 - 356
  • [32] Improving Naive Bayes Models of Insurance Risk by Unsupervised Classification
    Jurek, Anna
    Zakrzewska, Danuta
    [J]. 2008 INTERNATIONAL MULTICONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (IMCSIT), VOLS 1 AND 2, 2008, : 122 - +
  • [33] Improving the performance of Naive Bayes classifier for spam detection
    Yang, Zhen
    Guo, Jun
    Xu, Weiran
    Chen, Bo
    Hu, Jiani
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 694 - 698
  • [34] Naive Bayes text classifier
    Zhang, Haiyi
    Li, Di
    [J]. GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711
  • [35] Text Classification on Mahout with Naive-Bayes Machine Learning Algorithm
    Salur, Mehmet Umut
    Tokat, Sezai
    Aydilek, Ibrahim Berkan
    [J]. 2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,
  • [36] Constrained domain maximum likelihood estimation for naive Bayes text classification
    Andres-Ferrer, Jesus
    Juan, Alfons
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2010, 13 (02) : 189 - 196
  • [37] An Improved Naive Bayes Text Classification Algorithm In Chinese Information Processing
    Yuan, Lingling
    [J]. THIRD INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2010), 2010, : 267 - 269
  • [38] Deep feature weighting for naive Bayes and its application to text classification
    Jiang, Liangxiao
    Li, Chaoqun
    Wang, Shasha
    Zhang, Lungan
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 52 : 26 - 39
  • [39] A Method of Text Classification Combining Naive Bayes and the Similarity Computing Algorithms
    Hong, Yinghan
    Mai, Guizhen
    Zeng, Hui
    Guo, Cai
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2015 WORKSHOPS, 2015, 9461 : 3 - 14
  • [40] Divergence-Based Feature Selection for Naive Bayes Text Classification
    Wang, Huizhen
    Zhu, Jingbo
    Su, Keh-Yih
    [J]. IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 209 - +