Boosting Naive Bayes Text Categorization by Using Cloud Model

被引:0
|
作者
Wan, Jian [1 ]
He, Tingting [1 ]
Chen, Jinguang [2 ,3 ]
Dong, Jinling [1 ]
机构
[1] Huazhong Normal Univ, Dept Comp Sci & Technol, Wuhan 430079, Peoples R China
[2] Huazhong Normal Univ, Engn & Res Ctr Informat Technol Educ, Wuhan 430079, Peoples R China
[3] Huzhou Teachers Coll, Sch Teacher Educ, Wuhan 313000, Peoples R China
基金
中国国家自然科学基金;
关键词
Naive Bayes; Cloud Model; Feature Selection; Text Categorization; FEATURE-SELECTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a method which improves effectiveness of Naive Bayes text categorization by using cloud model. The traditional Naive Bayes text categorization directly uses term frequency to describe the relationship between words and categories. In deed, there are many words with high frequency do not have a close relevance with the category. To solve this problem, we introduce cloud model theory into Naive Bayes text classification and build a new feature selection system. By using numerical characteristics of cloud, we obtain more representative features. Experimental results on 20 Newsgroups show that our method can improve accuracy of text categorization remarkably.
引用
收藏
页码:165 / +
页数:2
相关论文
共 50 条
  • [1] Multinomial naive Bayes for text categorization revisited
    Kibriya, AM
    Frank, E
    Pfahringer, B
    Holmes, G
    [J]. AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 488 - 499
  • [2] Toward Optimal Feature Selection in Naive Bayes for Text Categorization
    Tang, Bo
    Kay, Steven
    He, Haibo
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (09) : 2508 - 2521
  • [3] Combining active learning and boosting for Naive Bayes text classifiers
    Kim, HJ
    Kim, J
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT: PROCEEDINGS, 2004, 3129 : 519 - 527
  • [4] Boosting Naive Bayes text classification using uncertainty-based selective sampling
    Kim, HJ
    Kim, JU
    Ra, YG
    [J]. NEUROCOMPUTING, 2005, 67 : 403 - 410
  • [5] A new term-weighting scheme for naive Bayes text categorization
    Mendoza, Marcelo
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2012, 8 (01) : 55 - +
  • [6] Using boosting in a multilabel text categorization problem
    Muruzábal, J
    Souto, EG
    [J]. INFORMATION TECHNOLOGY AND ORGANIZATIONS: TRENDS, ISSUES, CHALLENGES AND SOLUTIONS, VOLS 1 AND 2, 2003, : 431 - 433
  • [7] Muscle Categorization Using PDF Estimation and Naive Bayes Classification
    Adel, Tameem M.
    Smith, Benn E.
    Stashuk, Daniel W.
    [J]. 2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2012, : 2619 - 2622
  • [8] Naive Bayes Text Categorization Algorithm Based on TF-IDF Attribute Weighting
    Jiang, Feng
    Zhang, Zhenghao
    Chen, Ping
    Liu, Yongrui
    [J]. PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 521 - 525
  • [9] Boosting naive bayes by active learning
    Wang, LM
    Yuan, SM
    Li, L
    Li, HJ
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1383 - 1386
  • [10] Boosting the Discriminant Power of Naive Bayes
    Wang, Shihe
    Ren, Jianfeng
    Lian, Xiaoyu
    Bai, Ruibin
    Jiang, Xudong
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4906 - 4912