Varying Naive Bayes Models With Applications to Classification of Chinese Text Documents

Cited by: 8
Authors
Guan, Guoyu [1, 2]
Guo, Jianhua [1, 2]
Wang, Hansheng [3]
Affiliations
[1] NE Normal Univ, Key Lab Appl Stat, Minist Educ, Changchun 130024, Peoples R China
[2] NE Normal Univ, Sch Math & Stat, Changchun 130024, Peoples R China
[3] Peking Univ, Dept Business Stat & Econometr, Guanghua Sch Management, Beijing 100871, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
BIC; Chinese document classification; Screening consistency; Time-dependent classification rule; SUPPORT VECTOR MACHINES; VARIABLE SELECTION; DISCRIMINANT-ANALYSIS; SPARSE; INFERENCES; LIKELIHOOD; ALGORITHMS; REGRESSION;
DOI
10.1080/07350015.2014.903086
Chinese Library Classification number
F [Economics]
Discipline classification code
02
Abstract
Document classification is an important area for which many classification methods have been developed. However, most of these methods cannot generate time-dependent classification rules. Thus, they are not well suited to problems with time-varying structures. To address this problem, we propose a varying naive Bayes model, a natural extension of the naive Bayes model that allows for time-dependent classification rules. A kernel smoothing method is developed for parameter estimation, and a BIC-type criterion is proposed for feature selection. Asymptotic theory is developed and numerical studies are conducted. Finally, the proposed method is demonstrated on a real dataset generated by the Mayor Public Hotline of Changchun, the capital city of Jilin Province in Northeast China.
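The abstract does not state the estimator itself, so the following is only a minimal sketch of how kernel smoothing could yield a time-dependent naive Bayes rule. It assumes binary word-presence features, a Gaussian kernel, and kernel-weighted relative frequencies; the function names, the bandwidth `h`, and the smoothing constant `eps` are hypothetical choices rather than the authors' specification, and the BIC-type feature selection step is not shown.

```python
import numpy as np


def gaussian_kernel(d, h):
    """Unnormalised Gaussian kernel weights for time gaps d and bandwidth h."""
    return np.exp(-0.5 * (d / h) ** 2)


def fit_predict_varying_nb(X, y, u, X_new, u_new, h=0.1, eps=1e-3):
    """Classify X_new observed at times u_new with kernel-weighted naive Bayes.

    X: (n, p) binary feature matrix, y: (n,) class labels, u: (n,) times.
    X_new: (m, p) documents to classify, u_new: (m,) their times.
    h and eps are illustrative assumptions, not values from the paper.
    """
    X = np.asarray(X, dtype=float)
    X_new = np.asarray(X_new, dtype=float)
    y = np.asarray(y)
    u = np.asarray(u, dtype=float)
    u_new = np.asarray(u_new, dtype=float)
    classes = np.unique(y)

    # w[m, i]: weight of training document i when predicting at time u_new[m]
    w = gaussian_kernel(u_new[:, None] - u[None, :], h)
    scores = np.empty((len(u_new), len(classes)))

    for c_idx, c in enumerate(classes):
        wc = w * (y == c)[None, :]      # keep only class-c training documents
        mass = wc.sum(axis=1)           # local effective sample size of class c
        # Time-local estimate of P(X_j = 1 | Y = c, U = u_new[m]), kept in (0, 1)
        theta = (wc @ X + eps) / (mass[:, None] + 2 * eps)
        # Time-local estimate of the class prior P(Y = c | U = u_new[m])
        prior = (mass + eps) / (w.sum(axis=1) + eps * len(classes))
        loglik = X_new * np.log(theta) + (1 - X_new) * np.log1p(-theta)
        scores[:, c_idx] = np.log(prior) + loglik.sum(axis=1)

    # Pick the class with the largest time-local log posterior score
    return classes[scores.argmax(axis=1)]
```

Because the weights depend on the prediction time, the same document can be assigned to different classes at different times, which is the behaviour a fixed naive Bayes rule cannot reproduce.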
Pages: 445-456
Page count: 12