Varying Naive Bayes Models With Applications to Classification of Chinese Text Documents

被引:8
|
作者
Guan, Guoyu [1 ,2 ]
Guo, Jianhua [1 ,2 ]
Wang, Hansheng [3 ]
机构
[1] NE Normal Univ, Key Lab Appl Stat, Minist Educ, Changchun 130024, Peoples R China
[2] NE Normal Univ, Sch Math & Stat, Changchun 130024, Peoples R China
[3] Peking Univ, Dept Business Stat & Econometr, Guanghua Sch Management, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
BIC; Chinese document classification; Screening consistency; Time-dependent classification rule; SUPPORT VECTOR MACHINES; VARIABLE SELECTION; DISCRIMINANT-ANALYSIS; SPARSE; INFERENCES; LIKELIHOOD; ALGORITHMS; REGRESSION;
D O I
10.1080/07350015.2014.903086
中图分类号
F [经济];
学科分类号
02 ;
摘要
Document classification is an area of great importance for which many classification methods have been developed. However, most of these methods cannot generate time-dependent classification rules. Thus, they are not the best choices for problems with time-varying structures. To address this problem, we propose a varying naive Bayes model, which is a natural extension of the naive Bayes model that allows for time-dependent classification rule. The method of kernel smoothing is developed for parameter estimation and a BIC-type criterion is invented for feature selection. Asymptotic theory is developed and numerical studies are conducted. Finally, the proposed method is demonstrated on a real dataset, which was generated by the Mayor Public Hotline of Changchun, the capital city of Jilin Province in Northeast China.
引用
收藏
页码:445 / 456
页数:12
相关论文
共 50 条
  • [41] Constrained domain maximum likelihood estimation for naive Bayes text classification
    Andres-Ferrer, Jesus
    Juan, Alfons
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2010, 13 (02) : 189 - 196
  • [42] Text Classification on Mahout with Naive-Bayes Machine Learning Algorithm
    Salur, Mehmet Umut
    Tokat, Sezai
    Aydilek, Ibrahim Berkan
    [J]. 2017 INTERNATIONAL ARTIFICIAL INTELLIGENCE AND DATA PROCESSING SYMPOSIUM (IDAP), 2017,
  • [43] Deep feature weighting for naive Bayes and its application to text classification
    Jiang, Liangxiao
    Li, Chaoqun
    Wang, Shasha
    Zhang, Lungan
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 52 : 26 - 39
  • [44] A Method of Text Classification Combining Naive Bayes and the Similarity Computing Algorithms
    Hong, Yinghan
    Mai, Guizhen
    Zeng, Hui
    Guo, Cai
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2015 WORKSHOPS, 2015, 9461 : 3 - 14
  • [45] Compensation strategy of unseen feature words in naive Bayes text classification
    School of Management, Harbin Institute of Technology, Harbin 150001, China
    不详
    [J]. Harbin Gongye Daxue Xuebao, 2008, 6 (956-960):
  • [46] Combining fuzzy clustering with Naive Bayes augmented learning in text classification
    Liu, Lizhen
    Sun, Xiaojing
    Song, Hantao
    [J]. 2006 1ST INTERNATIONAL SYMPOSIUM ON PERVASIVE COMPUTING AND APPLICATIONS, PROCEEDINGS, 2006, : 168 - +
  • [47] Divergence-Based Feature Selection for Naive Bayes Text Classification
    Wang, Huizhen
    Zhu, Jingbo
    Su, Keh-Yih
    [J]. IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 209 - +
  • [48] Naive Bayes Text Classification with Positive Features Selected by Statistical Method
    Meena, M. Janaki
    Chandran, K. R.
    [J]. FIRST INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING 2009 (ICAC 2009), 2009, : 28 - +
  • [49] Constrained domain maximum likelihood estimation for naive Bayes text classification
    Jesús Andrés-Ferrer
    Alfons Juan
    [J]. Pattern Analysis and Applications, 2010, 13 : 189 - 196
  • [50] On word frequency information and negative evidence in Naive Bayes text classification
    Schneider, KM
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, 2004, 3230 : 474 - 485