A Novel Bayes Model: Hidden Naive Bayes

被引:222
|
作者
Jiang, Liangxiao [1 ]
Zhang, Harry [2 ]
Cai, Zhihua [1 ]
机构
[1] China Univ Geosci, Fac Comp Sci, Wuhan 430074, Peoples R China
[2] Univ New Brunswick, Fac Comp Sci, Fredericton, NB E3B 5A3, Canada
关键词
Naive Bayes; Bayesian network classifiers; learning algorithms; classification; class probability estimation; ranking; ROC CURVE; AREA;
D O I
10.1109/TKDE.2008.234
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Because learning an optimal Bayesian network classifier is an NP-hard problem, learning-improved naive Bayes has attracted much attention from researchers. In this paper, we summarize the existing improved algorithms and propose a novel Bayes model: hidden naive Bayes (HNB). In HNB, a hidden parent is created for each attribute which combines the influences from all other attributes. We experimentally test HNB in terms of classification accuracy, using the 36 UCI data sets selected by Weka, and compare it to naive Bayes (NB), selective Bayesian classifiers (SBC), naive Bayes tree (NBTree), tree-augmented naive Bayes (TAN), and averaged one-dependence estimators (AODE). The experimental results show that HNB significantly outperforms NB, SBC, NBTree, TAN, and AODE. In many data mining applications, an accurate class probability estimation and ranking are also desirable. We study the class probability estimation and ranking performance, measured by conditional log likelihood (CLL) and the area under the ROC curve (AUC), respectively, of naive Bayes and its improved models, such as SBC, NBTree, TAN, and AODE, and then compare HNB to them in terms of CLL and AUC. Our experiments show that HNB also significantly outperforms all of them.
引用
收藏
页码:1361 / 1371
页数:11
相关论文
共 50 条
  • [41] Naive Bayes text classifier
    Zhang, Haiyi
    Li, Di
    [J]. GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711
  • [42] On pairwise naive Bayes classifiers
    Sulzmann, Jan-Nikolas
    Fuernkranz, Johannes
    Huellermeier, Eyke
    [J]. MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 371 - +
  • [43] Landscapes of Naive Bayes classifiers
    Hoare, Zoe
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (01) : 59 - 72
  • [44] Integrating naive Bayes and FOIL
    Landwehr, Niels
    Kersting, Kristian
    De Raedt, Luc
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 481 - 507
  • [45] Why the Naive Bayes approximation is not as naive as it appears
    Stephens, Christopher R.
    Flores Huerta, Hugo
    Ruiz Linares, Ana
    [J]. 2015 6TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2015,
  • [46] A network intrusion detection system based on a Hidden Naive Bayes multiclass classifier
    Koc, Levent
    Mazzuchi, Thomas A.
    Sarkani, Shahram
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (18) : 13492 - 13500
  • [47] Discretization as the enabling technique for the Naive Bayes and semi-Naive Bayes-based classification
    Mizianty, Marcin J.
    Kurgan, Lukasz A.
    Ogiela, Marek R.
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2010, 25 (04): : 421 - 449
  • [48] Sentiment Analysis using Naive Bayes and Complement Naive Bayes Classifier Algorithms on Hadoop Framework
    Seref, Berna
    Bostanci, Erkan
    [J]. 2018 2ND INTERNATIONAL SYMPOSIUM ON MULTIDISCIPLINARY STUDIES AND INNOVATIVE TECHNOLOGIES (ISMSIT), 2018, : 555 - 561
  • [49] Naive Feature Selection: Sparsity in Naive Bayes
    Askari, Armin
    d'Aspremont, Alex
    El Ghaoui, Laurent
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1813 - 1821
  • [50] Variational Bayes for estimating the parameters of a hidden Potts model
    McGrory, C. A.
    Titterington, D. M.
    Reeves, R.
    Pettitt, A. N.
    [J]. STATISTICS AND COMPUTING, 2009, 19 (03) : 329 - 340