A Novel Bayes Model: Hidden Naive Bayes

被引：222

作者：

Jiang, Liangxiao ^{[1
]}

Zhang, Harry ^{[2
]}

Cai, Zhihua ^{[1
]}

机构：

[1] China Univ Geosci, Fac Comp Sci, Wuhan 430074, Peoples R China

[2] Univ New Brunswick, Fac Comp Sci, Fredericton, NB E3B 5A3, Canada

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2009年 / 21卷 / 10期

关键词：

Naive Bayes; Bayesian network classifiers; learning algorithms; classification; class probability estimation; ranking; ROC CURVE; AREA;

D O I：

10.1109/TKDE.2008.234

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Because learning an optimal Bayesian network classifier is an NP-hard problem, learning-improved naive Bayes has attracted much attention from researchers. In this paper, we summarize the existing improved algorithms and propose a novel Bayes model: hidden naive Bayes (HNB). In HNB, a hidden parent is created for each attribute which combines the influences from all other attributes. We experimentally test HNB in terms of classification accuracy, using the 36 UCI data sets selected by Weka, and compare it to naive Bayes (NB), selective Bayesian classifiers (SBC), naive Bayes tree (NBTree), tree-augmented naive Bayes (TAN), and averaged one-dependence estimators (AODE). The experimental results show that HNB significantly outperforms NB, SBC, NBTree, TAN, and AODE. In many data mining applications, an accurate class probability estimation and ranking are also desirable. We study the class probability estimation and ranking performance, measured by conditional log likelihood (CLL) and the area under the ROC curve (AUC), respectively, of naive Bayes and its improved models, such as SBC, NBTree, TAN, and AODE, and then compare HNB to them in terms of CLL and AUC. Our experiments show that HNB also significantly outperforms all of them.

引用

页码：1361 / 1371

页数：11

共 50 条

[41] Naive Bayes text classifier
Zhang, Haiyi
Li, Di
[J]. GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711
[42] On pairwise naive Bayes classifiers
Sulzmann, Jan-Nikolas
Fuernkranz, Johannes
Huellermeier, Eyke
[J]. MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 371 - +
[43] Landscapes of Naive Bayes classifiers
Hoare, Zoe
[J]. PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (01) : 59 - 72
[44] Integrating naive Bayes and FOIL
Landwehr, Niels
Kersting, Kristian
De Raedt, Luc
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 481 - 507
[45] Why the Naive Bayes approximation is not as naive as it appears
Stephens, Christopher R.
Flores Huerta, Hugo
Ruiz Linares, Ana
[J]. 2015 6TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2015,
[46] A network intrusion detection system based on a Hidden Naive Bayes multiclass classifier
Koc, Levent
Mazzuchi, Thomas A.
Sarkani, Shahram
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (18) : 13492 - 13500
[47] Discretization as the enabling technique for the Naive Bayes and semi-Naive Bayes-based classification
Mizianty, Marcin J.
Kurgan, Lukasz A.
Ogiela, Marek R.
[J]. KNOWLEDGE ENGINEERING REVIEW, 2010, 25 (04): : 421 - 449
[48] Sentiment Analysis using Naive Bayes and Complement Naive Bayes Classifier Algorithms on Hadoop Framework
Seref, Berna
Bostanci, Erkan
[J]. 2018 2ND INTERNATIONAL SYMPOSIUM ON MULTIDISCIPLINARY STUDIES AND INNOVATIVE TECHNOLOGIES (ISMSIT), 2018, : 555 - 561
[49] Naive Feature Selection: Sparsity in Naive Bayes
Askari, Armin
d'Aspremont, Alex
El Ghaoui, Laurent
[J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1813 - 1821
[50] Variational Bayes for estimating the parameters of a hidden Potts model
McGrory, C. A.
Titterington, D. M.
Reeves, R.
Pettitt, A. N.
[J]. STATISTICS AND COMPUTING, 2009, 19 (03) : 329 - 340

← 1 2 3 4 5 →