Attribute grouping-based naive Bayesian classifier

Authors
Yulin He [1]
Guiliang Ou [2]
Philippe Fournier-Viger [2]
Joshua Zhexue Huang [2]
Affiliations
[1] Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), College of Computer Science & Software Engineering
[2] Shenzhen University
Keywords
naive Bayesian classifier; attribute independence assumption; attribute grouping; dependent attribute group; posterior probability; class-conditional probability
DOI
10.1007/s11432-022-3728-2
Abstract
The naive Bayesian classifier (NBC) is a supervised machine learning algorithm with a simple model structure and good theoretical interpretability. However, the generalization performance of NBC is limited to a large extent by the assumption of attribute independence. To address this issue, this paper proposes a novel attribute grouping-based NBC (AG-NBC), a variant of the classical NBC that is trained on attribute groups rather than on individual attributes. AG-NBC first applies a novel and effective objective function to automatically identify optimal dependent attribute groups (DAGs). Condition attributes in the same DAG are strongly dependent on the class attribute, whereas attributes in different DAGs are independent of one another. Then, for each DAG, a random vector functional link network with a SoftMax layer is trained to output posterior probabilities in the form of joint probability density estimation. The NBC is then trained on the grouping attributes that correspond to the original condition attributes. Extensive experiments were conducted to validate the rationality, feasibility, and effectiveness of AG-NBC. The findings show that the attribute groups chosen for NBC accurately represent attribute dependencies and reduce overlaps between different posterior probability densities. In addition, comparisons with NBC, flexible NBC (FNBC), tree augmented Bayes network (TAN), gain ratio-based attribute weighted naive Bayes (GRAWNB), averaged one-dependence estimators (AODE), weighted AODE (WAODE), independent component analysis-based NBC (ICA-NBC), hidden naive Bayesian (HNB) classifier, and correlation-based feature weighting filter for naive Bayes (CFW) show that AG-NBC obtains statistically better testing accuracies, higher areas under the receiver operating characteristic curve (AUCs), and lower probability mean square errors (PMSEs) than the other Bayesian classifiers. The experimental results demonstrate that AG-NBC is a valid and efficient approach for alleviating the attribute independence assumption when building NBCs.
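The pipeline described in the abstract can be sketched roughly as follows. This is only an illustrative sketch under several assumptions and not the authors' implementation: the abstract does not give the objective function used to find the DAGs, so the attribute groups are fixed by hand here; the RVFL output weights are fitted by closed-form ridge regression; and the final classifier is a small Gaussian naive Bayes. The names `RVFL`, `TinyGaussianNB`, `group_features`, and all hyperparameters and the synthetic data are hypothetical.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # stabilise the exponentials
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

class RVFL:
    """Random vector functional link net: random hidden layer plus direct links."""
    def __init__(self, n_hidden=20, ridge=1e-2, seed=0):
        self.n_hidden, self.ridge = n_hidden, ridge
        self.rng = np.random.default_rng(seed)

    def _features(self, X):
        H = np.tanh(X @ self.W + self.b)  # random, untrained hidden layer
        return np.hstack([X, H, np.ones((len(X), 1))])  # direct links, hidden, bias

    def fit(self, X, Y_onehot):
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        D = self._features(X)
        # Assumption: output weights come from closed-form ridge regression.
        A = D.T @ D + self.ridge * np.eye(D.shape[1])
        self.beta = np.linalg.solve(A, D.T @ Y_onehot)
        return self

    def predict_proba(self, X):
        return softmax(self._features(X) @ self.beta)

class TinyGaussianNB:
    """Minimal Gaussian naive Bayes, used here on the grouping attributes."""
    def fit(self, X, y):
        self.classes = np.unique(y)
        self.prior = np.array([(y == c).mean() for c in self.classes])
        self.mu = np.array([X[y == c].mean(axis=0) for c in self.classes])
        self.var = np.array([X[y == c].var(axis=0) for c in self.classes]) + 1e-6
        return self

    def predict(self, X):
        diff = X[:, None, :] - self.mu[None]  # shape (n, classes, attributes)
        log_lik = -0.5 * (np.log(2 * np.pi * self.var)[None]
                          + diff ** 2 / self.var[None]).sum(axis=2)
        return self.classes[np.argmax(log_lik + np.log(self.prior), axis=1)]

# Assumption: the dependent attribute groups are given, not learned.
groups = [[0, 1], [2, 3]]

# Synthetic data: attributes are correlated within a group, independent across groups.
rng = np.random.default_rng(0)
n = 400
y = rng.integers(0, 2, size=n)
cov = np.array([[1.0, 0.8], [0.8, 1.0]])
X = np.hstack([rng.multivariate_normal([0, 0], cov, size=n) for _ in groups])
X += 3.0 * y[:, None]  # class-dependent mean shift
Xtr, ytr, Xte, yte = X[:300], y[:300], X[300:], y[300:]

# One RVFL per group; its SoftMax outputs act as the grouping attributes.
nets = [RVFL(seed=g).fit(Xtr[:, cols], np.eye(2)[ytr])
        for g, cols in enumerate(groups)]

def group_features(X):
    return np.hstack([net.predict_proba(X[:, cols])
                      for net, cols in zip(nets, groups)])

Z_train = group_features(Xtr)
nb = TinyGaussianNB().fit(Z_train, ytr)
acc = (nb.predict(group_features(Xte)) == yte).mean()
print("held-out accuracy:", acc)
```

In this sketch each group contributes one SoftMax output per class, so the columns within a group sum to one. A fuller implementation might drop one column per group or model the grouping attributes differently.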