Attribute grouping-based naive Bayesian classifier

被引:0
|
作者
Yulin He [1 ]
Guiliang Ou [2 ]
Philippe Fournier-Viger [2 ]
Joshua Zhexue Huang [2 ]
机构
[1] Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ),College of Computer Science & Software Engineering
[2] Shenzhen University,undefined
关键词
naive Bayesian classifier; attribute independence assumption; attribute grouping; dependent attribute group; posterior probability; class-conditional probability;
D O I
10.1007/s11432-022-3728-2
中图分类号
学科分类号
摘要
The naive Bayesian classifier (NBC) is a supervised machine learning algorithm having a simple model structure and good theoretical interpretability. However, the generalization performance of NBC is limited to a large extent by the assumption of attribute independence. To address this issue, this paper proposes a novel attribute grouping-based NBC (AG-NBC), which is a variant of the classical NBC trained with different attribute groups. AG-NBC first applies a novel effective objective function to automatically identify optimal dependent attribute groups (DAGs). Condition attributes in the same DAG are strongly dependent on the class attribute, whereas attributes in different DAGs are independent of one another. Then, for each DAG, a random vector functional link network with a SoftMax layer is trained to output posterior probabilities in the form of joint probability density estimation. The NBC is trained using the grouping attributes that correspond to the original condition attributes. Extensive experiments were conducted to validate the rationality, feasibility, and effectiveness of AG-NBC. Our findings showed that the attribute groups chosen for NBC can accurately represent attribute dependencies and reduce overlaps between different posterior probability densities. In addition, the comparative results with NBC, flexible NBC (FNBC), tree augmented Bayes network (TAN), gain ratio-based attribute weighted naive Bayes (GRAWNB), averaged one-dependence estimators (AODE), weighted AODE (WAODE), independent component analysis-based NBC (ICA-NBC), hidden naive Bayesian (HNB) classifier, and correlation-based feature weighting filter for naive Bayes (CFW) show that AG-NBC obtains statistically better testing accuracies, higher area under the receiver operating characteristic curves (AUCs), and fewer probability mean square errors (PMSEs) than other Bayesian classifiers. The experimental results demonstrate that AG-NBC is a valid and efficient approach for alleviating the attribute independence assumption when building NBCs.
引用
收藏
相关论文
共 50 条
  • [1] Attribute grouping-based naive Bayesian classifier
    Yulin HE
    Guiliang OU
    Philippe FOURNIERVIGER
    Joshua Zhexue HUANG
    Science China(Information Sciences), 2025, 68 (03) : 125 - 149
  • [2] A Novel Mixed-Attribute Fusion-Based Naive Bayesian Classifier
    Ou, Guiliang
    He, Yulin
    Fournier-Viger, Philippe
    Huang, Joshua Zhexue
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [3] SubTree Augmented Naive Bayesian Classifier Based on the Fuzzy Equivalence Partition of Attribute Variables
    Chen, Hong-mei
    Wang, Li-zhen
    Liu, Wei-yi
    Chen, Hao
    ICIA: 2009 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-3, 2009, : 1397 - 1401
  • [4] Study on Hybrid-Weight for Feature Attribute in Naive Bayesian Classifier
    Guo, Bao-En
    Liu, Hai-Tao
    Geng, Chao
    2014 FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND ENGINEERING APPLICATIONS (ISDEA), 2014, : 958 - 962
  • [5] Auto-Encoding Independent Attribute Transformation for Naive Bayesian Classifier
    Ou, Guiliang
    He, Yulin
    Fournier-Viger, Philippe
    Huang, Joshua Zhexue
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [6] Grouping-based nonadditive verification
    Amir, A
    Lindenbaum, M
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (02) : 186 - 192
  • [7] Spam Filter Based on Naive Bayesian Classifier
    Lv, Teng
    Yan, Ping
    Yuan, Hongwu
    He, Weimin
    5TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2020), 2020, 1575
  • [8] Fisher score based naive Bayesian classifier
    Shi, ZZ
    Huang, YP
    Zhang, SL
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1616 - 1621
  • [10] Feature grouping-based multiple fuzzy classifier system for fusion of hyperspectral and LIDAR data
    Bigdeli, Behnaz
    Samadzadegan, Farhad
    Reinartz, Peter
    JOURNAL OF APPLIED REMOTE SENSING, 2014, 8