Efficient heuristics for learning scalable Bayesian network classifier from labeled and unlabeled data

被引:0
|
作者
Limin Wang
Junjie Wang
Lu Guo
Qilong Li
机构
[1] Jilin University,Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education
[2] College of Software,undefined
[3] Jilin University,undefined
[4] College of Instrumentation and Electrical Engineering,undefined
[5] Jilin University,undefined
来源
Applied Intelligence | 2024年 / 54卷
关键词
Bayesian network classifier; Attribute independence assumption; Ensemble learning; Log-likelihood function; Instance learning;
D O I
暂无
中图分类号
学科分类号
摘要
Naive Bayes (NB) is one of the top ten machine learning algorithms whereas its attribute independence assumption rarely holds in practice. A feasible and efficient approach to improving NB is relaxing the assumption by adding augmented edges to the restricted topology of NB. In this paper we prove theoretically that the generalized topology may be a suboptimal solution to model multivariate probability distributions if its fitness to data cannot be measured. Thus we propose to apply log-likelihood function as the scoring function, then introduce an efficient heuristic search strategy to explore high-dependence relationships, and for each iteration the learned topology will be improved to fit data better. The proposed algorithm, called log-likelihood Bayesian classifier (LLBC), can respectively learn two submodels from labeled training set and individual unlabeled testing instance, and then make them work jointly for classification in the framework of ensemble learning. Our extensive experimental evaluations on 36 benchmark datasets from the University of California at Irvine (UCI) machine learning repository reveal that, LLBC demonstrates excellent classification performance and provides a competitive approach to learn from labeled and unlabeled data.
引用
收藏
页码:1957 / 1979
页数:22
相关论文
共 50 条
  • [31] Consistency of Lipschitz Learning with Infinite Unlabeled Data and Finite Labeled Data
    Calder, Jeff
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2019, 1 (04): : 780 - 812
  • [32] An Efficient Active Bayes Classifier using Affinity Propagation on Unlabeled Data
    Wang XianHui
    Qin Zheng
    Zhang XuanPing
    2009 SECOND INTERNATIONAL CONFERENCE ON FUTURE INFORMATION TECHNOLOGY AND MANAGEMENT ENGINEERING, FITME 2009, 2009, : 440 - 443
  • [33] Bayesian Network Classifier for Medical Data Analysis
    Reiz, Beata
    Csato, Lehel
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2009, 4 (01) : 65 - 72
  • [34] UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
    Wang, Chengyi
    Wu, Yu
    Qian, Yao
    Kumatani, Kenichi
    Liu, Shujie
    Wei, Furu
    Zeng, Michael
    Huang, Xuedong
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [35] An efficient dynamic Bayesian network classifier structure learning algorithm: application to sport epidemiology
    Peterson, Kyle D.
    JOURNAL OF COMPLEX NETWORKS, 2020, 8 (04)
  • [36] Discriminatory Target Learning: Mining Significant Dependence Relationships from Labeled and Unlabeled Data
    Duan, Zhi-Yi
    Wang, Li-Min
    Mammadov, Musa
    Lou, Hua
    Sun, Ming-Hui
    ENTROPY, 2019, 21 (05)
  • [37] Global/Local Hybrid Learning of Mixture-of-Experts from Labeled and Unlabeled Data
    Yoon, Jong-Won
    Cho, Sung-Bae
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PART I, 2011, 6678 : 452 - 459
  • [38] Learning to classify text from labeled and unlabeled documents
    Nigam, K
    McCallum, A
    Thrun, S
    Mitchell, T
    FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 792 - 799
  • [39] Scalable Bayesian Network Structure Learning with Splines
    Sharma, Charupriya
    van Beek, Peter
    INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 186, 2022, 186
  • [40] Robust Bayesian learning with domain heuristics for missing data
    Wun, CH
    Wu, CH
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2004, 3214 : 1268 - 1275