Efficient heuristics for learning scalable Bayesian network classifier from labeled and unlabeled data

被引:0
|
作者
Limin Wang
Junjie Wang
Lu Guo
Qilong Li
机构
[1] Jilin University,Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education
[2] College of Software,undefined
[3] Jilin University,undefined
[4] College of Instrumentation and Electrical Engineering,undefined
[5] Jilin University,undefined
来源
Applied Intelligence | 2024年 / 54卷
关键词
Bayesian network classifier; Attribute independence assumption; Ensemble learning; Log-likelihood function; Instance learning;
D O I
暂无
中图分类号
学科分类号
摘要
Naive Bayes (NB) is one of the top ten machine learning algorithms whereas its attribute independence assumption rarely holds in practice. A feasible and efficient approach to improving NB is relaxing the assumption by adding augmented edges to the restricted topology of NB. In this paper we prove theoretically that the generalized topology may be a suboptimal solution to model multivariate probability distributions if its fitness to data cannot be measured. Thus we propose to apply log-likelihood function as the scoring function, then introduce an efficient heuristic search strategy to explore high-dependence relationships, and for each iteration the learned topology will be improved to fit data better. The proposed algorithm, called log-likelihood Bayesian classifier (LLBC), can respectively learn two submodels from labeled training set and individual unlabeled testing instance, and then make them work jointly for classification in the framework of ensemble learning. Our extensive experimental evaluations on 36 benchmark datasets from the University of California at Irvine (UCI) machine learning repository reveal that, LLBC demonstrates excellent classification performance and provides a competitive approach to learn from labeled and unlabeled data.
引用
收藏
页码:1957 / 1979
页数:22
相关论文
共 50 条
  • [21] Efficient Algorithms for Bayesian Network Parameter Learning from Incomplete Data
    Van den Broeck, Guy
    Mohan, Karthika
    Choi, Arthur
    Darwiche, Adnan
    Pearl, Judea
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 161 - 170
  • [22] A Scalable Data Science Workflow Approach for Big Data Bayesian Network Learning
    Wang, Jianwu
    Tang, Yan
    Nguyen, Mai
    Altintas, Ilkay
    2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON BIG DATA COMPUTING (BDC), 2014, : 16 - 25
  • [23] Bayesian belief network for positive unlabeled learning with uncertainty
    Gan, Hongxiao
    Zhang, Yang
    Song, Qun
    PATTERN RECOGNITION LETTERS, 2017, 90 : 28 - 35
  • [24] Time series feature learning with labeled and unlabeled data
    Wang, Haishuai
    Zhang, Qin
    Wu, Jia
    Pan, Shirui
    Chen, Yixin
    PATTERN RECOGNITION, 2019, 89 : 55 - 66
  • [25] Learning from Labeled and Unlabeled Vertices in Networks
    Ye, Wei
    Zhou, Linfei
    Mautz, Dominik
    Plant, Claudia
    Boehm, Christian
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1265 - 1274
  • [26] Scalable Learning of Bayesian Network Classifiers
    Martinez, Ana M.
    Webb, Geoffrey I.
    Chen, Shenglei
    Zaidi, Nayyar A.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [27] A new classifier based on information theoretic learning with unlabeled data
    Jeong, KH
    Xu, JW
    Erdogmus, D
    Principe, JC
    NEURAL NETWORKS, 2005, 18 (5-6) : 719 - 726
  • [28] Probabilistic modeling for face orientation discrimination: Learning from labeled and unlabeled data
    Baluja, S
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 854 - 860
  • [29] Learning from labeled and unlabeled data: An empirical study across techniques and domains
    Chawla, N
    Karakoulas, G
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2005, 23 : 331 - 366
  • [30] An Active MBBNTree Classifier Learning from Unlabeled Samples
    Cao, Yong C.
    Zhao, Yue
    Pan, Xiu Q.
    Lu, Yong
    Xu, Xiao N.
    7TH INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION AND CONTROL TECHNOLOGY: MEASUREMENT THEORY AND SYSTEMS AND AERONAUTICAL EQUIPMENT, 2008, 7128