Bayesian Networks for Data Mining

被引:0
|
作者
David Heckerman
机构
[1] Microsoft Research,
[2] 9S,undefined
来源
关键词
Bayesian networks; Bayesian statistics; learning; missing data; classification; regression; clustering; causal discovery;
D O I
暂无
中图分类号
学科分类号
摘要
A Bayesian network is a graphical model that encodesprobabilistic relationships among variables of interest. When used inconjunction with statistical techniques, the graphical model hasseveral advantages for data modeling. One, because the model encodesdependencies among all variables, it readily handles situations wheresome data entries are missing. Two, a Bayesian network can be used tolearn causal relationships, and hence can be used to gain understanding about a problem domain and to predict the consequencesof intervention. Three, because the model has both a causal andprobabilistic semantics, it is an ideal representation for combiningprior knowledge (which often comes in causal form) and data. Four,Bayesian statistical methods in conjunction with Bayesian networksoffer an efficient and principled approach for avoiding theoverfitting of data. In this paper, we discuss methods for constructing Bayesian networks from prior knowledge and summarizeBayesian statistical methods for using data to improve these models.With regard to the latter task, we describe methods for learning boththe parameters and structure of a Bayesian network, includingtechniques for learning with incomplete data. In addition, we relateBayesian-network methods for learning to techniques for supervised andunsupervised learning. We illustrate the graphical-modeling approachusing a real-world case study.
引用
收藏
页码:79 / 119
页数:40
相关论文
共 50 条
  • [1] Bayesian networks for data mining
    Heckerman, D
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1 (01) : 79 - 119
  • [2] USE OF BAYESIAN NETWORKS IN DATA MINING
    Hanzelka, David
    [J]. APLIMAT 2005 - 4TH INTERNATIONAL CONFERENCE, PT II, 2005, : 437 - 443
  • [3] Data mining in Network engineering-Bayesian Networks for Data Mining
    Wang, Xiao Dan
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMMERCE AND SOCIETY, 2015, 17 : 412 - 417
  • [4] Epidemiological data mining of cardiovascular Bayesian networks
    Twardy, Charles R.
    Nicholson, Ann E.
    Korb, Kevin B.
    McNeil, John
    [J]. ELECTRONIC JOURNAL OF HEALTH INFORMATICS, 2006, 1 (01):
  • [5] Applying Bayesian Networks for meteorological data mining
    Hruschka, ER
    Hruschka, ER
    Ebecken, NFF
    [J]. APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XIII, 2006, : 122 - +
  • [6] Data mining of Bayesian networks using cooperative coevolution
    Wong, ML
    Lee, SY
    Leung, KS
    [J]. DECISION SUPPORT SYSTEMS, 2004, 38 (03) : 451 - 472
  • [7] Data mining based Bayesian networks for best classification
    Ouali, Abdelaziz
    Cherif, Amar Ramdane
    Krebs, Marie-Odile
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (02) : 1278 - 1292
  • [8] Application of Bayesian networks and data mining to biomedical problems
    Kammerdiner, Alla R.
    Gupal, Anatoliy M.
    Pardalos, Panos M.
    [J]. DATA MINING, SYSTEMS ANALYSIS, AND OPTIMIZATION IN BIOMEDICINE, 2007, 953 : 132 - +
  • [9] Parallel data mining of Bayesian Networks from Telecommunications Network data
    Sterritt, R
    Adamson, K
    Shapcott, CM
    Curran, EP
    [J]. PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2000, 1800 : 415 - 422
  • [10] Bayesian neural networks with confidence estimations applied to data mining
    Orre, R
    Lansner, A
    Bate, A
    Lindquist, M
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2000, 34 (04) : 473 - 493