Collective mining of Bayesian networks from distributed heterogeneous data

Cited by: 37
Authors
Chen, R
Sivakumar, K [1]
Kargupta, H
Affiliations
[1] Washington State Univ, Sch Elect Engn & Comp Sci, Pullman, WA 99164 USA
[2] Univ Maryland Baltimore Cty, Dept Comp Sci & Elect Engn, Baltimore, MD 21228 USA
Keywords
Bayesian network; collective data mining; distributed data mining; heterogeneous data; web log mining
DOI
10.1007/s10115-003-0107-8
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a collective approach to learning a Bayesian network from distributed heterogeneous data. In this approach, we first learn a local Bayesian network at each site using the local data. Each site then identifies the observations that are most likely to be evidence of coupling between local and non-local variables and transmits a subset of these observations to a central site. Another Bayesian network is learnt at the central site using the data transmitted from the local sites. The local and central Bayesian networks are combined to obtain a collective Bayesian network, which models the entire data. Experimental results and theoretical justification that demonstrate the feasibility of our approach are presented.
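The steps in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several pieces are assumptions made only for illustration: an empirical joint table stands in for each local Bayesian network, "observations most likely to be evidence of coupling" is approximated by the observations with the lowest likelihood under the local model, the central site detects cross-site links with a mutual-information threshold rather than by learning a full network, and all function names (`fit_joint`, `flag_low_likelihood`, `cross_links`, `collective_network`) and the labels `A0`, `B0` are hypothetical.

```python
import math
from collections import Counter

def fit_joint(rows):
    """Empirical joint over a site's local variables.
    Assumption: a stand-in for the local Bayesian network."""
    counts = Counter(rows)
    return {cfg: c / len(rows) for cfg, c in counts.items()}

def flag_low_likelihood(rows, model, fraction):
    """Indices of the observations the local model explains worst.
    Assumption: low local likelihood is used as a proxy for coupling evidence."""
    ranked = sorted(range(len(rows)), key=lambda i: (model[rows[i]], i))
    return set(ranked[: max(1, int(fraction * len(rows)))])

def mutual_information(xs, ys):
    """Plug-in estimate of mutual information (in nats) between two columns."""
    n = len(xs)
    pxy, px, py = Counter(zip(xs, ys)), Counter(xs), Counter(ys)
    return sum((c / n) * math.log(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())

def cross_links(site_a, site_b, indices, threshold):
    """Central site: test each cross-site variable pair on the transmitted rows.
    Assumption: a thresholded dependence test replaces central network learning."""
    idx = sorted(indices)
    links = set()
    for i in range(len(site_a[0])):
        for j in range(len(site_b[0])):
            xs = [site_a[k][i] for k in idx]
            ys = [site_b[k][j] for k in idx]
            if mutual_information(xs, ys) > threshold:
                links.add((f"A{i}", f"B{j}"))
    return links

def collective_network(local_edges_a, local_edges_b, links):
    """Combine the local structures with the cross-site links."""
    return set(local_edges_a) | set(local_edges_b) | set(links)
```

In this sketch each site would call `fit_joint` and `flag_low_likelihood` on its own rows, the flagged indices would be pooled, and only the rows at those indices would be sent to the central site for `cross_links`. Rows at the two sites are assumed to share a common index so they can be joined.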
Pages: 164-187
Page count: 24
Related papers
  • [1] Collective Mining of Bayesian Networks from Distributed Heterogeneous Data
    R. Chen
    K. Sivakumar
    H. Kargupta
    [J]. Knowledge and Information Systems, 2004, 6 : 164 - 187
  • [2] Collective mining of Bayesian networks from distributed heterogeneous data
    R. Chen
    K. Sivakumar
    H. Kargupta
    [J]. Knowledge and Information Systems, 2004, 6 (2) : 164 - 187
  • [3] Distributed web mining using Bayesian networks from multiple data streams
    Chen, R
    Sivakumar, K
    Kargupta, H
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 75 - 82
  • [4] Collective, Hierarchical Clustering from distributed, heterogeneous data
    Johnson, EL
    Kargupta, H
    [J]. LARGE-SCALE PARALLEL DATA MINING, 2000, 1759 : 221 - 244
  • [5] Bayesian networks for data mining
    Heckerman, D
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1 (01) : 79 - 119
  • [6] Bayesian Networks for Data Mining
    David Heckerman
    [J]. Data Mining and Knowledge Discovery, 1997, 1 : 79 - 119
  • [7] Collective Principal Component Analysis from Distributed, Heterogeneous Data
    Kargupta, Hillol
    Huang, Weiyun
    Sivakumar, Krishnamoorthy
    Park, Byung-Hoon
    Wang, Shuren
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 2000, 1910 : 452 - 457
  • [8] Preparation of Distributed Heterogeneous Data for Data Mining
    Batasova, Svetlana
    Efimova, Maria
    Kholod, Ivan
    Semenchenko, Alexey
    [J]. 2015 XVIII International Conference on Soft Computing and Measurements (SCM), 2015, : 205 - 207
  • [9] Parallel data mining of Bayesian Networks from Telecommunications Network data
    Sterritt, R
    Adamson, K
    Shapcott, CM
    Curran, EP
    [J]. PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2000, 1800 : 415 - 422
  • [10] USE OF BAYESIAN NETWORKS IN DATA MINING
    Hanzelka, David
    [J]. APLIMAT 2005 - 4TH INTERNATIONAL CONFERENCE, PT II, 2005, : 437 - 443