Collective mining of Bayesian networks from distributed heterogeneous data

被引:37
|
作者
Chen, R
Sivakumar, K [1 ]
Kargupta, H
机构
[1] Washington State Univ, Sch Elect Engn & Comp Sci, Pullman, WA 99164 USA
[2] Univ Maryland Baltimore Cty, Dept Comp Sci & Elect Engn, Baltimore, MD 21228 USA
关键词
Bayesian network; collective data mining; distributed data mining; heterogeneous data; web log mining;
D O I
10.1007/s10115-003-0107-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a collective approach to learning a Bayesian network from distributed heterogeneous data. In this approach, we first learn a local Bayesian network at each site using the local data. Then each site identifies the observations that are most likely to be evidence of coupling between local and non-local variables and transmits a subset of these observations to a central site. Another Bayesian network is learnt at the central site using the data transmitted from the local site. The local and central Bayesian networks are combined to obtain a collective Bayesian network, which models the entire data. Experimental results and theoretical justification that demonstrate the feasibility of our approach are presented.
引用
收藏
页码:164 / 187
页数:24
相关论文
共 50 条
  • [41] Distributed multivariate regression using wavelet-based collective data mining
    Hershberger, DE
    Kargupta, H
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2001, 61 (03) : 372 - 400
  • [42] Mining genetic epidemiology data with Bayesian networks I: Bayesian networks and example application (plasma apoE levels)
    Rodin, AS
    Boerwinkle, E
    [J]. BIOINFORMATICS, 2005, 21 (15) : 3273 - 3278
  • [43] Distributed data mining and knowledge management with networks of sensor arrays
    Dixon, Maurice
    Lambert, Simon C.
    Gallop, Julian R.
    [J]. KNOWLEDGE AND DATA MANAGEMENT IN GRIDS, 2007, : 235 - +
  • [44] Mining Subspace Clusters from Distributed Data
    Bian, Haiyun
    Bhatnagar, Raj
    [J]. COMPUTER AND INFORMATION SCIENCE 2009, 2009, 208 : 73 - 82
  • [45] A tutorial on data mining for Bayesian networks, with a specific focus on IoT for agriculture
    Krause, Paul J.
    Bokinala, Vivek
    [J]. INTERNET OF THINGS, 2023, 22 : 1 - 15
  • [46] DISTRIBUTED INFERENCE IN BAYESIAN NETWORKS
    DIEZ, FJ
    MIRA, J
    [J]. CYBERNETICS AND SYSTEMS, 1994, 25 (01) : 39 - 61
  • [47] DISTRIBUTED DATA MINING
    Fiolet, Valerie
    Toursel, Bernard
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2005, 6 (01): : 99 - 109
  • [48] Dependency Networks and Bayesian Networks for Web mining
    Tarantola, C
    Blanc, E
    [J]. DATA MINING III, 2002, 6 : 947 - 954
  • [49] Integrating Data Mining Models from Distributed Data Sources
    Wilford-Rivera, Ingrid
    Ruiz-Fernandez, Daniel
    Rosete-Suarez, Alejandro
    Marin-Alonso, Oscar
    [J]. DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2010, 79 : 389 - +
  • [50] Collective Dendrogram Clustering with Collaborative Filtering for Distributed Data Mining on Electronic Health Records
    Urmela, S.
    Nandhini, M.
    [J]. PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,