Knowledge discovery by probabilistic clustering of distributed databases

Cited by: 12
Authors
McClean, S [1]
Scotney, B [1]
Morrow, P [1]
Greer, K [1]
Affiliation
[1] Univ Ulster, Sch Comp & Informat Engn, Coleraine BT52 1SA, Londonderry, North Ireland
Keywords
distributed databases; probabilistic clustering; aggregates; dynamic shared ontology
DOI
10.1016/j.datak.2004.12.001
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clustering of distributed databases facilitates knowledge discovery through learning of new concepts that characterise common features and differences between datasets. Hence, general patterns can be learned rather than restricting learning to specific databases from which rules may not be generalisable. We cluster databases that hold aggregate count data on categorical attributes that have been classified according to homogeneous or heterogeneous classification schemes. Clustering of datasets is carried out via the probability distributions that describe their respective aggregates. The homogeneous case is straightforward. For heterogeneous data we investigate a number of clustering strategies, of which the most efficient avoid the need to compute a dynamic shared ontology to homogenise the classification schemes prior to clustering. (c) 2004 Elsevier B.V. All rights reserved.
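The homogeneous case described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes every dataset reports counts over the same categories, uses Jensen-Shannon divergence as a stand-in dissimilarity between the probability distributions, and merges greedily until no pair is closer than a threshold. The function names, the threshold value, and the example counts are all hypothetical.

```python
import math


def to_distribution(counts):
    """Turn aggregate counts into a probability distribution (counts must not sum to 0)."""
    total = sum(counts)
    return [c / total for c in counts]


def js_divergence(p, q):
    """Jensen-Shannon divergence (natural log) between two distributions."""
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(x, y):
        # Terms with x_i == 0 contribute nothing to the sum.
        return sum(a * math.log(a / b) for a, b in zip(x, y) if a > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def cluster_datasets(datasets, threshold):
    """Greedy agglomerative clustering of datasets by their count distributions.

    Assumption: all datasets share one classification scheme (homogeneous case).
    Each cluster keeps the pooled counts of its members.
    """
    clusters = [([name], list(counts)) for name, counts in datasets.items()]
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = js_divergence(
                    to_distribution(clusters[i][1]),
                    to_distribution(clusters[j][1]),
                )
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        if d > threshold:
            break  # the closest pair is still too far apart
        names = clusters[i][0] + clusters[j][0]
        pooled = [a + b for a, b in zip(clusters[i][1], clusters[j][1])]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append((names, pooled))
    return [sorted(names) for names, _ in clusters]


# Hypothetical aggregate counts over three shared categories.
example = {
    "A": [50, 30, 20],
    "B": [48, 32, 20],
    "C": [10, 20, 70],
    "D": [12, 18, 70],
}
print(sorted(cluster_datasets(example, threshold=0.01)))
```

With these made-up counts, A and B have nearly identical distributions, as do C and D, so the sketch groups them into two clusters. The heterogeneous case, where classification schemes differ between databases, is not covered here.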
Pages: 189-210
Page count: 22