Local L2-Thresholding Based Data Mining in Peer-to-Peer Systems

Cited by: 0
Authors
Wolff, Ran
Bhaduri, Kanishka
Kargupta, Hillol
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In a large network of computers, wireless sensors, or mobile devices, each of the components (henceforth, peers) has some data about the global status of the system. Many of the functions of the system, such as routing decisions, search strategies, data cleansing, and the assignment of mutual trust, depend on the global status. Therefore, it is essential that the system be able to detect, and react to, changes in its global status. Computing global predicates in such systems is usually very costly, mainly because of their scale and, in some cases (e.g., sensor networks), also because of the high cost of communication. The cost further increases when the data changes rapidly (due to state changes, node failure, etc.) and the computation has to follow these changes. In this paper we describe a two-step approach for dealing with these costs. First, we describe a highly efficient local algorithm which detects when the L2 norm of the average data surpasses a threshold. Then, we use this algorithm as a feedback loop for the monitoring of complex predicates on the data, such as the data's k-means clustering. The efficiency of the L2 algorithm guarantees that so long as the clustering results represent the data (i.e., the data is stationary), few resources are required. When the data undergoes an epoch change (a change in the underlying distribution) and the model no longer represents it, the feedback loop indicates this and the model is rebuilt. Furthermore, the existence of a feedback loop allows using approximate and "best-effort" methods for constructing the model; if an ill-fit model is built, the feedback loop would indicate so, and the model would be rebuilt.
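The predicate named in the abstract, whether the L2 norm of the average data exceeds a threshold, can be written down directly. The sketch below is a centralized reference check only, not the paper's local algorithm, which this record does not describe. The names `exceeds_threshold`, `model_is_stale`, `epsilon`, and `centroid` are hypothetical. Using the check as a feedback signal by measuring how far the average has drifted from a model centroid is an assumption made for illustration.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def average(vectors: Sequence[Vector]) -> List[float]:
    """Component-wise mean of a non-empty list of equally sized vectors."""
    n = len(vectors)
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / n for i in range(dim)]


def l2_norm(v: Vector) -> float:
    """Euclidean (L2) norm of a vector."""
    return math.sqrt(sum(x * x for x in v))


def exceeds_threshold(peer_data: Sequence[Vector], epsilon: float) -> bool:
    """True when the L2 norm of the average of all peers' vectors is above epsilon.

    Centralized reference: it reads every peer's vector, which is exactly
    the cost a local algorithm is meant to avoid.
    """
    return l2_norm(average(peer_data)) > epsilon


def model_is_stale(peer_data: Sequence[Vector], centroid: Vector, epsilon: float) -> bool:
    """Assumed feedback check: has the average drifted more than epsilon from the centroid?"""
    shifted = [[x - c for x, c in zip(v, centroid)] for v in peer_data]
    return exceeds_threshold(shifted, epsilon)
```

With four peers holding `[1, 0]`, `[0, 1]`, `[1, 1]`, and `[0, 0]`, the average is `[0.5, 0.5]`. A model with centroid `[0.5, 0.5]` is therefore not flagged as stale for any positive `epsilon`, while shifting every peer's first coordinate by 2 moves the average a distance of 2 from the centroid and triggers the check.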
Pages: 430-441
Number of pages: 12