Asynchronous Peer-to-Peer Data Mining with Stochastic Gradient Descent

被引:0
|
作者
Ormandi, Robert [1 ]
Hegedus, Istvan [1 ]
Jelasity, Mark [2 ]
机构
[1] Univ Szeged, Szeged, Hungary
[2] Hungarian Acad Sci, Univ Szeged, Szeged, Hungary
来源
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Fully distributed data mining algorithms build global models over large amounts of data distributed over a large number of peers in a network, without moving the data itself. In the area of peer-to-peer (P2P) networks, such algorithms have various applications in P2P social networking, and also in trackerless BitTorrent communities. The difficulty of the problem involves realizing good quality models with an affordable communication complexity, while assuming as little as possible about the communication model. Here we describe a conceptually simple, yet powerful generic approach for designing efficient, fully distributed, asynchronous, local algorithms for learning models of fully distributed data. The key idea is that many models perform a random walk over the network while being gradually adjusted to fit the data they encounter, using a stochastic gradient descent search. We demonstrate our approach by implementing the support vector machine (SVM) method and by experimentally evaluating its performance in various failure scenarios over different benchmark datasets. Our algorithm scheme can implement a wide range of machine learning methods in an extremely robust manner.
引用
收藏
页码:528 / 540
页数:13
相关论文
共 50 条
  • [31] Data-Driven Stochastic Game With Social Attributes for Peer-to-Peer Energy Sharing
    Chen, Liudong
    Liu, Nian
    Liu, Liangying
    Yu, Xinghuo
    Xue, Yusheng
    IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (06) : 5158 - 5171
  • [32] A peer-to-peer data structure for dynamic location data
    Heutelbeck, Dominic
    Hemmje, Matthias
    PERCOM 2006: FOURTH ANNUAL IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2006, : 264 - +
  • [33] Asynchronous privacy-preserving iterative computation on peer-to-peer networks
    J. A. M. Naranjo
    L. G. Casado
    Márk Jelasity
    Computing, 2012, 94 : 763 - 782
  • [34] Terrain Data Peer-to-peer Storage Model
    Li, Zhongmin
    Liu, Min
    Gao, Lu
    2010 18TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS, 2010,
  • [35] Data indexing in peer-to-peer DHT networks
    Garcés-Erice, L. (garces@eurecom.fr), IEEE Computer Society; Information Proceeding Society of Japan (IPSJ) (Institute of Electrical and Electronics Engineers Inc.):
  • [36] Asynchronous privacy-preserving iterative computation on peer-to-peer networks
    Naranjo, J. A. M.
    Casado, L. G.
    Jelasity, Mark
    COMPUTING, 2012, 94 (8-10) : 763 - 782
  • [37] Reliability of data in structured peer-to-peer systems
    Rieche, S
    Wehrle, K
    Landsiedel, O
    Götz, S
    Petrak, L
    2004 INTERNATIONAL WORKSHOP ON HOT TOPICS IN PEER-TO-PEER SYSTEMS, PROCEEDINGS, 2004, : 108 - 113
  • [38] Characterization of data on the gnutella peer-to-peer network
    Miller, J
    CCNC 2004: 1ST IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE, PROCEEDINGS: CONSUMER NETWORKING: CLOSING THE DIGITAL DIVIDE, 2004, : 489 - 494
  • [39] Research issues in Peer-to-Peer data management
    Ulusoy, Oezguer
    2007 22ND INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2007, : 1 - 8
  • [40] Peer-to-peer data integration with active XML
    Milo, T
    ADVANCES IN COMPUTER SCIENCE - ASIAN 2005, PROCEEDINGS: DATA MANAGEMENT ON THE WEB, 2005, 3818 : 11 - 18