Asynchronous Peer-to-Peer Data Mining with Stochastic Gradient Descent

被引:0
|
作者
Ormandi, Robert [1 ]
Hegedus, Istvan [1 ]
Jelasity, Mark [2 ]
机构
[1] Univ Szeged, Szeged, Hungary
[2] Hungarian Acad Sci, Univ Szeged, Szeged, Hungary
来源
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Fully distributed data mining algorithms build global models over large amounts of data distributed over a large number of peers in a network, without moving the data itself. In the area of peer-to-peer (P2P) networks, such algorithms have various applications in P2P social networking, and also in trackerless BitTorrent communities. The difficulty of the problem involves realizing good quality models with an affordable communication complexity, while assuming as little as possible about the communication model. Here we describe a conceptually simple, yet powerful generic approach for designing efficient, fully distributed, asynchronous, local algorithms for learning models of fully distributed data. The key idea is that many models perform a random walk over the network while being gradually adjusted to fit the data they encounter, using a stochastic gradient descent search. We demonstrate our approach by implementing the support vector machine (SVM) method and by experimentally evaluating its performance in various failure scenarios over different benchmark datasets. Our algorithm scheme can implement a wide range of machine learning methods in an extremely robust manner.
引用
收藏
页码:528 / 540
页数:13
相关论文
共 50 条
  • [21] Peer-to-peer in big data management
    Bo Li
    Xiaofei Liao
    Peer-to-Peer Networking and Applications, 2013, 6 : 361 - 362
  • [22] Convergence analysis of an asynchronous peer-to-peer market with communication delays
    Dong, Alyssia
    Baroche, Thomas
    Latimier, Roman Le Goff
    Ben Ahmed, Hamid
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2021, 26 (26):
  • [23] Scalable retrieval and mining with optimal peer-to-peer configuration
    Chen, Jiann-Jone
    Hu, Chia-Jung
    Su, Chun-Rong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (02) : 209 - 220
  • [24] A peer-to-peer approach to parallel association rule mining
    Ishikawa, H
    Shioya, Y
    Omi, T
    Ohta, M
    Katayama, K
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2004, 3213 : 178 - 188
  • [25] Peer-to-Peer usage analysis: a distributed mining approach
    Masseglia, Florent
    Poncelet, Pascal
    Teisseire, Maguelonne
    20TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, PROCEEDINGS, 2006, : 993 - +
  • [26] Distributed Optimization Strategies for Mining on Peer-to-Peer Networks
    Dutta, Haimonti
    Matthur, Ananda
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 350 - +
  • [27] Credit risk modeling on data with two timestamps in peer-to-peer lending by gradient boosting
    Zhou, Ligang
    Fujita, Hamido
    Ding, Hao
    Ma, Rui
    APPLIED SOFT COMPUTING, 2021, 110
  • [28] Research on default risk of peer-to-peer online lending based on data mining algorithm
    Li, Xiao-Feng
    Zhang, Chang
    Lin, Xu-Chen
    Lv, Ting-Jie
    Liu, Lin-Lin
    Journal of Computers (Taiwan), 2020, 31 (02) : 83 - 100
  • [29] A Data Mining Based Publish/Subscribe System over Structured Peer-to-Peer Networks
    Song, Junping
    Wang, Haibo
    Lv, Pin
    Li, Shangzhou
    Xu, Menglu
    SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2015, 569 : 1 - 15
  • [30] Local L2-Thresholding Based Data Mining in Peer-to-Peer Systems
    Wolff, Ran
    Bhaduri, Kanishka
    Kargupta, Hillol
    PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 430 - 441