Optimal and Efficient Distributed Online Learning for Big Data

Cited by: 3
Authors
Sayin, Muhammed O. [1]
Vanli, N. Denizcan [1]
Delibalta, Ibrahim [2, 3]
Kozat, Suleyman S. [1]
Affiliations
[1] Bilkent Univ, Dept Elect & Elect Engn, Ankara, Turkey
[2] AVEA Commun Serv Inc, AveaLabs, Istanbul, Turkey
[3] Koc Univ, Grad Sch Social Sci & Humanities, Istanbul, Turkey
Keywords
distributed processing; online learning; optimal and efficient; static state estimation; Big Data; smart grid; DIFFUSION STRATEGIES; STATE ESTIMATION; CONSENSUS; NETWORKS; SCHEME;
DOI
10.1109/BigDataCongress.2015.27
Chinese Library Classification number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
We propose optimal and efficient distributed online learning strategies for Big Data applications. Here, we consider optimal state estimation over a distributed network of autonomous data sources. The autonomous data sources can generate and process data locally, irrespective of any centralized control unit. We seek to enhance the learning rate through the distributed control of those autonomous data sources. We emphasize that although this problem has attracted significant attention and has been extensively studied in different fields, including the services computing and machine learning disciplines, all the well-known strategies achieve suboptimal online learning performance in the mean square error sense. To this end, we introduce the oracle algorithm as the optimal distributed online learning strategy. We also propose the optimal and efficient distributed online learning algorithm, which reduces the communication load tremendously, i.e., requires the undirected disclosure of only a single scalar. Finally, we demonstrate the significant performance gains due to the proposed strategies with respect to the state-of-the-art approaches.
Pages: 126-133
Number of pages: 8
Related papers
50 in total
  • [31] Energy-Efficient Analytics for Geographically Distributed Big Data
    Zhao, Peng
    Yang, Xinyu
    Lin, Jie
    Yang, Shusen
    Yu, Wei
    IEEE INTERNET COMPUTING, 2019, 23 (03) : 18 - 29
  • [32] Efficient Distributed Database Clustering Algorithm for Big Data Processing
    Li, Liantian
    2021 6TH INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2021), 2021, : 495 - 498
  • [33] Online learning for distributed optimal control of an electric vehicle fleet
    Latimier, R. Le Goff
    Cherot, G.
    Ben Ahmed, H.
    ELECTRIC POWER SYSTEMS RESEARCH, 2022, 212
  • [34] User online behavior based on big data distributed clustering algorithm
    Wang, Yan
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (02):
  • [35] Online Distributed IoT Security Monitoring With Multidimensional Streaming Big Data
    Li, Fangyu
    Xie, Rui
    Wang, Zengyan
    Guo, Lulu
    Ye, Jin
    Ma, Ping
    Song, Wenzhan
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (05) : 4387 - 4394
  • [36] Efficient fused learning for distributed imbalanced data
    Zhou, Jie
    Shen, Guohao
    Chen, Xuan
    Lin, Yuanyuan
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2022, 51 (05) : 1306 - 1317
  • [37] Efficient federated learning for distributed neuroimaging data
    Thapaliya, Bishal
    Ohib, Riyasat
    Geenjaar, Eloy
    Liu, Jingyu
    Calhoun, Vince
    Plis, Sergey M.
    FRONTIERS IN NEUROINFORMATICS, 2024, 18
  • [38] Efficient Online Reinforcement Learning with Offline Data
    Ball, Philip J.
    Smith, Laura
    Kostrikov, Ilya
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [39] Distributed Stochastic Aware Random Forests - Efficient Data Mining for Big Data
    Assuncao, Joaquim
    Fernandes, Paulo
    Lopes, Lucelene
    Normey, Silvio
    2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 425 - 426
  • [40] A sharing data approach oriented to distributed online learning
    Zhang Y.
    Liu W.
    Shao L.-S.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (08) : 1871 - 1880