Optimal and Efficient Distributed Online Learning for Big Data

被引:3
|
作者
Sayin, Muhammed O. [1 ]
Vanli, N. Denizcan [1 ]
Delibalta, Ibrahim [2 ,3 ]
Kozat, Suleyman S. [1 ]
机构
[1] Bilkent Univ, Dept Elect & Elect Engn, Ankara, Turkey
[2] AVEA Commun Serv Inc, AveaLabs, Istanbul, Turkey
[3] Koc Univ, Grad Sch Social Sci & Humanities, Istanbul, Turkey
关键词
distributed processing; online learning; optimal and efficient; static state estimation; Big Data; smart grid; DIFFUSION STRATEGIES; STATE ESTIMATION; CONSENSUS; NETWORKS; SCHEME;
D O I
10.1109/BigDataCongress.2015.27
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We propose optimal and efficient distributed online learning strategies for Big Data applications. Here, we consider the optimal state estimation over distributed network of autonomous data sources. The autonomous data sources can generate and process data locally irrespective of any centralized control unit. We seek to enhance the learning rate through the distributed control of those autonomous data sources. We emphasize that although this problem attracted significant attention and extensively studied in different fields including services computing and machine learning disciplines, all the well-known strategies achieve suboptimal online learning performance in the mean square error sense. To this end, we introduce the oracle algorithm as the optimal distributed online learning strategy. We also propose the optimal and efficient distributed online learning algorithm that reduces the communication load tremendously, i.e., requires the undirected disclosure of only a single scalar. Finally, we demonstrate the significant performance gains due to the proposed strategies with respect to the state-of-the-art approaches.
引用
收藏
页码:126 / 133
页数:8
相关论文
共 50 条
  • [1] Differential Privacy and Distributed Online Learning for Wireless Big Data
    Li, Chencheng
    Zhou, Pan
    Jiang, Tao
    2015 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2015,
  • [2] Distributed Private Online Learning for Social Big Data Computing over Data Center Networks
    Li, Chencheng
    Zhou, Pan
    Zhou, Yingxue
    Bian, Kaigui
    Jiang, Tao
    Rahardja, Susanto
    2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016, : 392 - 397
  • [3] Big data for online learning systems
    Dahdouh, Karim
    Dakkak, Ahmed
    Oughdir, Lahcen
    Messaoudi, Faycal
    EDUCATION AND INFORMATION TECHNOLOGIES, 2018, 23 (06) : 2783 - 2800
  • [4] Big data for online learning systems
    Karim Dahdouh
    Ahmed Dakkak
    Lahcen Oughdir
    Fayçal Messaoudi
    Education and Information Technologies, 2018, 23 : 2783 - 2800
  • [5] An Efficient Distributed Algorithm for Big Data Processing
    Al-kahtani, Mohammed S.
    Karim, Lutful
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2017, 42 (08) : 3149 - 3157
  • [6] An Efficient Distributed Algorithm for Big Data Processing
    Mohammed S. Al-kahtani
    Lutful Karim
    Arabian Journal for Science and Engineering, 2017, 42 : 3149 - 3157
  • [7] ClowdFlows: Online workflows for distributed big data mining
    Kranjc, Janez
    Orac, Roman
    Podpecan, Vid
    Lavrac, Nada
    Robnik-Sikonja, Marko
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 68 : 38 - 58
  • [8] Communication efficient distributed learning of neural networks in Big Data environments using Spark
    Alkhoury, Fouad
    Wegener, Dennis
    Sylla, Karl-Heinz
    Mock, Michael
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3871 - 3877
  • [9] Online Learning Research Based on Big Data
    Zhang, Shihong
    2016 2ND INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SUPPORTED EDUCATION (FCSE 2016), 2016, : 70 - 74
  • [10] Online Similarity Learning for Big Data with Overfitting
    Cong, Yang
    Liu, Ji
    Fan, Baojie
    Zeng, Peng
    Yu, Haibin
    Luo, Jiebo
    IEEE TRANSACTIONS ON BIG DATA, 2018, 4 (01) : 78 - 89