A fast online learning algorithm for distributed mining of bigdata

Cited by: 0
Authors
Zhang, Yu [1]
Sow, Daby [2]
Turaga, Deepak [2]
Van Der Schaar, Mihaela [1]
Affiliations
[1] University of California, Los Angeles, CA, United States
[2] IBM T.J. Watson Research Center, United States
Source
Performance Evaluation Review | 2014 / Vol. 41 / Issue 04
Funding
U.S. National Science Foundation
Keywords
Big data - Data mining - E-learning - Learning systems - Online systems
DOI
10.1145/2627534.2627562
Abstract
BigData analytics requires distributed mining of numerous data streams to be performed in real time. Designing such distributed mining systems poses unique challenges, including online adaptation to incoming data characteristics, online processing of large amounts of heterogeneous data, and limited data access and communication capabilities between distributed learners. We propose a general framework for distributed data mining and develop an efficient online learning algorithm based on it. Our framework consists of an ensemble learner and multiple local learners, which can only access different parts of the incoming data. By exploiting the correlations of the learning models among local learners, our proposed learning algorithm can optimize the prediction accuracy while requiring significantly less information exchange and computational complexity than existing state-of-the-art learning solutions.
Pages: 90 - 93
Related papers
50 in total
  • [41] A fast greedy algorithm for outlier mining
    He, Zengyou
    Deng, Shengchun
    Xu, Xiaofei
    Huang, Joshua Zhexue
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 567 - 576
  • [42] A Fast Algorithm of Mining Induced Subtrees
    Li, Yun
    Guo, Xin
    Yuan, Yunhao
    Wu, Jia
    Chen, Ling
    2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 195 - 199
  • [43] Fast Cooperative Distributed Learning
    Jakovetic, Dusan
    Moura, Jose M. F.
    Xavier, Joao
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1513 - 1517
  • [44] Design of Online Learning Efficiency Evaluation Algorithm for College English Based on Data Mining
    Li, Hui
    ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2022, PT I, 2023, 468 : 537 - 548
  • [45] A Comprehensive Survey and Open Challenges of Mining Bigdata
    Tidke, Bharat
    Mehta, Rupa
    Dhanani, Jenish
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS (ICTIS 2017) - VOL 1, 2018, 83 : 441 - 448
  • [46] BCD : BigData, Cloud Computing and Distributed Computing
    Grover, Purva
    Johari, Rahul
    2015 GLOBAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (GCCT), 2015, : 764 - 768
  • [47] The anatomy of a distributed predictive modeling framework: online learning, blockchain network, and consensus algorithm
    Kuo, Tsung-Ting
    JAMIA OPEN, 2020, 3 (02) : 201 - 208
  • [48] An Online Learning Algorithm for Distributed Task Offloading in Multi-Access Edge Computing
    Sun, Zhenfeng
    Nakhai, Mohammad Reza
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) : 3090 - 3102
  • [49] Online data stream mining in distributed sensor network
    Zolotová, Iveta
    Lojka, Tomáš
    WSEAS Transactions on Circuits and Systems, 2014, 13 : 412 - 421
  • [50] ClowdFlows: Online workflows for distributed big data mining
    Kranjc, Janez
    Orac, Roman
    Podpecan, Vid
    Lavrac, Nada
    Robnik-Sikonja, Marko
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 68 : 38 - 58