Meta-learning for Large Scale Machine Learning with MapReduce

Authors
Liu, Xuan [1]
Wang, Xiaoguang [1]
Matwin, Stan [2]
Japkowicz, Nathalie [1]
Affiliations
[1] Univ Ottawa, Sch EECS, Ottawa, ON, Canada
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
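Keywords
MapReduce; meta-learning; big data; parallel computing; AdaBoost
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812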
Abstract
We have entered the age of big data, and extracting knowledge from massive datasets is becoming both more rewarding and more urgent. MapReduce provides a feasible framework for programming machine learning algorithms as Map and Reduce functions, and its relatively simple programming interface has helped to address the scalability problems of machine learning algorithms. However, the framework has an obvious weakness: it does not support iteration. This makes it difficult for algorithms that require iterations to fully exploit the efficiency of MapReduce. In this paper, we propose applying meta-learning programmed with MapReduce, which avoids having to parallelize the machine learning algorithms themselves while still improving their scalability to big datasets. Experiments conducted on Hadoop in fully distributed mode on Amazon EC2 demonstrate that our algorithm, PML, significantly reduces the computational complexity of training as the number of computing nodes increases, while achieving lower error rates than training on a single node. A comparison of PML with a contemporary parallelized AdaBoost algorithm, AdaBoost.PL, shows that PML has lower error rates.
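The general idea in the abstract, training independent base learners on data partitions in a Map step and combining them in a Reduce step without iterating across nodes, can be illustrated with a minimal single-process sketch. This is not the authors' PML algorithm: the record does not give its details. The sketch assumes a one-feature decision stump as a stand-in base learner (the paper's keywords point to AdaBoost) and an unweighted majority vote as the combiner. All names here (train_stump, map_phase, reduce_phase) are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Sample = Tuple[float, int]           # (feature value, label in {-1, +1})
Classifier = Callable[[float], int]  # maps a feature value to a label


def train_stump(partition: Sequence[Sample]) -> Classifier:
    """Fit a threshold classifier (decision stump) on one data partition."""
    best_err, best_thr, best_sign = len(partition) + 1, 0.0, 1
    for thr, _ in partition:          # candidate thresholds: observed values
        for sign in (1, -1):          # which side of the threshold is +1
            err = sum(1 for x, y in partition
                      if (sign if x >= thr else -sign) != y)
            if err < best_err:
                best_err, best_thr, best_sign = err, thr, sign
    return lambda x: best_sign if x >= best_thr else -best_sign


def map_phase(partitions: Sequence[Sequence[Sample]]) -> List[Classifier]:
    """Map step (assumed): one base learner per partition, no communication."""
    return [train_stump(p) for p in partitions]


def reduce_phase(models: Sequence[Classifier]) -> Classifier:
    """Reduce step (assumed): combine base learners by unweighted vote."""
    def ensemble(x: float) -> int:
        return 1 if sum(m(x) for m in models) >= 0 else -1
    return ensemble


# Toy data: label is +1 when x >= 5, split round-robin into three partitions.
data = [(float(x), 1 if x >= 5 else -1) for x in range(9)]
partitions = [data[i::3] for i in range(3)]

models = map_phase(partitions)
ensemble = reduce_phase(models)
print(ensemble(8.0), ensemble(1.0))
```

Each base learner sees only its own partition, so the Map step needs a single pass and no iteration across nodes, which is the property the abstract relies on. In an actual Hadoop job, each call to train_stump would run inside a mapper and the vote would be assembled in a reducer.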
Pages: 6