A comparison of forecasting models for the resource usage of MapReduce applications

被引:6
|
作者
Li, Yang Yuan [1 ]
Tien Van Do [2 ,3 ]
Nguyen, Hai T. [4 ]
机构
[1] Xian SiYuan Univ, Xian, Peoples R China
[2] Baoji Univ Arts & Sci, Baoji, Peoples R China
[3] Budapest Univ Technol & Econ, Dept Networked Syst & Serv, Magyar Tudosok Korutja 2, H-1117 Budapest, Hungary
[4] Budapest Univ Technol & Econ, Balatonfured Student Res Grp, Budapest, Hungary
关键词
MapReduce application; Resource usage parameters; LSTM model; Multiple linear regression model; TRAFFIC FLOW PREDICTION; NEURAL-NETWORKS; LSTM;
D O I
10.1016/j.neucom.2020.07.059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we construct forecasting models (multivariate long short-term memory recurrent neural networks and multiple linear regression) for the resource usage prediction of four MapReduce applica-tions and applications executed within the Spark framework. We have evaluated the impact of a sample size to prediction accuracy. Also, we propose a phase modelling approach for read/write-intensive applications. Our results show that models based on long short-term memory recurrent neural networks exhibit a higher accuracy than multiple linear regression models and the intensive characteristics of a resource are closely related to the prediction accuracy of forecasting models. We investigated the hyper parameter tuning of such models and showed that a randomly initialised, shallow, well-tuned network may outperform deeper models that use stacked autoencoder initialisation. Furthermore, multivariate long short-term memory recurrent neural network models are more sensitive to sample size than multiple linear regression models. We show that an LSTM model trained in a specific machine may be used to predict the resource usage in another machine. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:36 / 55
页数:20
相关论文
共 50 条
  • [1] Analysis of Resource Usage Profile for MapReduce Applications Using Hadoop on Cloud
    Liu, Zheyuan
    Mu, Dejun
    [J]. 2012 INTERNATIONAL CONFERENCE ON QUALITY, RELIABILITY, RISK, MAINTENANCE, AND SAFETY ENGINEERING (ICQR2MSE), 2012, : 1500 - 1504
  • [2] Stable Modeling on Resource Usage Parameters of MapReduce Application
    Li, Yangyuan
    [J]. BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2018, 9 (02): : 45 - 62
  • [3] Forecasting Data Center Resource Usage: An Experimental Comparison with Time-Series Methods
    Mazumdar, Somnath
    Kumar, Anoop S.
    [J]. PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2016), 2018, 614 : 151 - 165
  • [4] Is Machine Learning Necessary for Cloud Resource Usage Forecasting?
    Christofidi, Georgia
    Papaioannou, Konstantinos
    Doudali, Thaleia Dimitra
    [J]. PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON CLOUD COMPUTING, SOCC 2023, 2023, : 544 - 554
  • [5] Chisel: A Resource Savvy Approach for Handling Skew in MapReduce Applications
    Dhawalia, Prateek
    Kailasam, Sriram
    Janakiram, Dharanipragada
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 652 - 660
  • [6] PROSPECTIVE APPLICATIONS OF MODELS TO FORECASTING
    KREITZBERG, CW
    [J]. BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 1969, 50 (12) : 947 - +
  • [7] Profiling and Modeling Resource Usage of Virtualized Applications
    Wood, Timothy
    Cherkasova, Ludmila
    Ozonat, Kivanc
    Shenoy, Prashant
    [J]. MIDDLEWARE 2008, PROCEEDINGS, 2008, 5346 : 366 - +
  • [8] Performance Variations in Resource Scaling for MapReduce Applications on Private and Public Clouds
    Zhang, Fan
    Sakr, Majd
    [J]. 2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 457 - 466
  • [9] Toward Optimal Resource Provisioning for Cloud MapReduce and Hybrid Cloud Applications
    Ruiz-Alvarez, Arkaitz
    Humphrey, Marty
    [J]. 2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON BIG DATA COMPUTING (BDC), 2014, : 74 - 82
  • [10] Toward Optimal Resource Provisioning for Cloud MapReduce and Hybrid Cloud Applications
    Ruiz-Alvarez, Arkaitz
    Kim, In Kee
    Humphrey, Marty
    [J]. 2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, 2015, : 669 - 677