Performance modeling of big data applications in the cloud centers

被引:0
|
作者
Chao Shen
Weiqin Tong
Jenq-Neng Hwang
Qiang Gao
机构
[1] Shanghai University,School of Computer Engineering and Science
[2] University of Washington,Department of Electrical Engineering
来源
关键词
Cloud computing; Big data; Performance modeling; Embedded Markov chain; Response time;
D O I
暂无
中图分类号
学科分类号
摘要
Cloud computing has evolved as an efficient paradigm to process big data applications. Performance evaluation of cloud center is a necessary prerequisite to guarantee quality of service. However, it is a challenge task to effectively analyze the performance of cloud service due to the complexity of cloud resources and the diversity of big data applications. In this paper, we leverage queuing theory and probabilistic statistics to propose a performance evaluation model for cloud center under big data application arrivals. In this model, the tasks (i.e., big data applications) are with Poisson arrivals, each task is divided into lots of parallel subtasks, and the number of subtasks follows a general distribution. The model allows to calculate the important performance indicators such as mean number of subtasks in the system, the probability that a task obtains immediate service, task waiting time and blocking probability. The model can also be used to predict the time cost of performing application. Finally, we use the simulations and benchmarking running WordCount and TeraSort applications on a Hadoop platform to demonstrate the utility of the model.
引用
收藏
页码:2258 / 2283
页数:25
相关论文
共 50 条
  • [1] Performance modeling of big data applications in the cloud centers
    Shen, Chao
    Tong, Weiqin
    Hwang, Jenq-Neng
    Gao, Qiang
    [J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (05): : 2258 - 2283
  • [2] Ensemble: A Tool for Performance Modeling of Applications in Cloud Data Centers
    Chen, Jin
    Soundararajan, Gokul
    Ghanbari, Saeed
    Iorio, Francesco
    Hashemi, Ali B.
    Amza, Cristiana
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2016, 4 (01) : 20 - 33
  • [3] I/O Performance Modeling for Big Data Applications over Cloud Infrastructures
    Mytilinis, Ioannis
    Tsoumakos, Dimitrios
    Kantere, Verena
    Nanos, Anastassios
    Koziris, Nectarios
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2015), 2015, : 201 - 206
  • [4] Predicting the performance of big data applications on the cloud
    Ardagna, D.
    Barbierato, E.
    Gianniti, E.
    Gribaudo, M.
    Pinto, T. B. M.
    da Silva, A. P. C.
    Almeida, J. M.
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (02): : 1321 - 1353
  • [5] Predicting the performance of big data applications on the cloud
    D. Ardagna
    E. Barbierato
    E. Gianniti
    M. Gribaudo
    T. B. M. Pinto
    A. P. C. da Silva
    J. M. Almeida
    [J]. The Journal of Supercomputing, 2021, 77 : 1321 - 1353
  • [6] Performance Evaluation of Big Data Applications in Cloud Providers
    Dourado, Leonardo dos Santos
    Miranda, Richard Siqueira
    de Araujo, Aleteia P. F.
    Ishikawa, Edson
    [J]. 2020 15TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI'2020), 2020,
  • [7] Survey of Performance Modeling of Big Data Applications
    Pattanshetti, Tanuja
    Attar, Vahida
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 177 - 181
  • [8] A Hybrid Machine Learning Approach for Performance Modeling of Cloud-Based Big Data Applications
    Ataie, Ehsan
    Evangelinou, Athanasia
    Gianniti, Eugenio
    Ardagna, Danilo
    [J]. COMPUTER JOURNAL, 2022, 65 (12): : 3123 - 3140
  • [9] Modeling and Analysis of Performance and Energy Consumption in Cloud Data Centers
    El Kafhali, Said
    Salah, Khaled
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) : 7789 - 7802
  • [10] Modeling and Analysis of Performance and Energy Consumption in Cloud Data Centers
    Said El Kafhali
    Khaled Salah
    [J]. Arabian Journal for Science and Engineering, 2018, 43 : 7789 - 7802