I/O Performance Modeling for Big Data Applications over Cloud Infrastructures

被引:2
|
作者
Mytilinis, Ioannis [1 ]
Tsoumakos, Dimitrios [2 ]
Kantere, Verena [3 ]
Nanos, Anastassios [1 ]
Koziris, Nectarios [1 ]
机构
[1] Natl Tech Univ Athens, Comp Syst Lab, GR-10682 Athens, Greece
[2] Ionian Univ, Dept Informat, Corfu, Greece
[3] Univ Geneva, Inst Serv Sci, CH-1211 Geneva 4, Switzerland
关键词
D O I
10.1109/IC2E.2015.29
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big Data applications receive an ever-increasing amount of attention, thus becoming a dominant class of applications that are deployed over virtualized environments. Cloud environments entail a large amount of complexity relative to I/O performance. The use of Big Data increases the complexity of I/O management as well as its characterization and prediction: As I/O operations become growingly dominant in such applications, the intricacies of virtualization, different storage backends and deployment setups significantly hinder our ability to analyze and correctly predict I/O performance. To that end, this work proposes an end-to-end modeling technique to predict performance of I/O-intensive Big Data applications running over cloud infrastructures. We develop a model tuned over application and infrastructure dimensions: Primitive I/O operations, data access patterns, storage back ends and deployment parameters. The trained model can be used to predict both I/O but also general task performance. Our evaluation results show that for jobs which are dominated by I/O operations, such as I/O-bound MapReduce jobs, our model is capable of predicting execution time with an accuracy close to 90% that decreases as application processing becomes more complex.
引用
收藏
页码:201 / 206
页数:6
相关论文
共 50 条
  • [1] Performance modeling of big data applications in the cloud centers
    Chao Shen
    Weiqin Tong
    Jenq-Neng Hwang
    Qiang Gao
    [J]. The Journal of Supercomputing, 2017, 73 : 2258 - 2283
  • [2] Performance modeling of big data applications in the cloud centers
    Shen, Chao
    Tong, Weiqin
    Hwang, Jenq-Neng
    Gao, Qiang
    [J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (05): : 2258 - 2283
  • [3] Predicting the performance of big data applications on the cloud
    Ardagna, D.
    Barbierato, E.
    Gianniti, E.
    Gribaudo, M.
    Pinto, T. B. M.
    da Silva, A. P. C.
    Almeida, J. M.
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (02): : 1321 - 1353
  • [4] Predicting the performance of big data applications on the cloud
    D. Ardagna
    E. Barbierato
    E. Gianniti
    M. Gribaudo
    T. B. M. Pinto
    A. P. C. da Silva
    J. M. Almeida
    [J]. The Journal of Supercomputing, 2021, 77 : 1321 - 1353
  • [5] Improving I/O Performance with Adaptive Data Compression for Big Data Applications
    Zou, Hongbo
    Yu, Yongen
    Tang, Wei
    Chen, Hsuanwei Michelle
    [J]. PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 1229 - 1238
  • [6] A heuristic approach to the multicriteria design of IaaS cloud infrastructures for Big Data applications
    Arostegi, Maria
    Torre-Bastida, Ana
    Nekane Bilbao, Miren
    Del Ser, Javier
    [J]. EXPERT SYSTEMS, 2018, 35 (05)
  • [7] Performance Evaluation of Big Data Applications in Cloud Providers
    Dourado, Leonardo dos Santos
    Miranda, Richard Siqueira
    de Araujo, Aleteia P. F.
    Ishikawa, Edson
    [J]. 2020 15TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI'2020), 2020,
  • [8] Survey of Performance Modeling of Big Data Applications
    Pattanshetti, Tanuja
    Attar, Vahida
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 177 - 181
  • [9] Analytics over Big Data: Exploring the Convergence of Data Warehousing, OLAP and Data-Intensive Cloud Infrastructures
    Cuzzocrea, Alfredo
    [J]. 2013 IEEE 37TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2013, : 481 - 483
  • [10] Automotive Big Data: Applications, Workloads and Infrastructures
    Luckow, Andre
    Kennedy, Ken
    Manhardt, Fabian
    Djerekarov, Emil
    Vorster, Bennie
    Apon, Amy
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1201 - 1210