Analyzing the Performance of Data Replication and Data Partitioning in the Cloud: the BEOWULF Approach

被引:0
|
作者
Stiemer, Alexander [1 ]
Fetai, Ilir [2 ]
Schuldt, Heiko [1 ]
机构
[1] Univ Basel, Dept Math & Comp Sci, Basel, Switzerland
[2] Swiss Distance Univ Appl Sci, Basel, Switzerland
关键词
cloud data management; data replication; data partitioning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Applications deployed in the Cloud usually come with dedicated performance and availability requirements. This can be achieved by replicating data across several sites and/or by partitioning data. Data replication allows to parallelize read requests and thus to decrease data access latency, but induces significant overhead for the synchronization of updates. Partitioning, in contrast, is highly beneficial if all the data accessed by an application is located at the same site, but again necessitates coordination if distributed transactions are needed to serve applications. In this paper, we analyze three protocols for distributed data management in the Cloud, namely Read-One-Write-All-Available (ROWAA), Majority Quorum (MQ) and Data Partitioning (DP) - all in a configuration that guarantees strong consistency. We introduce BEOWULF, a meta protocol based on a comprehensive cost model that integrates the three protocols and that dynamically selects the protocol with the lowest latency for a given workload. In the evaluation, we compare the prediction of the BEOWULF cost model with a baseline evaluation. The results nicely show the effectiveness of the analytical model and the precision in selecting the best suited protocol for a given workload.
引用
收藏
页码:2837 / 2846
页数:10
相关论文
共 50 条
  • [41] A review on data replication strategies in cloud systems
    Mokadem, Riad
    Martinez-Gil, Jorge
    Hameurlain, Abdelkader
    Kueng, Josef
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2022, 13 (04) : 347 - 362
  • [42] Data replication schemes in cloud computing: a survey
    Shakarami, Ali
    Ghobaei-Arani, Mostafa
    Shahidinejad, Ali
    Masdari, Mohammad
    Shakarami, Hamid
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03): : 2545 - 2579
  • [43] Cost Optimization for Dynamic Replication and Migration of Data in Cloud Data Centers
    Mansouri, Yaser
    Toosi, Adel Nadjaran
    Buyya, Rajkumar
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2019, 7 (03) : 705 - 718
  • [44] Dynamic Replication Algorithm for Data Replication to Improve System Availability: A Performance Engineering Approach
    Vobugari, Sreekumar
    Somayajulu, D. V. L. N.
    Subaraya, B. M.
    IETE JOURNAL OF RESEARCH, 2015, 61 (02) : 132 - 141
  • [45] Scalable Data Partitioning Techniques for Distributed Data Processing in Cloud Environments: A Review
    Ponnusamy, Sivakumar
    Gupta, Pankaj
    IEEE ACCESS, 2024, 12 : 26735 - 26746
  • [46] Data Replication Approach with Consistency Guarantee for Data Grid
    Abawajy, Jemal H.
    Deris, Mustafa Mat
    IEEE TRANSACTIONS ON COMPUTERS, 2014, 63 (12) : 2975 - 2987
  • [47] PDDS - Improving Cloud Data Storage Security Using Data Partitioning Technique
    Selvakumar, C.
    Rathanam, G. Jeeva
    Sumalatha, M. R.
    PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 7 - 11
  • [48] Nonparametric methods for analyzing replication origins in genomewide data
    Ghosh D.
    Functional & Integrative Genomics, 2005, 5 (1) : 28 - 31
  • [49] Cost-Efficient Partitioning of Spatial Data on Cloud
    Akdogan, Afsin
    Indrakanti, Saratchandra
    Demiryurek, Ugur
    Shahabi, Cyrus
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 501 - 506
  • [50] Access Protocols in Data Partitioning Based Cloud Storage
    Ye, Yunqi
    Xiao, Liangliang
    Chen, Yinzi
    Yen, I-Ling
    Bastani, Farokh
    Chen, Ing-Ray
    2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 398 - 405