Analyzing the Performance of Data Replication and Data Partitioning in the Cloud: the BEOWULF Approach

被引:0
|
作者
Stiemer, Alexander [1 ]
Fetai, Ilir [2 ]
Schuldt, Heiko [1 ]
机构
[1] Univ Basel, Dept Math & Comp Sci, Basel, Switzerland
[2] Swiss Distance Univ Appl Sci, Basel, Switzerland
关键词
cloud data management; data replication; data partitioning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Applications deployed in the Cloud usually come with dedicated performance and availability requirements. This can be achieved by replicating data across several sites and/or by partitioning data. Data replication allows to parallelize read requests and thus to decrease data access latency, but induces significant overhead for the synchronization of updates. Partitioning, in contrast, is highly beneficial if all the data accessed by an application is located at the same site, but again necessitates coordination if distributed transactions are needed to serve applications. In this paper, we analyze three protocols for distributed data management in the Cloud, namely Read-One-Write-All-Available (ROWAA), Majority Quorum (MQ) and Data Partitioning (DP) - all in a configuration that guarantees strong consistency. We introduce BEOWULF, a meta protocol based on a comprehensive cost model that integrates the three protocols and that dynamically selects the protocol with the lowest latency for a given workload. In the evaluation, we compare the prediction of the BEOWULF cost model with a baseline evaluation. The results nicely show the effectiveness of the analytical model and the precision in selecting the best suited protocol for a given workload.
引用
收藏
页码:2837 / 2846
页数:10
相关论文
共 50 条
  • [21] Ensuring performance and provider profit through data replication in cloud systems
    Uras Tos
    Riad Mokadem
    Abdelkader Hameurlain
    Tolga Ayav
    Sebnem Bora
    Cluster Computing, 2018, 21 : 1479 - 1492
  • [22] Ensuring performance and provider profit through data replication in cloud systems
    Tos, Uras
    Mokadem, Riad
    Hameurlain, Abdelkader
    Ayav, Tolga
    Bora, Sebnem
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (03): : 1479 - 1492
  • [23] Survey on data replication in cloud systems
    Rambabu, D.
    Govardhan, A.
    WEB INTELLIGENCE, 2024, 22 (01) : 83 - 109
  • [24] An Efficient Approach for Data Replication in Data Grid
    Ebrahimi, M.
    Horri, A.
    Dastghaibyfard, Gh.
    Nasrolahi, P.
    2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 67 - 70
  • [25] Secured data partitioning in multi cloud environment
    Hasan, Hazila
    Chuprat, Suriayati
    2014 4TH WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES (WICT), 2014, : 146 - 151
  • [26] Performance analysis on data replication in data grid
    Chen, Jing
    Kong, Lingfu
    Liu, Mingxin
    Wang, Xuan
    2005 International Symposium on Computer Science and Technology, Proceedings, 2005, : 159 - 163
  • [27] A CSO-based approach for secure data replication in cloud computing environment
    N. Mansouri
    M. M. Javidi
    B. Mohammad Hasani Zade
    The Journal of Supercomputing, 2021, 77 : 5882 - 5933
  • [28] A CSO-based approach for secure data replication in cloud computing environment
    Mansouri, N.
    Javidi, M. M.
    Mohammad Hasani Zade, B.
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (06): : 5882 - 5933
  • [29] Analyzing large-scale genomic data with cloud data lakes
    Weintraub, Grisha
    Hadar, Noam
    Gudes, Ehud
    Dolev, Shlomi
    Birk, Ohad S.
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, SYSTOR 2023, 2023, : 142 - 142
  • [30] Adaptive Partitioning Using Partial Replication for Sensor Data
    Kalavadia, Bhumika
    Bhatia, Tarushi
    Padiya, Trupti
    Pandat, Ami
    Bhise, Minal
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, ICDCIT 2019, 2019, 11319 : 260 - 269