A dynamic, cost-aware, optimized data replication strategy for heterogeneous cloud data centers

Cited by: 58
Authors
Gill, Navneet Kaur [1]
Singh, Sarbjeet [1]
Affiliations
[1] Panjab Univ, UIET, Comp Sci & Engn, Chandigarh, India
Keywords
Data availability; Data replication; Cost of replication; Knapsack; Re-replication; Re-balancing; AVAILABILITY; ENVIRONMENTS
DOI
10.1016/j.future.2016.05.016
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
In cloud computing, it is important to maintain high data availability and system performance. Replication is used to meet these requirements. As the number of replicas of a data file increases, data availability and performance also increase, but so does the cost of creating and maintaining the new replicas. To obtain the maximum benefit from replication, it is essential to optimize the cost of replication. Cloud systems are heterogeneous in nature, as different data centers have different policies and different hardware and software configurations. As a result, the replicas of a data file placed at different data centers have different availabilities and replication costs associated with them. In this paper, a dynamic, cost-aware, optimized data replication strategy is proposed that identifies the minimum number of replicas required to ensure the desired availability. The concept of knapsack is used to optimize the cost of replication and to re-replicate replicas from higher-cost data centers to lower-cost data centers without compromising data availability. Mathematical descriptions and illustrations are provided for the different phases of the proposed strategy, keeping in mind the heterogeneous nature of the system. The proposed strategy has been simulated using the CloudSim toolkit. The experimental results indicate that the strategy is effective in optimizing the cost of replication and increasing data availability. © 2016 Elsevier B.V. All rights reserved.
Pages: 10-32
Number of pages: 23
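
The abstract describes choosing replicas so that a desired availability is met at the lowest replication cost, using a knapsack formulation. The abstract does not give the paper's equations, so the following is only a minimal sketch under stated assumptions, not the authors' method: replica failures are assumed independent (so a replica set is available with probability 1 - prod(1 - a_i)), each data center has a single integer cost, and every availability is strictly below 1. The function name `min_cost_replica_set` and the sample data centers are hypothetical.

```python
import math


def min_cost_replica_set(centers, target):
    """Pick the cheapest set of data centers whose combined availability
    meets `target`.

    centers: list of (name, availability, integer_cost) tuples (hypothetical data).
    Assumes independent failures and availabilities strictly below 1.
    Returns (total_cost, [names]) or None if the target cannot be reached.
    """
    # Taking logs turns the product of unavailabilities into a sum,
    # which gives the problem a 0/1 knapsack shape.
    need = -math.log(1.0 - target)
    gains = [-math.log(1.0 - a) for _, a, _ in centers]
    budget = sum(cost for _, _, cost in centers)

    # best[b] = (largest total gain reachable with cost <= b, chosen indices)
    best = [(0.0, ())] * (budget + 1)
    for i, (_, _, cost) in enumerate(centers):
        # Iterate budgets downwards so each data center is used at most once.
        for b in range(budget, cost - 1, -1):
            candidate = best[b - cost][0] + gains[i]
            if candidate > best[b][0]:
                best[b] = (candidate, best[b - cost][1] + (i,))

    # The smallest budget whose best gain reaches the target is the optimum.
    for b in range(budget + 1):
        if best[b][0] >= need - 1e-12:
            return b, [centers[i][0] for i in best[b][1]]
    return None


# Hypothetical heterogeneous data centers: (name, availability, cost)
sample = [("dc1", 0.90, 5), ("dc2", 0.80, 2), ("dc3", 0.70, 1), ("dc4", 0.95, 8)]
print(min_cost_replica_set(sample, 0.99))  # -> (8, ['dc1', 'dc2', 'dc3'])
```

In this sample, three cheaper and less reliable data centers together reach 99.4% availability at cost 8, whereas any set that includes the most reliable data center and meets the target costs at least 10. This illustrates why heterogeneous costs and availabilities have to be considered jointly.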