Data Centers Selection for Moving Geo-distributed Big Data to Cloud

被引:3
|
作者
Zhang, Jiangtao [1 ]
Yuan, Qiang [2 ,3 ]
Chen, Shi [2 ,3 ]
Huang, Hejiao [2 ,3 ]
Wang, Xuan [2 ,4 ]
机构
[1] Shenzhen Jingyi Smart Technol Co Ltd, Shenzhen, Peoples R China
[2] Harbin Inst Technol, Shenzhen Grad Sch, Sch Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
[3] Shenzhen Key Lab Internet Informat Collaborat, Shenzhen, Peoples R China
[4] Shenzhen Appl Technol Engn Lab Internet Multimedi, Shenzhen, Peoples R China
来源
JOURNAL OF INTERNET TECHNOLOGY | 2019年 / 20卷 / 01期
基金
中国国家自然科学基金;
关键词
Big data; Data centers selection; Distributed cloud computing; Cost minimization; APPROXIMATION ALGORITHMS; RESOURCE PROVISION; G-HADOOP; MAPREDUCE;
D O I
10.3966/160792642019012001010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Because of the distributed networking and coexistent abundant computation and storage resources, cloud computing has become a preferred platform for big data analytics, especially for the geo-distributed data across the world. The precondition for data processing is to move the data to the cloud. Due to the large volume of data, high transmission cost across continents and even specific legal prohibition, it is not always feasible to move all data to one data center. Appropriate data centers should be selected while keeping fast data access and low cost. In this paper, four criteria of the problem are explored. A tight 3-approximation algorithm is proposed to address the former two criteria. It can be simplified when the underlying bipartite graph is complete. The latter two criteria are addressed by a heuristic. Comparing to the optimal method and other schemes, extensive simulations demonstrate that the proposed algorithms can find rather good solutions with less time, and hence are more appropriate for large scale applications.
引用
收藏
页码:111 / 122
页数:12
相关论文
共 50 条
  • [31] Distributed Data Strategies to Support Large-Scale Data Analysis Across Geo-Distributed Data Centers
    Emara, Tamer Z.
    Huang, Joshua Zhexue
    [J]. IEEE ACCESS, 2020, 8 : 178526 - 178538
  • [32] QoS-Aware Data Placement for MapReduce Applications in Geo-Distributed Data Centers
    Chen, Wuhui
    Liu, Baichuan
    Paik, Incheon
    Li, Zhenni
    Zheng, Zibin
    [J]. IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2021, 68 (01) : 120 - 136
  • [33] Congestion-Aware Traffic Allocation for Geo-Distributed Data Centers
    Tao, Xiaoyi
    Ota, Kaoru
    Dong, Mianxiong
    Borjigin, Wuyunzhaola
    Qi, Heng
    Li, Keqiu
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (03) : 1675 - 1687
  • [34] Hierarchical Approach for Efficient Workload Management in Geo-Distributed Data Centers
    Forestiero, Agostino
    Mastroianni, Carlo
    Meo, Michela
    Papuzzo, Giuseppe
    Sheikhalishahi, Mehdi
    [J]. IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2017, 1 (01): : 97 - 111
  • [35] SCISPACE: A scientific collaboration workspace for geo-distributed HPC data centers
    Khan, Awais
    Kim, Taeuk
    Byun, Hyunki
    Kim, Youngjae
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 101 : 398 - 409
  • [36] Workload-Aware Scheduling Across Geo-distributed Data Centers
    Jin, Yibo
    Gao, Yuan
    Qian, Zhuzhong
    Zhai, Mingyu
    Peng, Hui
    Lu, Sanglu
    [J]. 2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1455 - 1462
  • [37] Cost Efficient Design of Fault Tolerant Geo-Distributed Data Centers
    Tripathi, Rakesh
    Vignesh, S.
    Tamarapalli, Venkatesh
    Medhi, Deep
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2017, 14 (02): : 289 - 301
  • [39] Placement of High Availability Geo-Distributed Data Centers in Emerging Economies
    Liu, Ruiyun
    Sun, Weiqiang
    Hu, Weisheng
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (03) : 3274 - 3288
  • [40] Reducing the expenses of geo-distributed data centers with portable containerized modules
    Brocanelli, Marco
    Zheng, Wenli
    Wang, Xiaorui
    [J]. PERFORMANCE EVALUATION, 2014, 79 : 104 - 119