Data Centers Selection for Moving Geo-distributed Big Data to Cloud

Cited by: 3
Authors
Zhang, Jiangtao [1]
Yuan, Qiang [2, 3]
Chen, Shi [2, 3]
Huang, Hejiao [2, 3]
Wang, Xuan [2, 4]
Affiliations
[1] Shenzhen Jingyi Smart Technol Co Ltd, Shenzhen, Peoples R China
[2] Harbin Inst Technol, Shenzhen Grad Sch, Sch Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
[3] Shenzhen Key Lab Internet Informat Collaborat, Shenzhen, Peoples R China
[4] Shenzhen Appl Technol Engn Lab Internet Multimedi, Shenzhen, Peoples R China
Source
JOURNAL OF INTERNET TECHNOLOGY | 2019, Vol. 20, No. 1
Funding
National Natural Science Foundation of China;
Keywords
Big data; Data centers selection; Distributed cloud computing; Cost minimization; APPROXIMATION ALGORITHMS; RESOURCE PROVISION; G-HADOOP; MAPREDUCE;
DOI
10.3966/160792642019012001010
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Because of its distributed networking and the coexistence of abundant computation and storage resources, cloud computing has become a preferred platform for big data analytics, especially for data that is geo-distributed across the world. Data must be moved to the cloud before it can be processed. Because of the large data volume, the high cost of transmission across continents, and in some cases specific legal prohibitions, moving all data to a single data center is not always feasible. Appropriate data centers must therefore be selected while keeping data access fast and cost low. This paper explores four criteria of the problem. A tight 3-approximation algorithm is proposed for the first two criteria; it can be simplified when the underlying bipartite graph is complete. The last two criteria are addressed by a heuristic. Extensive simulations comparing the proposed algorithms with the optimal method and other schemes show that they find good solutions in less time, and hence are better suited to large-scale applications.
Pages: 111-122
Number of pages: 12