Cluster load based content distribution and speculative execution for geographically distributed cloud environment

被引:3
|
作者
Li, Chunlin [1 ,2 ]
Song, Mingyang [1 ]
Zhang, Qingchuan [2 ]
Luo, Youlong [1 ,3 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan 430063, Peoples R China
[2] Beijing Technol & Business Univ, Natl Engn Lab Agriprod Qual Traceabil, Beijing 100048, Peoples R China
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Geographically distributed cloud; Data placement; Speculative execution; Lagrange relaxation; DATA PLACEMENT;
D O I
10.1016/j.comnet.2021.107807
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The scale of big data has shown an explosive growth, which makes the processing of big data put forward higher requirements on data centers, and a single data center can no longer meet the needs of big data processing. To deal with this situation, a geographically distributed cloud system needs to be built. However, in the geographically distributed cloud system, each data center is distributed in different geographic locations, which makes the data placement operations in the geographically distributed cloud system lead to greater overhead. To solve this problem, this paper proposes a data placement strategy. This strategy comprehensively considers the data transmission latency, bandwidth cost, cloud server storage capacity, and load capacity during the data placement process, and formulates a data placement problem that minimizes the energy consumption of data transmission. Then the minimum set cover method based on Lagrangian relaxation is used to solve this problem and obtain the optimal data placement scheme. On the other hand, in a geographically distributed cloud data center, the execution progress of the job submitted by the user will be affected by the straggler task. To solve this problem, this paper proposes a speculative execution strategy for the geographically distributed cloud system. This strategy performs different speculative execution operations according to the state of the cluster load, and then calculates the load capacity of the nodes in the cluster. The node with the strongest load capacity in the cluster is used to perform speculative execution operations. Experimental results show that the proposed data placement strategy can effectively improve the performance of the energy consumption, the data storage cost, the network transmission cost and the data transmission time. The proposed speculative execution strategy can effectively improve the performance of the job completion time, cluster throughput and QoS satisfaction rate.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A Smart Strategy for Speculative Execution Based on Hardware Resource in a Heterogeneous Distributed Environment
    Liu, Qi
    Cai, Weidong
    Fu, Zhangjie
    Shen, Jian
    Linge, Nigel
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (02): : 203 - 213
  • [2] Load balancing algorithms with cluster in cloud environment
    Kshama, S. B.
    Shobha, K. R.
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2022, 28 (06) : 679 - 703
  • [3] Content sniffer based load distribution in a web server cluster
    Hyun, J
    Jung, IB
    Lee, J
    Maeng, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (07): : 1258 - 1269
  • [4] Bipartite Matching Based Speculative Execution to Improve Cloud MapReduce Performance
    Lin, Jenn-Wei
    Yen, Neil Yuwen
    [J]. 3RD INTERNATIONAL CONFERENCE ON APPLIED COMPUTING AND INFORMATION TECHNOLOGY (ACIT 2015) 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND INTELLIGENCE (CSI 2015), 2015, : 282 - 287
  • [5] A distributed program global execution control environment applied to load balancing
    Borkowski, Janusz
    KopáNski, Damian
    Laskowski, Eryk
    Olejnik, Richard
    Tudruj, Marek
    [J]. Scalable Computing, 2012, 13 (03): : 269 - 284
  • [6] A DISTRIBUTED PROGRAM GLOBAL EXECUTION CONTROL ENVIRONMENT APPLIED TO LOAD BALANCING
    Borkowski, Janusz
    Kopanski, Damian
    Laskowski, Eryk
    Olejnik, Richard
    Tudruj, Marek
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2012, 13 (03): : 269 - 284
  • [7] Mobile Execution Environment for Non-Intermediated Content Distribution
    Christophe, Benoit
    Narganes, Maribel
    Antila, Ville
    Maknavicius, Linas
    [J]. BELL LABS TECHNICAL JOURNAL, 2011, 15 (04) : 117 - 134
  • [8] Cluster Based Load Balancing in Cloud Computing
    Kapoor, Surbhi
    Dabas, Chetna
    [J]. 2015 EIGHTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2015, : 76 - 81
  • [9] Optimal data placement strategy considering capacity limitation and load balancing in geographically distributed cloud
    Li, Chunlin
    Cai, Qianqian
    Youlong, Lou
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 127 : 142 - 159
  • [10] Cluster-Based Join for Geographically Distributed Big RDF Data
    Yang, Fan
    Crainiceanu, Adina
    Chen, Zhiyuan
    Needham, Don
    [J]. 2019 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS 2019), 2019, : 170 - 178