Trilogy: Data Placement to Improve Performance and Robustness of Cloud Computing

被引:0
|
作者
Hsu, Chin-Jung [1 ]
Freeh, Vincent W. [1 ]
Villanustre, Flavio [2 ]
机构
[1] North Carolina State Univ, Dept Comp Sci, Raleigh, NC 27695 USA
[2] LexisNexis Risk Solut, Alpharetta, GA USA
关键词
workload-aware data placement; data management; cloud computing; WEB SERVER PERFORMANCE; REPLICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Infrastructure as a Service, one of the most disruptive aspects of cloud computing, enables configuring a cluster for each application for each workload. When the workload changes, a cluster will be either underutilized (wasting resources) or unable to meet demand (incurring opportunity costs). Consequently, efficient cluster resizing requires proper data replication and placement. Our work reveals that coarse-grain, workload-aware replication addresses over-utilization but cannot resolve under-utilization. With fine-grain partitioning of the dataset, data replication can reduce both under-and over-utilization. In our empirical studies, compared to a naive uniform data replication a coarse-grain workload-aware replication increases throughput by 81% on a highly-skewed workload. A fine-grain scheme further reaches 166% increase. Furthermore, a surprisingly small increase in granularity is sufficient to obtain most benefits. Evaluations also show that maximizing the number of unique partitions per node increases robustness to tolerate workload deviation while minimizing this number reduces storage footprint.
引用
收藏
页码:2442 / 2451
页数:10
相关论文
共 50 条
  • [21] Computing Resources Scalability Performance Analysis in Cloud Computing Data Center
    Oumaima Ghandour
    Said El Kafhali
    Mohamed Hanini
    Journal of Grid Computing, 2023, 21
  • [22] Computing Resources Scalability Performance Analysis in Cloud Computing Data Center
    Ghandour, Oumaima
    El Kafhali, Said
    Hanini, Mohamed
    JOURNAL OF GRID COMPUTING, 2023, 21 (04)
  • [23] Performance Testing for Cloud Computing with Dependent Data Bootstrapping
    He, Sen
    Liu, Tianyi
    Lama, Palden
    Lee, Jaewoo
    Kim, In Kee
    Wang, Wei
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 666 - 678
  • [24] An Adaptive Data Placement Strategy in scientific workflows over Cloud Computing Environments
    Kim, Heewon
    Kim, Yoonhee
    NOMS 2018 - 2018 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2018,
  • [25] A data placement strategy based on clustering and consistent hashing algorithm in Cloud Computing
    Li, Qiang
    Wang, Kun
    Wei, Suwei
    Han, Xuefeng
    Xu, Lili
    Gao, Min
    2014 9TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA (CHINACOM), 2014, : 478 - 483
  • [26] DATA PLACEMENT IN ERA OF CLOUD COMPUTING: A SURVEY, TAXONOMY AND OPEN RESEARCH ISSUES
    Kaur, Avinash
    Gupta, Pooja
    Singh, Manpreet
    Nayyar, Anand
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2019, 20 (02): : 377 - 398
  • [27] Using Network Data to Improve Digital Investigation in Cloud Computing Environments
    Spiekermann, Daniel
    Eggendorfer, Tobias
    Keller, Joerg
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2015), 2015, : 98 - 105
  • [28] A Time-Driven Data Placement Strategy for a Scientific Workflow Combining Edge Computing and Cloud Computing
    Lin, Bing
    Zhu, Fangning
    Zhang, Jianshan
    Chen, Jiaqing
    Chen, Xing
    Xiong, Naixue N.
    Mauri, Jaime Lloret
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (07) : 4254 - 4265
  • [29] Virtual Machine Placement Strategies in Cloud Computing
    Bharathi, Divya P.
    Prakash, P.
    Kiran, Vamsee Krishna M.
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [30] Improving Performance of Cloud Computing and Big Data Technologies and Applications
    Zhenjiang Dong
    ZTE Communications, 2014, 12 (04) : 1 - 2