Cloud Analytics Benchmark

Cited by: 1
Authors
Van Renen, Alexander [1]
Leis, Viktor [2]
Affiliations
[1] Friedrich Alexander Univ Erlangen Nurnberg, Erlangen, Germany
[2] Tech Univ Munich, Munich, Germany
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023 / Vol. 16 / No. 06
Funding
European Research Council;
Keywords
COST;
DOI
10.14778/3583140.3583156
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The cloud facilitates the transition to a service-oriented perspective. This affects cloud-native data management in general, and data analytics in particular. Instead of managing a multi-node database cluster on premises, end users simply send queries to a managed cloud data warehouse and receive results. While this is obviously very attractive for end users, database system architects still have to engineer systems for this new service model. There are currently many competing architectures, ranging from self-hosted (Presto, PostgreSQL) through managed (Snowflake, Amazon Redshift) to query-as-a-service (Amazon Athena, Google BigQuery) offerings. Benchmarking these architectural approaches is currently difficult, and it is not even clear what the metrics for a comparison should be. To overcome these challenges, we first analyze a real-world query trace from Snowflake and compare its properties to those of TPC-H and TPC-DS. In doing so, we identify important differences that distinguish traditional benchmarks from real-world cloud data warehouse workloads. Based on this analysis, we propose the Cloud Analytics Benchmark (CAB). By incorporating workload fluctuations and multi-tenancy, CAB allows evaluating different designs in terms of user-centered metrics such as cost and performance.
Pages: 1413 - 1425
Page count: 13