From BigBench to TPCx-BB: Standardization of a Big Data Benchmark

被引:0
|
作者
Cao, Paul [1 ]
Gowda, Bhaskar [2 ]
Lakshmi, Seetha [3 ]
Narasimhadevara, Chinmayi [4 ]
Nguyen, Patrick [5 ]
Poelman, John [6 ]
Poess, Meikel [7 ]
Rabl, Tilmann [8 ,9 ]
机构
[1] Hewlett Packard Enterprise, Palo Alto, CA USA
[2] Intel Corp, Hillsboro, OR 97124 USA
[3] Actian Corp, Palo Alto, CA USA
[4] Cisco Syst Inc, San Jose, CA USA
[5] Microsoft Corp, Redmond, WA 98052 USA
[6] IBM Corp, San Jose, CA USA
[7] Oracle Corp, Redwood City, CA USA
[8] Tech Univ Berlin, Berlin, Germany
[9] DFKI GmbH, Berlin, Germany
关键词
SCALE;
D O I
10.1007/978-3-319-54334-5_3
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the increased adoption of Hadoop-based big data systems for the analysis of large volume and variety of data, an effective and common benchmark for big data deployments is needed. There have been a number of proposals from industry and academia to address this challenge. While most either have basic workloads (e.g. word counting), or port existing benchmarks to big data systems (e.g. TPC-H or TPC-DS), some are specifically designed for big data challenges. The most comprehensive proposal among these is the BigBench benchmark, recently standardized by the Transaction Processing Performance Council as TPCx-BB. In this paper, we discuss the progress made since the original BigBench proposal to the standardized TPCx-BB. In addition, we will share the thought process went into creating the specification, challenges in navigating the uncharted territories of a complex benchmark for a fast moving technology domain, and analyze the functionality of the benchmark suite on different Hadoop- and non-Hadoop-based big data engines. We will provide insights on the first official result of TPCx-BB and finally discuss, in brief, other relevant and fast growing big data analytic use cases to be addressed in future big data benchmarks.
引用
收藏
页码:24 / 44
页数:21
相关论文
共 50 条
  • [1] Amdahl's Law in Big Data Analytics: Alive and Kicking in TPCx-BB (BigBench)
    Richins, Daniel
    Ahmed, Tahrina
    Clapp, Russell
    Reddi, Vijay Janapa
    2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2018, : 630 - 642
  • [2] BigBench Specification V0.1 BigBench: An Industry Standard Benchmark for Big Data Analytics
    Rabl, Tilmann
    Ghazal, Ahmad
    Hu, Minqing
    Crolotte, Alain
    Raab, Francois
    Poess, Meikel
    Jacobsen, Hans-Arno
    SPECIFYING BIG DATA BENCHMARKS, 2014, 8163 : 164 - 201
  • [3] Discussion of BigBench: A Proposed Industry Standard Performance Benchmark for Big Data
    Baru, Chaitanya
    Bhandarkar, Milind
    Curino, Carlo
    Danisch, Manuel
    Frank, Michael
    Gowda, Bhaskar
    Jacobsen, Hans-Arno
    Jie, Huang
    Kumar, Dileep
    Nambiar, Raghunath
    Poess, Meikel
    Raab, Francois
    Rabl, Tilmann
    Ravi, Nishkam
    Sachs, Kai
    Sen, Saptak
    Yi, Lan
    Youn, Choonhan
    PERFORMANCE CHARACTERIZATION AND BENCHMARKING: TRADITIONAL TO BIG DATA, 2015, 8904 : 44 - 63
  • [4] BigDataBench: a Big Data Benchmark Suite from Internet Services
    Wang, Lei
    Zhan, Jianfeng
    Luo, Chunjie
    Zhu, Yuqing
    Yang, Qiang
    He, Yongqiang
    Gao, Wanling
    Jia, Zhen
    Shi, Yingje
    Zhang, Shuji
    Zheng, Chen
    Lu, Gang
    Zhan, Kent
    Li, Xiaona
    Qiu, Bizhu
    2014 20TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA-20), 2014, : 488 - 499
  • [5] Introducing TPCx-HS: The First Industry Standard for Benchmarking Big Data Systems
    Nambiar, Raghunath
    Poess, Meikel
    Dey, Akon
    Cao, Paul
    Magdon-Ismail, Tariq
    Ren, Da Qi
    Bond, Andrew
    PERFORMANCE CHARACTERIZATION AND BENCHMARKING: TRADITIONAL TO BIG DATA, 2015, 8904 : 1 - 12
  • [6] A Reliability Benchmark for Big Data Systems on JointCloud
    Zheng, Yingying
    Xu, Lijie
    Wang, Wei
    Zhou, Wei
    Ding, Ying
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2017, : 306 - 310
  • [7] Anomaly Detection for Big Data Security: A Benchmark
    Es-Samaali, Hamza H.
    Outchakoucht, Aissam A.
    Benhadou, Siham S.
    Mounnan, Oussama O.
    Abou El Kalam, Anas A.
    2021 THE 3RD INTERNATIONAL CONFERENCE ON BIG DATA ENGINEERING AND TECHNOLOGY, BDET 2021, 2021, : 35 - 39
  • [8] Testing of big data analytics systems by benchmark
    Chen, Mingang
    Chen, Wenjie
    Cai, Lizhi
    2018 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW), 2018, : 231 - 238
  • [9] Developing the Raster Big Data Benchmark: A Comparison of Raster Analysis on Big Data Platforms
    Haynes, David
    Mitchell, Philip
    Shook, Eric
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (11)
  • [10] Big Data Architectures Benchmark for Forecasting Electricity Consumption
    2020, Institute of Electrical and Electronics Engineers Inc.