From BigBench to TPCx-BB: Standardization of a Big Data Benchmark

被引:0
|
作者
Cao, Paul [1 ]
Gowda, Bhaskar [2 ]
Lakshmi, Seetha [3 ]
Narasimhadevara, Chinmayi [4 ]
Nguyen, Patrick [5 ]
Poelman, John [6 ]
Poess, Meikel [7 ]
Rabl, Tilmann [8 ,9 ]
机构
[1] Hewlett Packard Enterprise, Palo Alto, CA USA
[2] Intel Corp, Hillsboro, OR 97124 USA
[3] Actian Corp, Palo Alto, CA USA
[4] Cisco Syst Inc, San Jose, CA USA
[5] Microsoft Corp, Redmond, WA 98052 USA
[6] IBM Corp, San Jose, CA USA
[7] Oracle Corp, Redwood City, CA USA
[8] Tech Univ Berlin, Berlin, Germany
[9] DFKI GmbH, Berlin, Germany
来源
PERFORMANCE EVALUATION AND BENCHMARKING: TRADITIONAL - BIG DATA - INTERNET OF THINGS, TPCTC 2016 | 2017年 / 10080卷
关键词
SCALE;
D O I
10.1007/978-3-319-54334-5_3
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the increased adoption of Hadoop-based big data systems for the analysis of large volume and variety of data, an effective and common benchmark for big data deployments is needed. There have been a number of proposals from industry and academia to address this challenge. While most either have basic workloads (e.g. word counting), or port existing benchmarks to big data systems (e.g. TPC-H or TPC-DS), some are specifically designed for big data challenges. The most comprehensive proposal among these is the BigBench benchmark, recently standardized by the Transaction Processing Performance Council as TPCx-BB. In this paper, we discuss the progress made since the original BigBench proposal to the standardized TPCx-BB. In addition, we will share the thought process went into creating the specification, challenges in navigating the uncharted territories of a complex benchmark for a fast moving technology domain, and analyze the functionality of the benchmark suite on different Hadoop- and non-Hadoop-based big data engines. We will provide insights on the first official result of TPCx-BB and finally discuss, in brief, other relevant and fast growing big data analytic use cases to be addressed in future big data benchmarks.
引用
收藏
页码:24 / 44
页数:21
相关论文
共 50 条
  • [41] Big Data Analytics on MANET Routing Standardization using Quality Assurance Metrics
    Kush, Ashwani
    Hwang, C. Jinshong
    Dattana, Vishal
    PROCEEDINGS OF 2016 FUTURE TECHNOLOGIES CONFERENCE (FTC), 2016, : 192 - 198
  • [42] Big Biological Impacts from Big Data
    May, Mike
    SCIENCE, 2014, 344 (6189) : 1298 - 1301
  • [43] Data learning from big data
    Torrecilla, Jose L.
    Romo, Juan
    STATISTICS & PROBABILITY LETTERS, 2018, 136 : 15 - 19
  • [44] Federated learning based futuristic biomedical big-data analysis and standardization
    Fathima, Afifa Salsabil
    Basha, Syed Muzamil
    Ahmed, Syed Thouheed
    Mathivanan, Sandeep Kumar
    Rajendran, Sukumar
    Mallik, Saurav
    Zhao, Zhongming
    PLOS ONE, 2023, 18 (10):
  • [45] AI and big data standardization: Contributing to united nations sustainable development goals
    Walshe R.
    Casey K.
    Kernan J.
    Fitzpatrick D.
    Journal of ICT Standardization, 2020, 8 (02): : 77 - 106
  • [46] From Big Data to Big Artificial Intelligence?Algorithmic Challenges and Opportunities of Big Data
    Kristian Kersting
    Ulrich Meyer
    KI - Künstliche Intelligenz, 2018, 32 (1) : 3 - 8
  • [47] From Big Data to Big Information and Big Knowledge: the Case of Earth Observation Data
    Bereta, Konstantina
    Koubarakis, Manolis
    Manegold, Stefan
    Stamoulis, George
    Demir, Beguem
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 2293 - 2294
  • [48] Big Data Needs Big Dreamers: Lessons from Successful Big Data Investors
    Simoudis, Evangelos
    Gorenberg, Mark
    Guleri, Tim
    Ocko, Matt
    Sands, Greg
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 11 - 12
  • [49] From Big Data to Big Artificial Intelligence? Algorithmic Challenges and Opportunities of Big Data
    Kersting, Kristian
    Meyer, Ulrich
    KUNSTLICHE INTELLIGENZ, 2018, 32 (01): : 3 - 8
  • [50] Big data driven smart energy management: From big data to big insights
    Zhou, Kaile
    Fu, Chao
    Yang, Shanlin
    RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2016, 56 : 215 - 225