Discussion of BigBench: A Proposed Industry Standard Performance Benchmark for Big Data

被引:12
|
作者
Baru, Chaitanya [11 ]
Bhandarkar, Milind [10 ]
Curino, Carlo [7 ]
Danisch, Manuel [1 ]
Frank, Michael [1 ]
Gowda, Bhaskar [6 ]
Jacobsen, Hans-Arno [8 ]
Jie, Huang [6 ]
Kumar, Dileep [3 ]
Nambiar, Raghunath [2 ]
Poess, Meikel [9 ]
Raab, Francois [5 ]
Rabl, Tilmann [1 ]
Ravi, Nishkam [3 ]
Sachs, Kai [12 ]
Sen, Saptak [4 ]
Yi, Lan [6 ]
Youn, Choonhan [11 ]
机构
[1] Bankmark, Passau, Germany
[2] Cisco Syst, San Jose, CA USA
[3] Cloudera, Palo Alto, CA USA
[4] Hortonworks, Santa Clara, CA USA
[5] Infosizing, Manitou Springs, CO USA
[6] Intel Corp, Santa Clara, CA USA
[7] Microsoft Corp, Redmond, WA 98052 USA
[8] Middleware Syst Res Grp, Toronto, ON, Canada
[9] Oracle Corp, Redwood City, CA USA
[10] Pivotal, Vancouver, BC, Canada
[11] San Diego Supercomp Ctr, La Jolla, CA USA
[12] SPEC Res Grp, Gainesville, FL USA
来源
PERFORMANCE CHARACTERIZATION AND BENCHMARKING: TRADITIONAL TO BIG DATA | 2015年 / 8904卷
关键词
D O I
10.1007/978-3-319-15350-6_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enterprises perceive a huge opportunity in mining information that can be found in big data. New storage systems and processing paradigms are allowing for ever larger data sets to be collected and analyzed. The high demand for data analytics and rapid development in technologies has led to a sizable ecosystem of big data processing systems. However, the lack of established, standardized benchmarks makes it difficult for users to choose the appropriate systems that suit their requirements. To address this problem, we have developed the BigBench benchmark specification. BigBench is the first end-to-end big data analytics benchmark suite. In this paper, we present the BigBench benchmark and analyze the workload from technical as well as business point of view. We characterize the queries in the workload along different dimensions, according to their functional characteristics, and also analyze their runtime behavior. Finally, we evaluate the suitability and relevance of the workload from the point of view of enterprise applications, and discuss potential extensions to the proposed specification in order to cover typical big data processing use cases.
引用
收藏
页码:44 / 63
页数:20
相关论文
共 50 条
  • [31] Benchmark Tests on Improved Merge for Big Data Processing
    Marszalek, Zbigniew
    Wozniak, Marcin
    Borowik, Grzegorz
    Wazirali, Raniyah
    Napoli, Christian
    Pappalardo, Giuseppe
    Tramontana, Emiliano
    2015 ASIA-PACIFIC CONFERENCE ON COMPUTER-AIDED SYSTEM ENGINEERING - APCASE 2015, 2015, : 96 - 101
  • [32] Construction industry benchmark of key performance indicators
    Akintoye, A
    Chinyio, E
    CONSTRUCTION INNOVATION AND GLOBAL COMPETITIVENESS, VOLS 1 AND 2: THE ORGANIZATION AND MANAGEMENT OF CONSTRUCTION, 2003, : 1077 - 1091
  • [33] A Standard for Benchmarking Big Data Systems
    Nambiar, Raghunath
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014,
  • [34] PageRank Pipeline Benchmark: Proposal for a Holistic System Benchmark for Big-Data Platforms
    Dreher, Patrick
    Byun, Chansup
    Hill, Chris
    Gadepally, Vijay
    Kuszmaul, Bradley
    Kepner, Jeremy
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 929 - 937
  • [35] Big Data: a big opportunity for the petroleum and petrochemical industry
    Hassani, Hossein
    Silva, Emmanuel Sirimal
    OPEC ENERGY REVIEW, 2018, 42 (01) : 74 - 89
  • [36] A proposed polarity standard for multicomponent seismic data
    Brown, RJ
    Stewart, RR
    Lawton, DC
    GEOPHYSICS, 2002, 67 (04) : 1028 - 1037
  • [37] A PROPOSED STANDARD FOR ARCHIVING AND SHARING STRATIGRAPHIC DATA
    Costa, Stefano
    ARCHEOLOGIA E CALCOLATORI, 2019, 30 : 459 - 462
  • [38] PROPOSED STANDARD FOR IMAGE CYTOMETRY DATA FILES
    DEAN, P
    MASCIO, L
    OW, D
    SUDAR, D
    MULLIKIN, J
    CYTOMETRY, 1990, 11 (05): : 561 - 569
  • [39] Construction and Application of Data Standard in Big Data Environment
    Jia, Haitian
    Jia, Chun
    BDE 2019: 2019 INTERNATIONAL CONFERENCE ON BIG DATA ENGINEERING, 2019, : 121 - 124
  • [40] Proposed revision to an industry standard for sampling and preparing wood for analysis
    BICKING CA
    1971, : 47 - 49