Proxy Benchmarks for Emerging Big-data Workloads

被引:10
|
作者
Panda, Reena [1 ]
John, Lizy Kurian [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/PACT.2017.44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Early design-space evaluation of computer-systems is usually performed using performance models such as detailed simulators, RTL-based models etc. Unfortunately, it is very challenging (often impossible) to run many emerging applications on detailed performance models owing to their complex application software-stacks, significantly long run times, system dependencies and the limited speed/potential of early performance models. To overcome these challenges in benchmarking complex, long-running database applications, we propose a fast and efficient proxy generation methodology, PerfProx that can generate miniature proxy benchmarks, which are representative of the performance of real-world database applications and yet, converge to results quickly and do not need any complex software-stack support. Past research on proxy generation utilizes detailed micro-architecture independent metrics derived from detailed functional simulators, which are often difficult to generate for many emerging applications. PerfProx enables fast and efficient proxy generation using performance metrics derived primarily from hardware performance counters. We evaluate the proposed proxy generation approach on three modern, real-world SQL and NoSQL databases, Cassandra, MongoDB and MySQL running both the data-serving and data-analytics class of applications on different hardware platforms and cache/TLB configurations. The proxy benchmarks mimic the performance (IPC) of the original database applications with similar to 94.2% (avg) accuracy. We further demonstrate that the proxies mimic original application performance across several other key metrics, while significantly reducing the instruction counts.
引用
收藏
页码:105 / 116
页数:12
相关论文
共 50 条
  • [1] Proxy Benchmarks for Emerging Big-data Workloads
    Panda, Reena
    John, Lizy Kurian
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2017, : 139 - 140
  • [2] Data Motif-based Proxy Benchmarks for Big Data and AI Workloads
    Gao, Wanling
    Zhan, Jianfeng
    Wang, Lei
    Luo, Chunjie
    Jia, Zhen
    Zheng, Daoyi
    Zheng, Chen
    He, Xiwen
    Ye, Hainan
    Wang, Haibin
    Ren, Rui
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2018, : 48 - 58
  • [3] A Linear Combination-Based Method to Construct Proxy Benchmarks for Big Data Workloads
    Yang, Yikang
    Wang, Lei
    Zhan, Jianfeng
    [J]. BENCHMARKING, MEASURING, AND OPTIMIZING, BENCH 2023, 2024, 14521 : 120 - 136
  • [4] Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads
    Mehta, Parmita
    Dorkenwald, Sven
    Zhao, Dongfang
    Kaftan, Tomer
    Cheung, Alvin
    Balazinska, Magdalena
    Rokem, Ariel
    Connolly, Andrew
    Vanderplas, Jacob
    AlSayyad, Yusra
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (11): : 1226 - 1237
  • [5] Quantifying the Performance Impact of Large Pages on In-Memory Big-Data Workloads
    Park, Jinsu
    Han, Myeonggyun
    Baek, Woongki
    [J]. PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION, 2016, : 209 - 218
  • [6] Evolutionary Scheduling of Dynamic Multitasking Workloads for Big-Data Analytics in Elastic Cloud
    Zhang, Fan
    Cao, Junwei
    Tan, Wei
    Khan, Samee U.
    Li, Keqin
    Zomaya, Albert Y.
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (03) : 338 - 351
  • [7] Characterizing the impact of last-level cache replacement policies on big-data workloads
    Jamet, Alexandre Valentin
    Alvarez, Lluc
    Jimenez, Daniel A.
    Casas, Marc
    [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2020), 2020, : 134 - 144
  • [8] Big-Data Visualization
    Keim, Daniel
    Qu, Huamin
    Ma, Kwan-Liu
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2013, 33 (04) : 20 - 21
  • [9] Proactive manufacturing-a big-data driven emerging manufacturing paradigm
    Yao, Xifan
    Zhou, Jiajun
    Zhang, Cunjie
    Liu, Min
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2017, 23 (01): : 172 - 185
  • [10] CAMP: Accurate Modeling of Core and Memory Locality for Proxy Generation of Big-data Applications
    Panda, Reena
    Zheng, Xinnian
    Gerstlauer, Andreas
    John, Lizy Kurian
    [J]. PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 337 - 342