WHIZ: Data-Driven Analytics Execution

Authors
Grandl, Robert [1]
Singhvi, Arjun [2]
Viswanathan, Raajay [3]
Akella, Aditya [2]
Affiliations
[1] Google, Mountain View, CA 94043 USA
[2] Univ Wisconsin, Madison, WI USA
[3] Uber Technol Inc, San Francisco, CA USA
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Today's data analytics frameworks are compute-centric, with analytics execution almost entirely dependent on the predetermined physical structure of the high-level computation. Relegating intermediate data to a second-class entity in this manner hurts flexibility, performance, and efficiency. We present WHIZ, a new analytics execution framework that cleanly separates computation from intermediate data. This enables runtime visibility into intermediate data via programmable monitoring, and data-driven computation where data properties drive when/what computation runs. Experiments with a WHIZ prototype on a 50-node cluster using batch, streaming, and graph analytics workloads show that it improves analytics completion times 1.3-2x and cluster efficiency 1.4x compared to state-of-the-art.
Pages: 407-424
Page count: 18