WHIZ: Data-Driven Analytics Execution

被引:0
|
作者
Grandl, Robert [1 ]
Singhvi, Arjun [2 ]
Viswanathan, Raajay [3 ]
Akella, Aditya [2 ]
机构
[1] Google, Mountain View, CA 94043 USA
[2] Univ Wisconsin, Madison, WI USA
[3] Uber Technol Inc, San Francisco, CA USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today's data analytics frameworks are computecentric, with analytics execution almost entirely dependent on the predetermined physical structure of the high-level computation. Relegating intermediate data to a second class entity in this manner hurts flexibility, performance, and efficiency. We present Wifiz, a new analytics execution framework that cleanly separates computation from intermediate data. This enables runtime visibility into intermediate data via programmable monitoring, and data-driven computation where data properties drive when/what computation runs. Experiments with a Wmz prototype on a 50-node cluster using batch, streaming, and graph analytics workloads show that it improves analytics completion times 1.3-2 x and cluster efficiency 1.4x compared to state-of-the-art.
引用
收藏
页码:407 / 424
页数:18
相关论文
共 50 条
  • [41] CONVERSION FROM DATA-DRIVEN TO SYNCHRONOUS EXECUTION IN LOOP PROGRAMS
    CUNY, JE
    SNYDER, L
    [J]. ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1987, 9 (04): : 599 - 617
  • [42] Data-Driven Workflow Execution in Service Oriented IoT Architectures
    Varga, Pal
    Kozma, Daniel
    Hegedus, Csaba
    [J]. 2018 IEEE 23RD INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2018, : 203 - 210
  • [43] Introduction to People Analytics: A Practical Guide to Data-Driven HR
    Belwalkar, Bharati B.
    Khan, Nadeem
    Millner, Dave
    [J]. PERSONNEL PSYCHOLOGY, 2024, 77 (01) : 284 - 286
  • [44] Introduction to the Special Issue on Petrophysical Data-Driven Analytics (PDDA)
    Torres-Verdin, Carlos
    Prcnsky, Stephen
    [J]. PETROPHYSICS, 2018, 59 (06): : 748 - 749
  • [45] A Data-Driven Platform for the Coordination of Independent Visual Analytics Tools
    Nonnemann, Lars
    Hogräfer, Marius
    Röhlig, Martin
    Schumann, Heidrun
    Urban, Bodo
    Schulz, Hans-Jörg
    [J]. Computers and Graphics (Pergamon), 2022, 106 : 152 - 160
  • [46] Enhancing transparency in public procurement: A data-driven analytics approach
    Felizzola, Heriberto
    Gomez, Camilo
    Arrieta, Nicolas
    Jerez, Vianey
    Erazo, Yilber
    Camacho, Geraldine
    [J]. INFORMATION SYSTEMS, 2024, 125
  • [47] The data-driven analytics for investigating cargo loss in logistics systems
    Wu, Pei-Ju
    Chen, Mu-Chen
    Tsau, Chih-Kai
    [J]. INTERNATIONAL JOURNAL OF PHYSICAL DISTRIBUTION & LOGISTICS MANAGEMENT, 2017, 47 (01) : 68 - 83
  • [48] Theta Architecture: Preserving the Quality of Analytics in Data-Driven Systems
    Theodorou, Vasileios
    Gerostathopoulos, Ilias
    Amini, Sasan
    Scandariato, Riccardo
    Prehofer, Christian
    Staron, Miroslaw
    [J]. NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2017, 2017, 767 : 186 - 198
  • [49] DATA-DRIVEN MODELING OF ENGAGEMENT ANALYTICS FOR QUALITY BLENDED LEARNING
    Yang, Nan
    Ghislandi, Patrizia
    Raffaghelli, Juliana
    Ritella, Giuseppe
    [J]. JOURNAL OF E-LEARNING AND KNOWLEDGE SOCIETY, 2019, 15 (03): : 211 - 225
  • [50] Log Analytics in HPC: A Data-driven Reinforcement Learning Framework
    Luo, Zhengping
    Hou, Tao
    Nguyen, Tung Thanh
    Zeng, Hui
    Lu, Zhuo
    [J]. IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 550 - 555