RHEEM: Enabling Cross-Platform Data Processing

被引:28
|
作者
Agrawal, Divy [2 ]
Chawla, Sanjay [1 ]
Contreras-Rojas, Bertty [1 ]
Elmagarmid, Ahmed [1 ]
Idris, Yasser [1 ]
Kaoudi, Zoi [1 ]
Kruse, Sebastian [3 ]
Lucas, Ji [1 ]
Mansour, Essam [1 ]
Ouzzani, Mourad [1 ]
Papotti, Paolo [1 ,4 ]
Quiane-Ruiz, Jorge-Arnulfo [1 ]
Tang, Nan [1 ]
Thirumuruganathan, Saravanan [1 ]
Troudi, Anis [1 ]
机构
[1] HBKU, Qatar Comp Res Inst, Doha, Qatar
[2] UCSB, Santa Barbara, CA 93106 USA
[3] Hasso Plattner Inst, Potsdam, Germany
[4] Eurecom, Biot, France
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2018年 / 11卷 / 11期
关键词
EFFICIENT;
D O I
10.14778/3236187.3236195
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Solving business problems increasingly requires going beyond the limits of a single data processing platform (platform for short), such as Hadoop or a DBMS. As a result, organizations typically perform tedious and costly tasks to juggle their code and data across different platforms. Addressing this pain and achieving automatic cross-platform data processing is quite challenging: finding the most efficient platform for a given task requires quite good expertise for all the available platforms. We present RHEEM, a general-purpose cross-platform data processing system that decouples applications from the underlying platforms. It not only determines the best platform to run an incoming task, but also splits the task into subtasks and assigns each subtask to a specific platform to minimize the overall cost (e.g., runtime or monetary cost). It features (i) an interface to easily compose data analytic tasks; (ii) a novel cost-based optimizer able to find the most efficient platform in almost all cases; and (iii) an executor to efficiently orchestrate tasks over different platforms. As a result, it allows users to focus on the business logic of their applications rather than on the mechanics of how to compose and execute them. Using different real-world applications with RHEEM, we demonstrate how cross-platform data processing can accelerate performance by more than one order of magnitude compared to single-platform data processing.
引用
收藏
页码:1414 / 1427
页数:14
相关论文
共 50 条
  • [41] WebArrayDB: cross-platform microarray data analysis and public data repository
    Xia, Xiao-Qin
    McClelland, Michael
    Porwollik, Steffen
    Song, Wenzhi
    Cong, Xianling
    Wang, Yipeng
    BIOINFORMATICS, 2009, 25 (18) : 2425 - 2429
  • [42] The wxWindows cross-platform framework - A C++ framework for building cross-platform applications
    Zeitlin, V
    DR DOBBS JOURNAL, 2001, 26 (05): : 106 - +
  • [43] CLIC: An Extensible and Efficient Cross-Platform Data Analytics System
    Chen, Qixiang
    Chen, Zhijun
    Zhang, Kai
    Wang, X. Sean
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (01) : 34 - 45
  • [44] Cross-Platform Pathway Activity Transformation and Classification of Microarray Data
    Engchuan, Worrawat
    Meechai, Asawin
    Tongsima, Sissades
    Chan, Jonathan H.
    COMPUTATIONAL INTELLIGENCE IN INFORMATION SYSTEMS, 2015, 331 : 139 - 148
  • [45] Research of the Reconfigurable Cross-Platform Computing System for the Digital Signal Processing
    Martyshkin, A., I
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2019, 12 (05): : 376 - 385
  • [46] Integrative disease classification based on cross-platform microarray data
    Liu, Chun-Chi
    Hu, Jianjun
    Kalakrishnan, Mrinal
    Huang, Haiyan
    Zhou, Xianghong Jasmine
    BMC BIOINFORMATICS, 2009, 10
  • [47] Disparity-preserved Deep Cross-platform Association for Cross-platform Video Recommendation
    Yu, Shengze
    Wang, Xin
    Zhu, Wenwu
    Cui, Peng
    Wang, Jingdong
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4635 - 4641
  • [48] InCroMAP: integrated analysis of cross-platform microarray and pathway data
    Wrzodek, Clemens
    Eichner, Johannes
    Buchel, Finja
    Zell, Andreas
    BIOINFORMATICS, 2013, 29 (04) : 506 - 508
  • [49] A CROSS-PLATFORM BINARY DIFF
    COPPIETERS, K
    DR DOBBS JOURNAL, 1995, 20 (05): : 32 - &
  • [50] CROSS-PLATFORM COMMUNICATION CLASSES
    LAM, RB
    DR DOBBS JOURNAL, 1995, 20 (03): : 20 - &