Optimizing Data-Intensive Applications Automatically By Leveraging Parallel Data Processing Frameworks

被引：5

作者：

Ahmad, Maaz Bin Safeer ^{[1
]}

Cheung, Alvin ^{[1
]}

机构：

[1] Univ Washington, Seattle, WA 98195 USA

来源：

SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2017年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1145/3035918.3056440

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this demonstration we will showcase CASPER, a novel tool that enables sequential data-intensive programs to automatically leverage the optimizations provided by parallel data processing frameworks. The goal of CASPER is to reduce the inertia against adaptation of new data process- ing frameworks particularly for non expert users by au- tomatically re-writing sequential programs written in general purpose languages to the high-level DSLs or APIs of these frameworks. Through CASPER'S browser-based interface, users can enter the source code of their Java applications and have it automatically retargeted to execute on Apache Spark. In our interactive presentation, we will use CASPER to optimize sequential implementations of data visualization programs as well as image processing kernels. The optimized Spark implementations along with the original sequential implementations will then be executed simultaneously on the cloud to allow the demo audience compare the runtime performances and outputs in real-time.

引用

页码：1675 / 1678

页数：4

共 50 条

[1] Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications
Ahmad, Maaz Bin Safeer
Cheung, Alvin
[J]. SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1205 - 1220
[2] Parallel data-intensive algorithms and applications
Talia, D
Srimani, PK
[J]. PARALLEL COMPUTING, 2002, 28 (05) : 669 - 671
[3] Optimizing Interactive Development of Data-Intensive Applications
Interlandi, Matteo
Tetali, Sai Deep
Gulzar, Muhammad Ali
Noor, Joseph
Condie, Tyson
Kim, Miryung
Millstein, Todd
[J]. PROCEEDINGS OF THE SEVENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC 2016), 2016, : 510 - 522
[4] Optimizing web service composition for data-intensive applications
[J]. 1600, Science and Engineering Research Support Society (07):
[5] Leveraging Parallel Data Processing Frameworks with Verified Lifting
Ahmad, Maaz Bin Safeer
Cheung, Alvin
[J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2016, (229): : 67 - 83
[6] Optimizing Distributed Data-Intensive Workflows
Friese, Ryan D.
Tallent, Nathan R.
Schram, Malachi
Halappanavar, Mahantesh
Barker, Kevin J.
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2018, : 279 - 289
[7] Implementing scalable parallel search algorithms for data-intensive applications
Ladányi, L
Ralphs, TK
Saltzman, MJ
[J]. COMPUTATIONAL SCIENCE-ICCS 2002, PT I, PROCEEDINGS, 2002, 2329 : 592 - 602
[8] Leveraging Endpoint Flexibility in Data-Intensive Clusters
Chowdhury, Mosharaf
Kandula, Srikanth
Stoica, Ion
[J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) : 231 - 242
[9] Applications in Data-Intensive Computing
Shah, Anuj R.
Adkins, Joshua N.
Baxter, Douglas J.
Cannon, William R.
Chavarria-Miranda, Daniel G.
Choudhury, Sutanay
Gorton, Ian
Gracio, Deborah K.
Halter, Todd D.
Jaitly, Navdeep D.
Johnson, John R.
Kouzes, Richard T.
Macduff, Matthew C.
Marquez, Andres
Monroe, Matthew E.
Oehmen, Christopher S.
Pike, William A.
Scherrer, Chad
Villa, Oreste
Webb-Robertson, Bobbie-Jo
Whitney, Paul D.
Zuljevic, Nino
[J]. ADVANCES IN COMPUTERS, VOL 79, 2010, 79 : 1 - 70
[10] Metacomputing and data-intensive applications
Messina, P
[J]. WORLDWIDE COMPUTING AND ITS APPLICATIONS, 1997, 1274 : 226 - 236

← 1 2 3 4 5 →