Optimizing Data-Intensive Applications Automatically By Leveraging Parallel Data Processing Frameworks

被引:5
|
作者
Ahmad, Maaz Bin Safeer [1 ]
Cheung, Alvin [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
D O I
10.1145/3035918.3056440
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this demonstration we will showcase CASPER, a novel tool that enables sequential data-intensive programs to automatically leverage the optimizations provided by parallel data processing frameworks. The goal of CASPER is to reduce the inertia against adaptation of new data process- ing frameworks particularly for non expert users by au- tomatically re-writing sequential programs written in general purpose languages to the high-level DSLs or APIs of these frameworks. Through CASPER'S browser-based interface, users can enter the source code of their Java applications and have it automatically retargeted to execute on Apache Spark. In our interactive presentation, we will use CASPER to optimize sequential implementations of data visualization programs as well as image processing kernels. The optimized Spark implementations along with the original sequential implementations will then be executed simultaneously on the cloud to allow the demo audience compare the runtime performances and outputs in real-time.
引用
收藏
页码:1675 / 1678
页数:4
相关论文
共 50 条
  • [1] Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications
    Ahmad, Maaz Bin Safeer
    Cheung, Alvin
    [J]. SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1205 - 1220
  • [2] Parallel data-intensive algorithms and applications
    Talia, D
    Srimani, PK
    [J]. PARALLEL COMPUTING, 2002, 28 (05) : 669 - 671
  • [3] Optimizing Interactive Development of Data-Intensive Applications
    Interlandi, Matteo
    Tetali, Sai Deep
    Gulzar, Muhammad Ali
    Noor, Joseph
    Condie, Tyson
    Kim, Miryung
    Millstein, Todd
    [J]. PROCEEDINGS OF THE SEVENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC 2016), 2016, : 510 - 522
  • [4] Optimizing web service composition for data-intensive applications
    [J]. 1600, Science and Engineering Research Support Society (07):
  • [5] Leveraging Parallel Data Processing Frameworks with Verified Lifting
    Ahmad, Maaz Bin Safeer
    Cheung, Alvin
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2016, (229): : 67 - 83
  • [6] Optimizing Distributed Data-Intensive Workflows
    Friese, Ryan D.
    Tallent, Nathan R.
    Schram, Malachi
    Halappanavar, Mahantesh
    Barker, Kevin J.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2018, : 279 - 289
  • [7] Implementing scalable parallel search algorithms for data-intensive applications
    Ladányi, L
    Ralphs, TK
    Saltzman, MJ
    [J]. COMPUTATIONAL SCIENCE-ICCS 2002, PT I, PROCEEDINGS, 2002, 2329 : 592 - 602
  • [8] Leveraging Endpoint Flexibility in Data-Intensive Clusters
    Chowdhury, Mosharaf
    Kandula, Srikanth
    Stoica, Ion
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) : 231 - 242
  • [9] Applications in Data-Intensive Computing
    Shah, Anuj R.
    Adkins, Joshua N.
    Baxter, Douglas J.
    Cannon, William R.
    Chavarria-Miranda, Daniel G.
    Choudhury, Sutanay
    Gorton, Ian
    Gracio, Deborah K.
    Halter, Todd D.
    Jaitly, Navdeep D.
    Johnson, John R.
    Kouzes, Richard T.
    Macduff, Matthew C.
    Marquez, Andres
    Monroe, Matthew E.
    Oehmen, Christopher S.
    Pike, William A.
    Scherrer, Chad
    Villa, Oreste
    Webb-Robertson, Bobbie-Jo
    Whitney, Paul D.
    Zuljevic, Nino
    [J]. ADVANCES IN COMPUTERS, VOL 79, 2010, 79 : 1 - 70
  • [10] Metacomputing and data-intensive applications
    Messina, P
    [J]. WORLDWIDE COMPUTING AND ITS APPLICATIONS, 1997, 1274 : 226 - 236