Optimizing Data-Intensive Applications Automatically By Leveraging Parallel Data Processing Frameworks

被引:5
|
作者
Ahmad, Maaz Bin Safeer [1 ]
Cheung, Alvin [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
D O I
10.1145/3035918.3056440
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this demonstration we will showcase CASPER, a novel tool that enables sequential data-intensive programs to automatically leverage the optimizations provided by parallel data processing frameworks. The goal of CASPER is to reduce the inertia against adaptation of new data process- ing frameworks particularly for non expert users by au- tomatically re-writing sequential programs written in general purpose languages to the high-level DSLs or APIs of these frameworks. Through CASPER'S browser-based interface, users can enter the source code of their Java applications and have it automatically retargeted to execute on Apache Spark. In our interactive presentation, we will use CASPER to optimize sequential implementations of data visualization programs as well as image processing kernels. The optimized Spark implementations along with the original sequential implementations will then be executed simultaneously on the cloud to allow the demo audience compare the runtime performances and outputs in real-time.
引用
下载
收藏
页码:1675 / 1678
页数:4
相关论文
共 50 条
  • [21] Integrating Data-Intensive Computing Systems with Biological Data Analysis Frameworks
    Pedersen, Edvard
    Raknes, Inge Alexander
    Ernstsen, Martin
    Bongo, Lars Ailo
    23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 733 - 740
  • [22] AnyOLAP: Analytical Processing of Arbitrary Data-Intensive Applications without ETL
    Schuhknecht, Felix
    Priesterroth, Aaron
    Henneberg, Justus
    Salkhordeh, Reza
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (12): : 2823 - 2826
  • [23] Parallel Framework for Data-Intensive Computing with XSEDE
    Subramanian, Ranjini
    Zhang, Hui
    PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,
  • [24] Performance Implications of Processing-in-Memory Designs on Data-Intensive Applications
    Wang, Borui
    Torres, Martin
    Li, Dong
    Zhao, Jishen
    Rusu, Florin
    PROCEEDINGS OF 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW 2016), 2016, : 115 - 122
  • [25] Parallel Optimization for Data-Intensive Service Composition
    Deng, Shuiguang
    Huang, Longtao
    Wu, Bin
    Xiong, Lirong
    JOURNAL OF INTERNET TECHNOLOGY, 2013, 14 (05): : 817 - 824
  • [26] Optimizing VM allocation and data placement for data-intensive applications in cloud using ACO metaheuristic algorithm
    Shabeera, T. P.
    Kumar, S. D. Madhu
    Salam, Sameera M.
    Krishnan, K. Murali
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2017, 20 (02): : 616 - 628
  • [27] Space-and-Time Efficient Parallel Garbage Collector for Data-Intensive Applications
    Liu, Shaoshan
    Wang, Ligang
    Li, Xiao-Feng
    Gaudiot, Jean-Luc
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2011, 39 (04) : 451 - 472
  • [28] Space-and-Time Efficient Parallel Garbage Collector for Data-Intensive Applications
    Shaoshan Liu
    Ligang Wang
    Xiao-Feng Li
    Jean-Luc Gaudiot
    International Journal of Parallel Programming, 2011, 39 : 451 - 472
  • [29] SunwayMR: A distributed parallel computing framework with convenient data-intensive applications programming
    Wu, Renke
    Huang, Linpeng
    Yu, Peng
    Zhou, Haojie
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 71 : 43 - 56
  • [30] A Framework for Data Partitioning for C++ Data-Intensive Applications
    A. Milidonis
    G. Dimitroulakos
    M. D. Galanis
    A. P. Kakarountas
    G. Theodoridis
    C. Goutis
    F. Catthoor
    Design Automation for Embedded Systems, 2004, 9 : 101 - 121