Data Parallelism for Distributed Streaming Applications

被引:0
|
作者
Shinde, Bhagyashali [1 ]
Singh, S. T. [1 ]
机构
[1] Savitribai Phule Pune Univ, Dept Comp Engn, PK Tech Campus, Pune, Maharashtra, India
关键词
Data Processing; Distributed Computing; Parallel Programming;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Streaming applications can analyze vast data streams and requires both high throughput and low latency. They are comprised of operator graphs which produce and consume data tuples where operators are stateful, selective and user-defined. The streaming programming model logically exposes task and pipeline parallelism, enabling it to develop parallel systems. Naturally it doesnot expose data parallelism, which must be extracted from streaming applications. This paper presents a compiler and runtime system that automatically extract data parallelism for distributed stream processing. Our approach is safety guarantee in presence of stateful, selective and user-defined operators. Data parallelization is secure if the sequential semantics of the applications are preserved, also the compiler ensures safety by considering dependencies on other operators in the graph and selectivity, state, partitioning of operator. The distributed runtime system ensures that tuples always exit parallel regions in the same order they would without data parallelism, using the most efficient strategy as identified by the compiler.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Streaming Nested Data Parallelism on Multicores
    Madsen, Frederik M.
    Filinski, Andrzej
    [J]. FHPC'16: PROCEEDINGS OF THE 5TH INTERNATIONAL WORKSHOP ON FUNCTIONAL HIGH-PERFORMANCE COMPUTING, 2016, : 44 - 51
  • [2] Safe Data Parallelism for General Streaming
    Schneider, Scott
    Hirzel, Martin
    Gedik, Bugra
    Wu, Kun-Lung
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (02) : 504 - 517
  • [3] Exploiting Task- and Data-Level Parallelism in Streaming Applications Implemented in FPGAs
    Plavec, Franjo
    Vranesic, Zvonko
    Brown, Stephen
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2013, 6 (04)
  • [4] An Edge-Focused Model for Distributed Streaming Data Applications
    Bumgardner, V. K. Cody
    Hickey, Caylin
    Marek, Victor
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2018,
  • [5] InContext: Simple Parallelism for Distributed Applications
    Yoo, Sunghwan
    Lee, Hyojeong
    Killian, Charles
    Kulkarni, Milind
    [J]. HPDC 11: PROCEEDINGS OF THE 20TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2011, : 97 - 108
  • [6] Adding Data Parallelism to Streaming Pipelines for Throughput Optimization
    Li, Peng
    Agrawal, Kunal
    Buhler, Jeremy
    Chamberlain, Roger D.
    [J]. 2013 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2013, : 20 - 29
  • [7] Streaming Task Parallelism
    Cohen, Albert
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS'15), 2015, : 1 - 1
  • [8] QoS-aware Deployment of Data Streaming Applications over Distributed Infrastructures
    Nardelli, Matteo
    [J]. 2016 39TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2016, : 736 - 741
  • [9] Using Hardware Parallelism for Reducing Power Consumption in Video Streaming Applications
    Ali, Karim M. A.
    Ben Atitallah, Rabie
    Fakhfakh, Nizar
    Dekeyser, Jean-Luc
    [J]. 2015 10TH INTERNATIONAL SYMPOSIUM ON RECONFIGURABLE COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC), 2015,
  • [10] Massive -scale Streaming Analytics: Models, Parallelism, & Real -world Applications
    Bader, David A.
    [J]. SPAA'18: PROCEEDINGS OF THE 30TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2018, : 193 - 193