SCnC: Efficient Unification of Streaming with Dynamic Task Parallelism

被引:0
|
作者
Dragoş Sbîrlea
Jun Shirako
Ryan Newton
Vivek Sarkar
机构
[1] Rice University,
[2] Indiana University,undefined
关键词
Streaming; Task parallelism; Dynamic parallelism ; Dataflow;
D O I
暂无
中图分类号
学科分类号
摘要
Stream processing is a special form of the dataflow execution model that offers extensive opportunities for optimization and automatic parallelization. To take full advantage of the paradigm programmers are typically required to learn a new language and re-implement their applications. This work shows that it is possible to exploit streaming as a safe and automatic optimization of a more general dataflow-based model—one in which computation kernels are written in standard, general-purpose languages and organized as a coordination graph. We propose streaming concurrent collections (SCnC), a streaming system that can efficiently run a subset of programs supported by concurrent collections (CnC). CnC is a general purpose parallel programming paradigm that integrates task parallelism and dataflow computing. The proposed streaming support allows application developers to reason about their program as a general dataflow graph, while benefiting from the performance and tight memory footprint of stream parallelism when their program satisfies streaming constraints. In this paper, we formally define the application requirements for using SCnC, and outline a static decision procedure for identifying and processing eligible SCnC subgraphs. We present initial results showing that transitioning from general CnC to SCnC leads to a throughput increase of up to 40×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} for certain benchmarks, and also enables programs with large data sizes to execute in available memory for cases where CnC execution may run out of memory.
引用
收藏
页码:233 / 256
页数:23
相关论文
共 50 条
  • [1] SCnC: Efficient Unification of Streaming with Dynamic Task Parallelism
    Sbirlea, Dragos
    Shirako, Jun
    Newton, Ryan
    Sarkar, Vivek
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2016, 44 (02) : 233 - 256
  • [2] Streaming Task Parallelism
    Cohen, Albert
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS'15), 2015, : 1 - 1
  • [3] Execution-time prediction for dynamic streaming applications with task-level parallelism
    Poplavko, Peter
    Basten, Twan
    van Meerbergen, Jef
    [J]. DSD 2007: 10TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN ARCHITECTURES, METHODS AND TOOLS, PROCEEDINGS, 2007, : 228 - +
  • [4] Energy-Aware-Task-Parallelism for Efficient Dynamic Voltage, and Frequency Scaling, in CGRAs
    Jafri, Syed. M. A. H.
    Tajammul, Muhammad Adeel
    Hemani, Ahmed
    Paul, Kolin
    Plosila, Juha
    Tenhunen, Hannu
    [J]. 2013 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING AND SIMULATION (IC-SAMOS), 2013, : 104 - 112
  • [5] A Process Network Model for Reactive Streaming Software with Deterministic Task Parallelism
    Gioulekas, Fotios
    Poplavko, Peter
    Katsaros, Panagiotis
    Bensalem, Saddek
    Palomo, Pedro
    [J]. FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING (FASE 2018), 2018, 10802 : 94 - 110
  • [6] System-Enforced Deterministic Streaming for Efficient Pipeline Parallelism
    Yu Zhang
    Zhao-Peng Li
    Hui-Fang Cao
    [J]. Journal of Computer Science and Technology, 2015, 30 : 57 - 73
  • [7] System-Enforced Deterministic Streaming for Efficient Pipeline Parallelism
    Zhang, Yu
    Li, Zhao-Peng
    Cao, Hui-Fang
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (01) : 57 - 73
  • [8] Efficient GPU Computation Using Task Graph Parallelism
    Lin, Dian-Lun
    Huang, Tsung-Wei
    [J]. EURO-PAR 2021: PARALLEL PROCESSING, 2021, 12820 : 435 - 450
  • [9] Dynamic Determinacy Race Detection for Task Parallelism with Futures
    Surendran, Rishi
    Sarkar, Vivek
    [J]. RUNTIME VERIFICATION, (RV 2016), 2016, 10012 : 368 - 385
  • [10] Efficient dynamic parallelism with OpenMP on Linux SMPs
    Antonopoulos, CD
    Venetis, IE
    Nikolopoulos, DS
    Papatheodorou, TS
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 2507 - 2513