Distributed Data-Parallel Computing Using a High-Level Programming Language

被引:0
|
作者
Isard, Michael
Yu, Yuan
机构
关键词
Distributed programming; cloud computing; concurrency;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Dryad and DryadLINQ systems offer a new programming model for large scale data-parallel computing. They generalize previous execution environments such as SQL and Map Reduce in three ways: by providing a general-purpose distributed execution engine for data-parallel applications; by adopting an expressive data model of strongly typed .NET objects; and by supporting general-purpose imperative and declarative operations on datasets within a traditional high-level programming language. A DryadLINQ program is a sequential program composed of LINQ expressions performing arbitrary side-effect-free operations on datasets, and can be written and debugged using standard .NET development tools. The DryadLINQ system automatically and transparently translates the data-parallel portions of the program into a distributed execution plan which is passed to the Dryad execution platform. Dryad, which has been in continuous operation for several years on production clusters made up of thousands of computers, ensures efficient, reliable execution of this plan on a large compute cluster. This paper describes the programming model, provides a high-level overview of the design and implementation of the Dryad and DryadLINQ systems, and discusses the tradeoffs and connections to parallel and distributed databases.
引用
收藏
页码:987 / 994
页数:8
相关论文
共 50 条
  • [1] GLU: A high-level system for granular data-parallel programming
    Jagannathan, R
    Dodd, C
    Agi, I
    [J]. CONCURRENCY-PRACTICE AND EXPERIENCE, 1997, 9 (01): : 63 - 83
  • [2] FastPara: a high-level declarative data-parallel programming framework on clusters
    Mao, Yong
    Gu, Yunhong
    Chen, Jia
    Grossman, Robert L.
    [J]. PROCEEDINGS OF THE 18TH IASTED INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING AND SYSTEMS, 2006, : 321 - +
  • [3] Parallel quantum chemistry computation in a high-level data-parallel programming environment PROMISE
    Jiang, ZY
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 2749 - 2755
  • [4] HIGH-LEVEL PROGRAMMING FOR DISTRIBUTED COMPUTING
    FELDMAN, JA
    [J]. COMMUNICATIONS OF THE ACM, 1979, 22 (06) : 353 - 368
  • [5] High-level parallel computing language
    Zhou, JF
    Yang, Y
    Su, Y
    [J]. OPTIMIZING SCIENTIFIC RETURN FOR ASTRONOMY THROUGH INFORMATION TECHNOLOGIES, 2004, 5493 : 530 - 537
  • [6] TRANSFORMING HIGH-LEVEL DATA-PARALLEL PROGRAMS INTO VECTOR OPERATIONS
    PRINS, JF
    PALMER, DW
    [J]. SIGPLAN NOTICES, 1993, 28 (07): : 119 - 128
  • [7] High-level data parallel programming in promoter
    Besch, M
    Bi, H
    Enskonatus, P
    Heber, G
    Wilhelmi, M
    [J]. SECOND INTERNATIONAL WORKSHOP ON HIGH-LEVEL PARALLEL PROGRAMMING MODELS AND SUPPORTIVE ENVIRONMENTS, PROCEEDINGS, 1997, : 47 - 54
  • [8] NIL: A high-level language for distributed systems programming
    Parr, FN
    Strom, RE
    [J]. IBM SYSTEMS JOURNAL, 1999, 38 (2-3) : 181 - 197
  • [9] NIL - A HIGH-LEVEL LANGUAGE FOR DISTRIBUTED SYSTEMS PROGRAMMING
    PARR, FN
    STROM, RE
    [J]. IBM SYSTEMS JOURNAL, 1983, 22 (1-2) : 111 - 127
  • [10] A high-level symbolic language for distributed Web programming
    Visser, CT
    Eliëns, A
    [J]. IC'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET COMPUTING, 2000, : 93 - 97