An Asymmetric Distributed Shared Memory Model for Heterogeneous Parallel Systems

被引:52
|
作者
Gelado, Isaac [1 ]
Cabezas, Javier [1 ]
Navarro, Nacho [1 ]
Stone, John E. [2 ]
Patel, Sanjay [2 ]
Hwu, Wen-mei W. [2 ]
机构
[1] Univ Politecn Cataluna, E-08028 Barcelona, Spain
[2] Univ Illinois, Chicago, IL 60680 USA
关键词
Design; Experimentation; Performance; Heterogeneous Systems; Data-centric Programming Models; Asymmetric Distributed Shared Memory;
D O I
10.1145/1735971.1736059
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Heterogeneous computing combines general purpose CPUs with accelerators to efficiently execute both sequential control-intensive and data-parallel phases of applications. Existing programming models for heterogeneous computing rely on programmers to explicitly manage data transfers between the CPU system memory and accelerator memory. This paper presents a new programming model for heterogeneous computing, called Asymmetric Distributed Shared Memory (ADSM), that maintains a shared logical memory space for CPUs to access objects in the accelerator physical memory but not vice versa. The asymmetry allows light-weight implementations that avoid common pitfalls of symmetrical distributed shared memory systems. ADSM allows programmers to assign data objects to performance critical methods. When a method is selected for accelerator execution, its associated data objects are allocated within the shared logical memory space, which is hosted in the accelerator physical memory and transparently accessible by the methods executed on CPUs. We argue that ADSM reduces programming efforts for heterogeneous computing systems and enhances application portability. We present a software implementation of ADSM, called GMAC, on top of CUDA in a GNU/Linux environment. We show that applications written in ADSM and running on top of GMAC achieve performance comparable to their counterparts using programmer-managed data transfers. This paper presents the GMAC system and evaluates different design choices. We further suggest additional architectural support that will likely allow GMAC to achieve higher application performance than the current CUDA model.
引用
下载
收藏
页码:347 / 358
页数:12
相关论文
共 50 条
  • [1] An Asymmetric Distributed Shared Memory Model for Heterogeneous Parallel Systems
    Gelado, Isaac
    Cabezas, Javier
    Navarro, Nacho
    Stone, John E.
    Patel, Sanjay
    Hwu, Wen-mei W.
    ASPLOS XV: FIFTEENTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, 2010, : 347 - 358
  • [2] PANEL - PARALLEL AND DISTRIBUTED COMPUTING DISTRIBUTED MEMORY OR SHARED MEMORY-SYSTEMS
    REIJNS, GL
    IFIP TRANSACTIONS A-COMPUTER SCIENCE AND TECHNOLOGY, 1992, 12 : 543 - 544
  • [3] Distributed parallel volume rendering on shared memory systems
    Hancock, D.J.
    Hubbold, R.J.
    Future Generation Computer Systems, 1998, 13 (4-5): : 251 - 259
  • [4] Distributed parallel volume rendering on shared memory systems
    Hancock, DJ
    Hubbold, RJ
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 1997, 1225 : 157 - 164
  • [5] Distributed parallel volume rendering on shared memory systems
    Hancock, DJ
    Hubbold, RJ
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 1998, 13 (4-5): : 251 - 259
  • [6] HETEROGENEOUS DISTRIBUTED SHARED MEMORY
    ZHOU, SN
    STUMM, M
    LI, K
    WORTMAN, D
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1992, 3 (05) : 540 - 554
  • [7] PARALLEL LOOP SCHEDULING APPROACHES FOR DISTRIBUTED AND SHARED MEMORY SYSTEMS
    Aguilar, Jose
    Leiss, Ernst
    PARALLEL PROCESSING LETTERS, 2005, 15 (1-2)
  • [8] A PARALLEL VERSION OF ARGOS - A DISTRIBUTED MEMORY MODEL FOR SHARED MEMORY UNIX COMPUTERS
    HARRISON, RJ
    KENDALL, RA
    THEORETICA CHIMICA ACTA, 1991, 79 (05): : 337 - 347
  • [9] Design and analysis of a parallel file system for distributed shared memory systems
    Mac, SC
    Shieh, CK
    Chang, JB
    JOURNAL OF SYSTEMS ARCHITECTURE, 1999, 45 (08) : 603 - 617
  • [10] Adaptively scheduling parallel loops in distributed shared-memory systems
    Yan, Y
    Jin, CM
    Zhang, XD
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1997, 8 (01) : 70 - 81