Distributed-Shared Memory Computed Tomography

被引:0
|
作者
de la Fuente, Francisco [1 ]
Torres, Felipe [1 ]
Rannou, Fernando R. [1 ]
机构
[1] Univ Santiago Chile, Dept Ingn Informat, Av Ecuador 3659, Santiago, Chile
关键词
Distributed-shared memory; Statistical Reconstruction;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Large-scale statistical reconstruction algorithms are known to be memory and processor intensive applications. For instance, the system matrix for a small animal scanner requires several gigabytes of memory storage and the algorithm usually needs many iterations to produce acceptable images. In this abstract we design distributed-shared memory (DSM) statistical reconstruction algorithms to exploit all available computational resources as a unified infrastructure and thereby improving the cost-efficiency of the investment and scalability of the system. We use and compare two distinct approaches. The first one uses the Unified Parallel C (UPC) compiler which transparently provides a global shared virtual address space across all computers. Data is physically stored in different computers, but threads can access any shared item as it if were in its local memory. The second approach combines OpenMP and Pthreads shared-memory libraries with the message-passing library MPI. In this case threads only have access to the node's local memory and access to remote data is carried out explicitly through message-passing. Early UPC experiments showed that keeping all data shared heavily affects reconstruction performance. Therefore, we devised a distribution method where some data is kept shared and other is kept private, mimicking somehow the library-based approach. However, even with data privatization, the compiler solution cannot compete with the library solutions. We explore three workload distribution strategies: LOR-based, Nonzero-based and Cores-based. The best performance is obtained with OpenMP+MPI and the Core-based balance algorithm, which reaches a speedup of 36 with 112 cores. However, both OpenMP+MPI and Pthreads+MPI outperform UPC by large. The low system efficiency of 0.32 is mainly due to the slow internode communication network.
引用
收藏
页码:2452 / 2455
页数:4
相关论文
共 50 条
  • [21] AN IMPLEMENTATION OF DISTRIBUTED SHARED MEMORY
    RAMACHANDRAN, U
    KHALIDI, MYA
    WORKSHOP ON EXPERIENCES WITH DISTRIBUTED AND MULTIPROCESSOR SYSTEMS, 1989, : 21 - 38
  • [22] COHERENT SHARED MEMORY ON A DISTRIBUTED MEMORY MACHINE
    BISIANI, R
    NOWATZYK, A
    RAVISHANKAR, M
    PROCEEDINGS OF THE 1989 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, VOL 1: ARCHITECTURE, 1989, : I133 - I142
  • [23] Two layers distributed shared memory
    Baiardi, F
    Guerri, D
    Mori, P
    Moroni, L
    Ricci, L
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 2001, 2110 : 302 - 311
  • [24] Distributed Shared Memory: To relax or not to relax?
    Iosevich, V
    Schuster, A
    EURO-PAR 2004 PARALLEL PROCESSING, PROCEEDINGS, 2004, 3149 : 198 - 205
  • [25] Distributed shared memory in kernel mode
    Trevisan, TS
    Costa, VS
    Whately, L
    Amorim, CL
    14TH SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2002, : 159 - 166
  • [26] Analysis of Shared Memory in Distributed and Non Distributed Environment
    Dangi, Dharmendra
    Bhandari, Sachin
    Bhagat, Amit
    2016 FIFTH INTERNATIONAL CONFERENCE ON ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS (ICECCS), 2016, : 114 - 118
  • [27] Checkpointing speculative distributed shared memory
    Danilecki, Arkadiusz
    Kobusinska, Anna
    Szychowiak, Michal
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2006, 3911 : 9 - 16
  • [28] RIGHT ON TIME DISTRIBUTED SHARED MEMORY
    Guerraoui, Rachid
    Kozhaya, David
    Pignolet, Yvonne-Anne
    PROCEEDINGS OF 2016 IEEE REAL-TIME SYSTEMS SYMPOSIUM (RTSS), 2016, : 315 - 326
  • [29] The tempest approach to distributed shared memory
    Wood, DA
    Hill, MD
    Larus, JR
    INTERNATIONAL CONFERENCE ON COMPUTER DESIGN - VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 1996, : 63 - 64
  • [30] A Distributed Hash Table for Shared Memory
    Oortwijn, Wytse
    van Dijk, Tom
    van de Pol, Jaco
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, PPAM 2015, PT II, 2016, 9574 : 15 - 24