Distributed-Shared Memory Computed Tomography

被引:0
|
作者
de la Fuente, Francisco [1 ]
Torres, Felipe [1 ]
Rannou, Fernando R. [1 ]
机构
[1] Univ Santiago Chile, Dept Ingn Informat, Av Ecuador 3659, Santiago, Chile
关键词
Distributed-shared memory; Statistical Reconstruction;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Large-scale statistical reconstruction algorithms are known to be memory and processor intensive applications. For instance, the system matrix for a small animal scanner requires several gigabytes of memory storage and the algorithm usually needs many iterations to produce acceptable images. In this abstract we design distributed-shared memory (DSM) statistical reconstruction algorithms to exploit all available computational resources as a unified infrastructure and thereby improving the cost-efficiency of the investment and scalability of the system. We use and compare two distinct approaches. The first one uses the Unified Parallel C (UPC) compiler which transparently provides a global shared virtual address space across all computers. Data is physically stored in different computers, but threads can access any shared item as it if were in its local memory. The second approach combines OpenMP and Pthreads shared-memory libraries with the message-passing library MPI. In this case threads only have access to the node's local memory and access to remote data is carried out explicitly through message-passing. Early UPC experiments showed that keeping all data shared heavily affects reconstruction performance. Therefore, we devised a distribution method where some data is kept shared and other is kept private, mimicking somehow the library-based approach. However, even with data privatization, the compiler solution cannot compete with the library solutions. We explore three workload distribution strategies: LOR-based, Nonzero-based and Cores-based. The best performance is obtained with OpenMP+MPI and the Core-based balance algorithm, which reaches a speedup of 36 with 112 cores. However, both OpenMP+MPI and Pthreads+MPI outperform UPC by large. The low system efficiency of 0.32 is mainly due to the slow internode communication network.
引用
收藏
页码:2452 / 2455
页数:4
相关论文
共 50 条
  • [1] Conservative simulation using distributed-shared memory
    Department of Computer Science, National University of Singapore, 3 Science Drive 2, Singapore, Singapore
    Proc. - Workshop Parallel Distributed Simul., PADS, 1600, (1-7):
  • [2] Conservative simulation using distributed-shared memory
    Teo, YM
    Ng, YK
    Onggo, BSS
    16TH WORKSHOP ON PARALLEL AND DISTRIBUTED SIMULATION, PROCEEDINGS, 2002, : 3 - 10
  • [3] Using distributed-shared memory mechanisms for agents communication in a distributed system
    Gonzaga, Thiago
    Bentes, Cristiana
    Farias, Ricardo
    De Castro, Maria Clicia S.
    Garcia, Ana Cristina B.
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, : 39 - +
  • [4] Parallelizing the ZSWEEP algorithm for distributed-shared memory architectures
    Farias, R
    Silva, CT
    VOLUME GRAPHICS 2001, 2001, : 181 - +
  • [5] Biological sequence analysis on distributed-shared memory multiprocessors
    Rodriguez, A
    de la Fraga, LG
    Zapata, EL
    Carazo, JM
    Trelles, O
    PROCEEDINGS OF THE SIXTH EUROMICRO WORKSHOP ON PARALLEL AND DISTRIBUTED PROCESSING - PDP '98, 1998, : 20 - 26
  • [6] MPI management of Hermite collocation computation on a distributed-shared memory system
    Applied Mathematics and Computers Laboratory, Department of Sciences, Technical University of Crete, University Campus, 73100 Chania, Greece
    WSEAS Trans. Math., 2006, 5 (520-525):
  • [7] Features of optical interconnects in distributed-shared memory organized MIMD architectures: the ultimate goal
    Frietman, EEE
    Ernst, RJ
    Crosbie, P
    Shimoji, M
    THIRD INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS, AND NETWORKS, PROCEEDINGS (I-SPAN '97), 1997, : 37 - 43
  • [8] MPI plus OpenMP implementation of memory-saving parallel PIC applications on hierarchical distributed-shared memory architectures
    Briguglio, S
    Fogaccia, G
    Vlad, G
    Di Martino, B
    MODELLING AND SIMULATION 2003, 2003, : 247 - 251
  • [9] Performance analysis of two address allocation schemes in an optically interconnected distributed-shared memory (OIDSM) system
    Hwang, I-Shyan
    Journal of the Chinese Institute of Electrical Engineering, Transactions of the Chinese Institute of Engineers, Series E/Chung KuoTien Chi Kung Chieng Hsueh K'an, 1997, 4 (04): : 245 - 260
  • [10] A hierarchical distributed-shared memory parallel Branch&Bound application with PVM and OpenMP for multiprocessor clusters
    Aversa, R
    Di Martino, B
    Mazzocca, N
    Venticinque, S
    PARALLEL COMPUTING, 2005, 31 (10-12) : 1034 - 1047