Parallel simulation of chip-multiprocessor architectures

被引:29
|
作者
Chidester, Matthew [1 ,3 ]
George, Alan [2 ,4 ]
机构
[1] Intel Corporation
[2] University of Florida
[3] Intel Corporation, RA2-455, 2501 NW 229th Street, Hillsboro, OR 97124
[4] Department of Electrical Engineering, University of Florida, 327 Larsen Hall, PO Box 116200, Gainesville, FL 32611
关键词
Cache memory - Computer aided design - Computer architecture - Computer simulation - Coupled circuits - Interfaces (computer) - Microprocessor chips - Parallel algorithms - Synchronization;
D O I
10.1145/643114.643116
中图分类号
学科分类号
摘要
Chip-multiprocessor (CMP) architectures present a challenge for efficient simulation, combining the requirements of a detailed microprocessor simulator with that of a tightly-coupled parallel system. In this paper, a distributed simulator for target CMPs is presented based on the Message Passing Interface (MPI) designed to run on a host cluster of workstations. Microbenchmark-based evaluation is used to narrow the parallelization design space concerning the performance impact of distributed vs. centralized target L2 simulation, blocking vs. non-blocking remote cache accesses, null-message vs. barrier techniques for clock synchronization, and network interconnect selection. The best combination is shown to yield speedups of up to 16 on a 9-node cluster of dual-CPU workstations, partially due to cache effects.
引用
收藏
页码:176 / 200
相关论文
共 50 条
  • [41] Parallel and multiprocessor vision system architectures - Preface
    Bourbakis, NG
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1998, 12 (03) : 263 - 264
  • [42] Multiprocessor architectures for embedded system-on-chip applications
    Ravikumar, CP
    17TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, PROCEEDINGS: DESIGN METHODOLOGIES FOR THE GIGASCALE ERA, 2004, : 512 - 519
  • [43] Multiprocessor DSP scheduling in system-on-a-chip architectures
    Gai, P
    Abeni, L
    Buttazzo, G
    EUROMICRO RTS 2002: 14TH EUROMICRO CONFERENCE ON REAL-TIME SYSTEMS, PROCEEDINGS, 2002, : 231 - 238
  • [44] CHIP ARCHITECTURES FOR PARALLEL PROCESSING
    DETTMER, R
    ELECTRONICS AND POWER, 1985, 31 (03): : 227 - 231
  • [45] Heterogeneous Chip Multiprocessor Architectures for Big Data Applications
    Homayoun, Houman
    PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF'16), 2016, : 400 - 405
  • [46] PARALLEL SIMULATION ON THE HYPERCUBE MULTIPROCESSOR
    BOUKERCHE, A
    TROPPER, C
    DISTRIBUTED COMPUTING, 1995, 8 (04) : 181 - 190
  • [47] PARALLEL FAULT SIMULATION ON A MULTIPROCESSOR
    BHATTACHARYYA, UK
    MOHAN, G
    SENGUPTA, I
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1992, 23 (11) : 2025 - 2035
  • [48] Memory subsystem characterization in a 16-core snoop-based chip-multiprocessor architecture
    Villa, FJ
    Acacio, ME
    García, JM
    HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2005, 3726 : 213 - 222
  • [50] PARALLEL OPTICAL INTERCONNECTS - IMPLEMENTATION OF OPTOELECTRONICS IN MULTIPROCESSOR ARCHITECTURES
    FRIETMAN, EEE
    VANNIFTERICK, W
    DEKKER, L
    JONGELING, TJM
    APPLIED OPTICS, 1990, 29 (08): : 1161 - 1177