Parallel simulation of chip-multiprocessor architectures

被引:29
|
作者
Chidester, Matthew [1 ,3 ]
George, Alan [2 ,4 ]
机构
[1] Intel Corporation
[2] University of Florida
[3] Intel Corporation, RA2-455, 2501 NW 229th Street, Hillsboro, OR 97124
[4] Department of Electrical Engineering, University of Florida, 327 Larsen Hall, PO Box 116200, Gainesville, FL 32611
关键词
Cache memory - Computer aided design - Computer architecture - Computer simulation - Coupled circuits - Interfaces (computer) - Microprocessor chips - Parallel algorithms - Synchronization;
D O I
10.1145/643114.643116
中图分类号
学科分类号
摘要
Chip-multiprocessor (CMP) architectures present a challenge for efficient simulation, combining the requirements of a detailed microprocessor simulator with that of a tightly-coupled parallel system. In this paper, a distributed simulator for target CMPs is presented based on the Message Passing Interface (MPI) designed to run on a host cluster of workstations. Microbenchmark-based evaluation is used to narrow the parallelization design space concerning the performance impact of distributed vs. centralized target L2 simulation, blocking vs. non-blocking remote cache accesses, null-message vs. barrier techniques for clock synchronization, and network interconnect selection. The best combination is shown to yield speedups of up to 16 on a 9-node cluster of dual-CPU workstations, partially due to cache effects.
引用
下载
收藏
页码:176 / 200
相关论文
共 50 条
  • [31] Performance and power impact of issue-width in chip-multiprocessor cores
    Ekman, M
    Stenstrom, P
    2003 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS, 2003, : 359 - 368
  • [32] Three-dimensional chip-multiprocessor run-time thermal management
    Zhu, Changyun
    Gu, Zhenyu
    Shang, Li
    Dick, Robert P.
    Joseph, Russ
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2008, 27 (08) : 1479 - 1492
  • [33] LE1: A parameterizable VLIW Chip-Multiprocessor with hardware PThreads support
    Stevens, David
    Chouliaras, Vassilios
    IEEE ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2010), 2010, : 122 - 126
  • [34] A DEA-GA multi-objective scheduling algorithm for Chip-Multiprocessor
    Chai, Song
    Li, Yubai
    Wu, Chang
    Wang, Jian
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON SCIENCE AND SOCIAL RESEARCH (ICSSR 2013), 2013, 64 : 666 - 670
  • [35] On chip interconnects for multiprocessor turbo decoding architectures
    Martina, M.
    Masera, G.
    Moussa, H.
    Baghdadi, A.
    MICROPROCESSORS AND MICROSYSTEMS, 2011, 35 (02) : 167 - 181
  • [36] Power Gating with Block Migration in Chip-Multiprocessor Last-Level Caches
    Kadjo, David
    Kim, Hyungjun
    Gratz, Paul
    Hu, Jiang
    Ayoub, Raid
    2013 IEEE 31ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2013, : 93 - 99
  • [37] Modeling Cache sharing on chip multiprocessor architectures
    Petoumenos, Pavlos
    Keramidas, Georgios
    Zeffer, Hakan
    Kaxiras, Stefanos
    Hagersten, Erik
    PROCEEDINGS OF THE IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION, 2006, : 160 - +
  • [38] Communication/Synchronisation mechanism for Multiprocessor on Chip Architectures
    Zertal, Soraya
    Timsit, Claude
    Chatti, Majed
    ISCC: 2009 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, 2009, : 1096 - +
  • [39] Hardening Architectures for Multiprocessor System-on-Chip
    Aviles, Pablo M.
    Garcia-Astudillo, Luis A.
    Entrena, Luis
    Garcia-Valderas, Mario
    Martin-Holgado, Pedro
    Morilla, Yolanda
    Lindoso, Almudena
    IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2024, 71 (08) : 1887 - 1895
  • [40] A multiprocessor cache for massively parallel SoC architectures
    Niemann, Jorg-Christian
    LiB, Christian
    Porrmann, Mario
    Rueckert, Ulrich
    ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2007, PROCEEDINGS, 2007, 4415 : 83 - +