Manticore: Hardware-Accelerated RTL Simulation with Static Bulk-Synchronous Parallelism

Cited by: 0
Authors
Emami, Mahyar [1]
Kashani, Sahand [1]
Kamahori, Keisuke [1, 2]
Pourghannad, Mohammad Sepehr [1, 3]
Raj, Ritik [1, 4]
Larus, James R. [1]
Affiliations
[1] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[2] Univ Tokyo, Tokyo, Japan
[3] Sharif Univ, Tehran, Iran
[4] Indian Inst Technol Roorkee, Roorkee, Uttar Pradesh, India
Keywords
GATE-LEVEL SIMULATION
DOI
10.1145/3623278.3624750
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The demise of Moore's Law and Dennard Scaling has revived interest in specialized computer architectures and accelerators. Verification and testing of this hardware depend heavily upon cycle-accurate simulation of register-transfer-level (RTL) designs. The fastest software RTL simulators can simulate designs at 1-1000 kHz, i.e., more than three orders of magnitude slower than hardware. Improved simulators can increase designers' productivity by speeding design iterations and permitting more exhaustive exploration. One possibility is to exploit low-level parallelism, as RTL expresses considerable fine-grain concurrency. Unfortunately, state-of-the-art RTL simulators often perform best on a single core since modern processors cannot effectively exploit fine-grain parallelism. This work presents Manticore: a parallel computer designed to accelerate RTL simulation. Manticore uses a static bulk-synchronous parallel (BSP) execution model to eliminate fine-grain synchronization overhead. It relies entirely on a compiler to schedule resources and communication, which is feasible since RTL code contains few divergent execution paths. With static scheduling, communication and synchronization no longer incur runtime overhead, making fine-grain parallelism practical. Moreover, static scheduling dramatically simplifies processor implementation, significantly increasing the number of cores that fit on a chip. Our 225-core FPGA implementation running at 475 MHz outperforms a state-of-the-art RTL simulator running on desktop and server computers in 8 out of 9 benchmarks.
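The static BSP idea described in the abstract can be illustrated with a minimal Python sketch. This is not Manticore's implementation: the two-register design, the `PROGRAMS` table, the `SENDS` schedule, and the `simulate` function are all hypothetical names chosen for illustration. The sketch only shows the general pattern of a compute phase followed by a communication phase whose transfers are fixed before simulation starts, so no runtime synchronization decisions are made.

```python
from typing import Callable, Dict, List, Tuple

State = Dict[str, int]

# Hypothetical 8-bit design: count <= count + 1; acc <= acc + count.
# Each simulated "core" owns one register and holds local copies of what it reads.
PROGRAMS: List[Callable[[State], State]] = [
    lambda s: {"count": (s["count"] + 1) & 0xFF},       # core 0 owns count
    lambda s: {"acc": (s["acc"] + s["count"]) & 0xFF},  # core 1 owns acc
]

# Communication schedule fixed ahead of time: (source core, register, destination core).
SENDS: List[Tuple[int, str, int]] = [(0, "count", 1)]


def simulate(cycles: int) -> List[State]:
    """Run the design for `cycles` clock cycles, one BSP superstep per cycle."""
    local: List[State] = [{"count": 0}, {"acc": 0, "count": 0}]
    for _ in range(cycles):
        # Compute phase: each core derives its next state from local copies only.
        updates = [prog(loc) for prog, loc in zip(PROGRAMS, local)]
        # Commit owned registers; the end of this loop acts as the barrier.
        for loc, upd in zip(local, updates):
            loc.update(upd)
        # Communication phase: forward values along the precomputed schedule.
        for src, reg, dst in SENDS:
            local[dst][reg] = local[src][reg]
    return local


if __name__ == "__main__":
    print(simulate(4))
```

Because every core reads only its local copies during the compute phase, the cores in this sketch could run in any order or in parallel within a cycle, and the result matches a sequential evaluation of the same register updates.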
Pages: 219-237
Page count: 19
    [J]. SIMULATION AND MODELLING: ENABLERS FOR A BETTER QUALITY OF LIFE, 2000, : 83 - 90