ParTejas: A Parallel Simulator for Multicore Processors

被引:5
|
作者
Malhotra, Geetika [1 ]
Kalayappan, Rajshekar [1 ]
Goel, Seep [1 ]
Aggarwal, Pooja [1 ]
Sagar, Abhishek [1 ]
Sarangi, Smruti R. [1 ]
机构
[1] Indian Inst Technol Delhi, Dept Comp Sci & Engn, New Delhi 110016, India
关键词
Parallel simulation; architectural simulator; Tejas; ParTejas; phasers; parallel ports; slot scheduling; SYSTEM SIMULATION;
D O I
10.1145/3077582
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this article, we present the design of a novel parallel architecture simulator called ParTejas. ParTejas is a timing simulation engine that gets its execution traces from instrumented binaries using a fast shared-memory-based mechanism. Subsequently, the waiting threads simulate the execution of multiple pipelines and an elaborate memory system with support for multilevel coherent caches. ParTejas is written in Java and primarily derives its speedups from the use of novel data structures. Specifically, it uses lock-free slot schedulers to design an entity called a parallel port that effectively models the contention at shared resources in the CPU and memory system. Parallel ports remove the need for fine-grained synchronization and allow each thread to use its local clock. Unlike conventional simulators that use barriers for synchronization at epoch boundaries, we use a sophisticated type of barrier, known as a phaser. A phaser allows threads to perform additional work without waiting for other threads to arrive at the barrier. Additionally, we use a host of Java-specific optimizations and use profiling to effectively schedule the threads. With all our optimizations, we demonstrate a speedup of 11.8x for a multi-issue in-order pipeline and 10.9x for an out-of-order pipeline with 64 threads, for a suite of seven Splash2 and Parsec benchmarks. The simulation error is limited to 2% to 4% as compared to strictly sequential simulation.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] ParTejas: A Parallel Simulator for Multicore Processors
    Malhotra, Geetika
    Aggarwal, Pooja
    Sagar, Abhishek
    Sarangi, Smruti R.
    [J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2014, : 130 - 131
  • [2] Parallel evidence propagation on multicore processors
    Yinglong Xia
    Viktor K. Prasanna
    [J]. The Journal of Supercomputing, 2011, 57 : 189 - 202
  • [3] Parallel Evidence Propagation on Multicore Processors
    Xia, Yinglong
    Feng, Xiaojun
    Prasanna, Viktor K.
    [J]. PARALLEL COMPUTING TECHNOLOGIES, PROCEEDINGS, 2009, 5698 : 377 - +
  • [4] Parallel evidence propagation on multicore processors
    Xia, Yinglong
    Prasanna, Viktor K.
    [J]. JOURNAL OF SUPERCOMPUTING, 2011, 57 (02): : 189 - 202
  • [5] MPTLsim: A Simulator for X86 Multicore Processors
    Zeng, Hui
    Yourst, Matt
    Ghose, Kanad
    Ponomarev, Dmitry
    [J]. DAC: 2009 46TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2009, : 226 - 231
  • [6] Efficient Parallel Mining of Gradual Patterns on Multicore Processors
    Laurent, Anne
    Negrevergne, Benjamin
    Sicard, Nicolas
    Termier, Alexandre
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND MANAGEMENT, VOL 2, 2012, 398 : 137 - +
  • [7] Parallelism on multicore processors using Parallel.FX
    Marquez, A. L.
    Gil, C.
    Banos, R.
    Gomez, J.
    [J]. ADVANCES IN ENGINEERING SOFTWARE, 2011, 42 (05) : 259 - 265
  • [8] Parallelism on Multicore Processors using Parallel.FX
    Marquez, A. L.
    Gil, C.
    Banos, R.
    Gomez, J.
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING FOR ENGINEERING, 2009, (90): : 147 - 158
  • [9] Global Asynchronous Parallel Program Control for Multicore Processors
    Borkowski, Janusz
    Tudruj, Marek
    Smyk, Adam
    Kopanski, Damian
    [J]. APPLIED PARALLEL AND SCIENTIFIC COMPUTING, PT I, 2012, 7133 : 119 - 130
  • [10] Multicore processors and GPUs: the power of parallel computing in the Cloud
    Bennett, Kelly W.
    Robertson, James
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS II, 2020, 11413