RaceR: A Thread Mapping Algorithm for Race Reduction in Multi-Level Shared Caches

被引:3
|
作者
Sahneh, Pezhman Shojaa [1 ]
Sarihi, Amin [1 ]
Warburton, Benjamin [1 ]
Patooghy, Ahmad [1 ]
机构
[1] Univ Cent Arkansas, Dept Comp Sci, Conway, AR 72035 USA
关键词
Multi-core architecture; cache race; shared cache; multi-level cache; LOW-OVERHEAD;
D O I
10.1109/EMPDP.2019.8671576
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-level hierarchical cache architectures are now being widely used in the design and fabrication of multi and many-core chips. However, when two or more threads race to write their own data into the shared-cache, contentions may happen. This natural conflict seriously aggravates the performance of multi-core systems by showing variant performance in multiple runs of even a same program. In this paper, an efficient thread-mapping algorithm is proposed to minimize the cache race condition between threads of multi-core systems. The proposed algorithm, dynamically monitors races on cache blocks and distributes existing and new threads on cores such that the cache contention is minimized. The proposed algorithm uses instructions per cycle (IPC) parameter to detect conflicting threads on the multi-core system. Upon detection of a high contention rate, two mechanisms of cache access rate reduction, and thread migration are used to resolve the race situation. The first solution is a short term one with negligible performance loss, while the former totally resolves the problem with a relatively higher performance cost. Evaluations of the proposed algorithm are done by the use of AKULA simulator alongside SPEC CPU 2006 benchmark suit. Simulation results show that the proposed algorithm improves system performance by average of 6.12% for the SPEC CPU 2006 benchmark suit.
引用
收藏
页码:228 / 232
页数:5
相关论文
共 50 条
  • [41] A multi-level multi-integral algorithm for the Helmholtz equation
    Dargush, G. F.
    Grigoriev, M. M.
    [J]. Noise Control and Acoustics Division - 2005, 2005, 32 : 41 - 48
  • [42] Thread fork/join techniques for multi-level parallelism exploitation in NUMA multiprocessors
    Martorell, Xavier
    Ayguade, Eduard
    Navarro, Nacho
    Corbalan, Julita
    Gonzalez, Marc
    Labarta, Jesus
    [J]. Proceedings of the International Conference on Supercomputing, 1999, : 294 - 301
  • [43] Multi-level, multi-step motion estimation algorithm
    Shin, DS
    Kwak, NJ
    Kwon, HB
    Ahn, JH
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (06) : 760 - 762
  • [44] Research of Real-time Data Warehouse Storage Strategy Based on Multi-level Caches
    Shao YiChuan
    Yao, Xingjia
    [J]. INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 2315 - 2321
  • [45] WCET analysis of multi-level non-inclusive set-associative instruction caches
    Hardy, Damien
    Puaut, Isabelle
    [J]. RTSS: 2008 REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2008, : 456 - 466
  • [46] Leakage power optimization techniques for ultra deep sub-micron multi-level caches
    Kim, NS
    Blaauw, D
    Mudge, T
    [J]. ICCAD-2003: IEEE/ACM DIGEST OF TECHNICAL PAPERS, 2003, : 627 - 632
  • [47] Multi-level passive order reduction of interconnect networks
    Khazaka, R
    Nakhla, M
    [J]. 2001 IEEE MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM DIGEST, VOLS 1-3, 2001, : 1155 - 1158
  • [48] Design and Integration of Hierarchical-Placement Multi-level Caches for Real-Time Systems
    Benedicte, Pedro
    Hernandez, Caries
    Abella, Jaume
    Cazorla, Francisco J.
    [J]. PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 455 - 460
  • [49] Multi-level order reduction with nonlinear port constraints
    Ma, Min
    Khazaka, Roni
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 1485 - 1488
  • [50] Multi-level complexity reduction for HEVC multiview coding
    Jiang, Caoyang
    Nooshabadi, Saeid
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (02) : 197 - 213