RaceR: A Thread Mapping Algorithm for Race Reduction in Multi-Level Shared Caches

被引：3

作者：

Sahneh, Pezhman Shojaa ^{[1
]}

Sarihi, Amin ^{[1
]}

Warburton, Benjamin ^{[1
]}

Patooghy, Ahmad ^{[1
]}

机构：

[1] Univ Cent Arkansas, Dept Comp Sci, Conway, AR 72035 USA

来源：

2019 27TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP) | 2019年

关键词：

Multi-core architecture; cache race; shared cache; multi-level cache; LOW-OVERHEAD;

D O I：

10.1109/EMPDP.2019.8671576

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-level hierarchical cache architectures are now being widely used in the design and fabrication of multi and many-core chips. However, when two or more threads race to write their own data into the shared-cache, contentions may happen. This natural conflict seriously aggravates the performance of multi-core systems by showing variant performance in multiple runs of even a same program. In this paper, an efficient thread-mapping algorithm is proposed to minimize the cache race condition between threads of multi-core systems. The proposed algorithm, dynamically monitors races on cache blocks and distributes existing and new threads on cores such that the cache contention is minimized. The proposed algorithm uses instructions per cycle (IPC) parameter to detect conflicting threads on the multi-core system. Upon detection of a high contention rate, two mechanisms of cache access rate reduction, and thread migration are used to resolve the race situation. The first solution is a short term one with negligible performance loss, while the former totally resolves the problem with a relatively higher performance cost. Evaluations of the proposed algorithm are done by the use of AKULA simulator alongside SPEC CPU 2006 benchmark suit. Simulation results show that the proposed algorithm improves system performance by average of 6.12% for the SPEC CPU 2006 benchmark suit.

引用

页码：228 / 232

页数：5

共 50 条

[41] A multi-level multi-integral algorithm for the Helmholtz equation
Dargush, G. F.
Grigoriev, M. M.
[J]. Noise Control and Acoustics Division - 2005, 2005, 32 : 41 - 48
[42] Thread fork/join techniques for multi-level parallelism exploitation in NUMA multiprocessors
Martorell, Xavier
Ayguade, Eduard
Navarro, Nacho
Corbalan, Julita
Gonzalez, Marc
Labarta, Jesus
[J]. Proceedings of the International Conference on Supercomputing, 1999, : 294 - 301
[43] Multi-level, multi-step motion estimation algorithm
Shin, DS
Kwak, NJ
Kwon, HB
Ahn, JH
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (06) : 760 - 762
[44] Research of Real-time Data Warehouse Storage Strategy Based on Multi-level Caches
Shao YiChuan
Yao, Xingjia
[J]. INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 2315 - 2321
[45] WCET analysis of multi-level non-inclusive set-associative instruction caches
Hardy, Damien
Puaut, Isabelle
[J]. RTSS: 2008 REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2008, : 456 - 466
[46] Leakage power optimization techniques for ultra deep sub-micron multi-level caches
Kim, NS
Blaauw, D
Mudge, T
[J]. ICCAD-2003: IEEE/ACM DIGEST OF TECHNICAL PAPERS, 2003, : 627 - 632
[47] Multi-level passive order reduction of interconnect networks
Khazaka, R
Nakhla, M
[J]. 2001 IEEE MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM DIGEST, VOLS 1-3, 2001, : 1155 - 1158
[48] Design and Integration of Hierarchical-Placement Multi-level Caches for Real-Time Systems
Benedicte, Pedro
Hernandez, Caries
Abella, Jaume
Cazorla, Francisco J.
[J]. PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 455 - 460
[49] Multi-level order reduction with nonlinear port constraints
Ma, Min
Khazaka, Roni
[J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 1485 - 1488
[50] Multi-level complexity reduction for HEVC multiview coding
Jiang, Caoyang
Nooshabadi, Saeid
[J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (02) : 197 - 213

← 1 2 3 4 5 →