Makespan computation for GPU threads running on a single streaming multiprocessor

被引:4
|
作者
Berezovskyi, Kostiantyn [1 ]
Bletsas, Konstantinos [1 ]
Andersson, Bjoern
机构
[1] Polytech Inst Porto, ISEP Res Unit, CISTER, Oporto, Portugal
来源
PROCEEDINGS OF THE 24TH EUROMICRO CONFERENCE ON REAL-TIME SYSTEMS (ECRTS 2012) | 2012年
关键词
GENERAL-PURPOSE COMPUTATION;
D O I
10.1109/ECRTS.2012.16
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Graphics processors were originally developed for rendering graphics but have recently evolved towards being an architecture for general-purpose computations. They are also expected to become important parts of embedded systems hardware - not just for graphics. However, this necessitates the development of appropriate timing analysis techniques which would be required because techniques developed for CPU scheduling are not applicable. The reason is that we are not interested in how long it takes for any given GPU thread to complete, but rather how long it takes for all of them to complete. We therefore develop a simple method for finding an upper bound on the makespan of a group of GPU threads executing the same program and competing for the resources of a single streaming multiprocessor (whose architecture is based on NVIDIA Fermi, with some simplifying assumptions). We then build upon this method to formulate the derivation of the exact worst-case makespan (and corresponding schedule) as an optimization problem. Addressing the issue of tractability, we also present a technique for efficiently computing a safe estimate of the worst-case makespan with minimal pessimism, for use when finding an exact value would take too long.
引用
收藏
页码:277 / 286
页数:10
相关论文
共 6 条
  • [1] Improving GPU Performance with a Power-Aware Streaming Multiprocessor Allocation Methodology
    Tasoulas, Zois-Gerasimos
    Anagnostopoulos, Iraklis
    ELECTRONICS, 2019, 8 (12)
  • [2] MAKESPAN COMPUTATION OF LOT SWITCHING PERIOD IN SINGLE-ARMED CLUSTER TOOLS
    Kim, Hyun-Jung
    Lee, Jun-Ho
    2015 WINTER SIMULATION CONFERENCE (WSC), 2015, : 2983 - 2990
  • [3] Accelerating SPH-Fatigue Computation by Using Single Precision Program on GPU
    Tazoe, Koki
    Yamada, Tomonori
    Yagawa, Genki
    INTERNATIONAL JOURNAL OF COMPUTATIONAL METHODS, 2024, 21 (05)
  • [4] Experiments With Single Core, Multi Core, and GPU-based Computation of Cellular Automata
    Rybacki, Stefan
    Himmelspach, Jan
    Uhrmacher, Adelinde M.
    SIMUL: 2009 FIRST INTERNATIONAL CONFERENCE ON ADVANCES IN SYSTEM SIMULATION, 2009, : 62 - 67
  • [5] Exploiting Parallelism in Matrix-Computation Kernels for Symmetric Multiprocessor Systems Matrix-Multiplication and Matrix-Addition Algorithm Optimizations by Software Pipelining and Threads Allocation
    D'Alberto, Paolo
    Bodrato, Marco
    Nicolau, Alexandru
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2011, 38 (01):
  • [6] Real-time data processing in colorimetry camera-based single-molecule localization microscopy via CPU-GPU-FPGA heterogeneous computation
    Lin, Jiaxun
    Wang, Kun
    Huang, Zhen-Li
    BIOMEDICAL OPTICS EXPRESS, 2024, 15 (09): : 5560 - 5573