Fine-Grained Management of Thread Blocks for Irregular Applications

被引:0
|
作者
Beaumont, Jonathan [1 ]
Mudge, Trevor [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
GPUs; thread scheduling; resource management; preemption; irregular parallelism; ALGORITHMS; DRIVEN;
D O I
10.1109/ICCD46524.2019.00042
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Co-locating threads with complementary resource usage is a key strategy for improving throughput in parallel machines such as GPUs. However, the proliferation of irregular algorithms which change execution behavior dynamically makes optimal thread placement impossible when done statically or by profiling kernels as a whole. In this work, we characterize the performance loss associated with current thread block scheduling policies in GPU architectures. We then demonstrate that an extension of these strategies incorporating dynamic performance metrics such as memory and functional unit utilization at the thread block level as well as preemptive thread block swapping can improve throughput. We show that performance on irregular algorithms can be improved by an average increase of 17.1% over static profiling methods and 12.9% over dynamic strategies with no changes to legacy software and minimal hardware extensions which increase SRAM storage area by less than 0.5%.
引用
收藏
页码:283 / 292
页数:10
相关论文
共 50 条
  • [1] A fine-grained thread-aware management policy for shared caches
    Rolan, Dyer
    Andrade, Diego
    Fraguela, Basilio B.
    Doallo, Ramon
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (06): : 1355 - 1374
  • [2] Perman: Fine-grained Permission Management for Android Applications
    Fu, Jiaojiao
    Zhou, Yangfan
    Liu, Huan
    Kang, Yu
    Wang, Xin
    2017 IEEE 28TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2017, : 250 - 259
  • [3] Thread Motion: Fine-Grained Power Management for Multi-Core Systems
    Rangan, Krishna K.
    Wei, Gu-Yeon
    Brooks, David
    ISCA 2009: 36TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2009, : 302 - 313
  • [4] Fine-Grained Management for Microservice Applications with Lazy Configuration Distribution
    Wang, Ning
    Wang, Lin
    Li, Xin
    Qin, Xiaolin
    ELECTRONICS, 2023, 12 (16)
  • [5] Address Scaling: Architectural Support for Fine-Grained Thread-Safe Metadata Management
    Mishra, Deepanjali
    Kanellopoulos, Konstantinos
    Panwar, Ashish
    Sriraman, Akshitha
    Seshadri, Vivek
    Mutlu, Onur
    Mowry, Todd C.
    IEEE COMPUTER ARCHITECTURE LETTERS, 2024, 23 (01) : 69 - 72
  • [6] Critical thread guided fine-grained adaptive capacity management for shared CMP caches
    Xu, Xin
    Peng, Manman
    Information Technology Journal, 2013, 12 (07) : 1366 - 1372
  • [7] Fine-Grained Parallel Traversals of Irregular Data Structures
    Ren, Bin
    Agrawal, Gagan
    Larus, James R.
    Mytkowicz, Todd
    Poutanen, Tomi
    Schulte, Wolfram
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'12), 2012, : 461 - 462
  • [8] Arithmetic/Logic Blocks for Fine-Grained Reconfigurable Units
    Cardarilli, Gian Carlo
    Di Nunzio, Luca
    Re, Marco
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 2001 - 2004
  • [9] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
  • [10] Making dense fine-grained large graphite blocks
    N. A. Lobastov
    A. N. Chernyavets
    Refractories and Industrial Ceramics, 2008, 49 : 126 - 128