Carbon: Architectural Support for Fine-Grained Parallelism on Chip Multiprocessors

被引:0
|
作者
Kumar, Sanjeev [1 ]
Hughes, Christopher J. [1 ]
Nguyen, Anthony [1 ]
机构
[1] Intel, Microprocessor Technol Labs, Santa Clara, CA 95052 USA
关键词
CMP; loop and task parallelism; architectural support;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Chip multiprocessors (CMPs) are now commonplace, and the number of cores on a CMP is likely to grow steadily. However, in order to harness the additional compute resources of a CMP, applications must expose their thread-level parallelism to the hardware. One common approach to doing this is to decompose a program into parallel "tasks" and allow an underlying software layer to schedule these tasks to different threads. Software task scheduling can provide good parallel performance as long as tasks are large compared to the software overheads. We examine a set of applications from an important emerging domain: Recognition, Mining, and Synthesis (RMS). Many RMS applications are compute-intensive and have abundant thread-level parallelism, and are therefore good targets for running on a CMP. However, a significant number have small tasks for which software task schedulers achieve only limited parallel speedups. We propose Carbon, a hardware technique to accelerate dynamic task scheduling on scalable CMPs. Carbon has relatively simple hardware, most of which can be placed far from the cores. We compare Carbon to some highly tuned software task schedulers for a set of RMS benchmarks with small tasks. Carbon delivers significant performance improvements over the best software scheduler: on average for 64 cores, 68% faster on a set of loop-parallel benchmarks, and 109% faster on a set of task-parallel benchmarks.
引用
收藏
页码:162 / 173
页数:12
相关论文
共 50 条
  • [21] Fine-grained concretes for small architectural forms
    Lesovik, R.V.
    Ageeva, M.S.
    Golikov, V.G.
    Fomenko, Yu.V.
    Stroitel'nye Materialy, 2005, (11): : 66 - 67
  • [22] Architectural Support for Exploiting Fine Grain Parallelism
    Rosas-Ham, Demian
    Herath, Isuru
    Yiapanis, Paraskevas
    Lujan, Mikel
    Watson, Ian
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 61 - 70
  • [23] A MULTIPROCESSOR ARCHITECTURE COMBINING FINE-GRAINED AND COARSE-GRAINED PARALLELISM STRATEGIES
    LILJA, DJ
    PARALLEL COMPUTING, 1994, 20 (05) : 729 - 751
  • [24] Address Scaling: Architectural Support for Fine-Grained Thread-Safe Metadata Management
    Mishra, Deepanjali
    Kanellopoulos, Konstantinos
    Panwar, Ashish
    Sriraman, Akshitha
    Seshadri, Vivek
    Mutlu, Onur
    Mowry, Todd C.
    IEEE COMPUTER ARCHITECTURE LETTERS, 2024, 23 (01) : 69 - 72
  • [25] Study of Fine-grained Nested Parallelism in CDCL SAT Solvers
    Edwards, James
    Vishkin, Uzi
    ACM TRANSACTIONS ON PARALLEL COMPUTING, 2021, 8 (03)
  • [26] Fine-grained parallelism in probabilistic parsing with Habanero Java']Java
    Francis-Landau, Matthew
    Xue, Bing
    Eisner, Jason
    Sarkar, Vivek
    PROCEEDINGS OF 2016 6TH WORKSHOP ON IRREGULAR APPLICATIONS: ARCHITECTURE AND ALGORITHMS (IA3), 2016, : 78 - 81
  • [27] FINGERS: Exploiting Fine-Grained Parallelism in Graph Mining Accelerators
    Chen, Qihang
    Tian, Boyu
    Gao, Mingyu
    ASPLOS '22: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, 2022, : 43 - 55
  • [28] An FPGA Overlay for CNN Inference with Fine-grained Flexible Parallelism
    Choudhury, Ziaul
    Shrivastava, Shashwat
    Ramapantulu, Lavanya
    Purini, Suresh
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2022, 19 (03)
  • [29] Towards Fine-Grained Dataflow Parallelism in Big Data Systems
    Ertel, Sebastian
    Adam, Justus
    Castrillon, Jeronimo
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2017, 2019, 11403 : 281 - 282
  • [30] Accelerating a Lossy Compression Method with Fine-Grained Parallelism on a GPU
    Wu, Yifan
    Shen, Jingcheng
    Okita, Masao
    Ino, Fumihiko
    PAAP 2021: 2021 12TH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING, 2021, : 76 - 81