Design space exploration of hardware task superscalar architecture

Authors
Fahimeh Yazdanpanah
Mohammad Alaei
Affiliation
[1] Shahid Bahonar University of Kerman, Computer Engineering Department, Faculty of Engineering
Source
Keywords
Task scheduling; Task parallelism; Task superscalar; OmpSs
DOI
Not available
Abstract
For current high-performance computing systems, exploiting concurrency is a serious and important challenge. Recently, several dynamic software task management mechanisms have been proposed, in particular task-based dataflow programming models, which benefit from dataflow principles to improve task-level parallelism and overcome the limitations of static task management systems. However, these programming models rely on software-based dependency analysis, which is inherently slow; this limits their scalability, especially when task granularity is fine and the number of tasks is large. Moreover, task scheduling in software introduces overheads and so becomes increasingly inefficient as the number of cores grows. In contrast, a hardware scheduling solution such as Task SuperScalar (TSS) can achieve greater speed-ups, because a hardware task scheduler requires fewer cycles than its software counterpart to dispatch a task. TSS combines the effectiveness of out-of-order processors with the task abstraction. It has been implemented in software, with limited parallelism and high memory consumption due to the nature of the software implementation. Hardware Task Superscalar (HTSS) is proposed to solve these drawbacks. HTSS is designed to be integrated into a future high-performance computer with the ability to exploit fine-grained task parallelism. In this article, a detailed latency and design space exploration of HTSS is described. For the design space exploration, we have designed a full cycle-accurate simulator of HTSS, called SimTSS. The simulator has been tuned based on a latency exploration of the HTSS components, derived from the VHDL description of each component. As a result of this exploration, we have determined the number of components and the memory capacity of HTSS for HPC systems.
Pages: 3567 - 3592
Number of pages: 25