Cost-Effective Speculative Scheduling in High Performance Processors

Cited by: 7
Authors
Perais, Arthur [1 ]
Seznec, Andre [1 ]
Michaud, Pierre [1 ]
Sembrant, Andreas [2 ]
Hagersten, Erik [2 ]
Affiliations
[1] IRISA INRIA, Rennes, France
[2] Uppsala Univ, S-75105 Uppsala, Sweden
DOI
10.1145/2749469.2749470
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
To maximize performance, out-of-order execution processors sometimes issue instructions without having the guarantee that operands will be available in time; e.g., loads are typically assumed to hit in the L1 cache and dependent instructions are issued accordingly. This form of speculation, which we refer to as speculative scheduling, has been used for two decades in real processors, but has received little attention from the research community. In particular, as pipeline depth grows, and the distance between the Issue and the Execute stages increases, it becomes critical to issue instructions dependent on variable-latency instructions as soon as possible rather than wait for the actual cycle at which the result becomes available. Unfortunately, due to the uncertain nature of speculative scheduling, the scheduler may wrongly issue an instruction that will not have its source(s) available on the bypass network when it reaches the Execute stage. In that event, the instruction is canceled and replayed, potentially impairing performance and increasing energy consumption. In this work, we do not present a new replay mechanism. Rather, we focus on ways of reducing the number of replays that are agnostic of the replay scheme. First, we propose an easily implementable, low-cost solution to reduce the number of replays caused by L1 bank conflicts. Schedule shifting always assumes that, given a dual-load issue capacity, the second load issued in a given cycle will be delayed because of a bank conflict. Its dependents are thus always issued with the corresponding delay. Second, we also improve on existing L1 hit/miss prediction schemes by taking into account instruction criticality. That is, for some criterion of criticality and for loads whose hit/miss behavior is hard to predict, we show that it is more cost-effective to stall dependents if the load is not predicted critical.
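The two policies described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the latency values, the names `Load`, `dependent_wakeup_cycles`, `needs_replay`, and `issue_dependents_speculatively`, and the choice to speculate on hard-to-predict critical loads are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Hypothetical latencies in cycles; the abstract gives no numbers.
L1_HIT_LATENCY = 4
BANK_CONFLICT_DELAY = 1


@dataclass
class Load:
    name: str
    issue_cycle: int


def dependent_wakeup_cycles(loads_this_cycle, schedule_shifting=True):
    """Return the cycle at which the dependents of each load are woken up.

    With schedule shifting, the second load issued in a cycle is always
    assumed to be delayed by a bank conflict, so its dependents are
    scheduled with the corresponding extra delay.
    """
    wakeups = {}
    for slot, load in enumerate(loads_this_cycle):
        delay = L1_HIT_LATENCY
        if schedule_shifting and slot == 1:
            delay += BANK_CONFLICT_DELAY
        wakeups[load.name] = load.issue_cycle + delay
    return wakeups


def needs_replay(predicted_ready, actual_ready):
    """A dependent is replayed only if it was scheduled before its source is ready."""
    return predicted_ready < actual_ready


def issue_dependents_speculatively(predicted_hit, confident, critical):
    """Decide whether to issue a load's dependents before hit/miss is known.

    Confident predictions are followed as-is. For hard-to-predict loads,
    dependents are stalled unless the load is predicted critical
    (speculating in the critical case is an assumption of this sketch).
    """
    if confident:
        return predicted_hit
    return critical


if __name__ == "__main__":
    pair = [Load("ld_a", 10), Load("ld_b", 10)]
    print(dependent_wakeup_cycles(pair))
    print(dependent_wakeup_cycles(pair, schedule_shifting=False))
```

Under these assumed latencies, if the second load really does suffer a one-cycle bank conflict, its dependents arrive on time with schedule shifting and would be replayed without it. The cost is one cycle of added latency for those dependents when no conflict occurs.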
Pages: 247-259
Page count: 13