A Principal Factor of Performance in Decoupled Front-End

被引:0
|
作者
Degawa, Yuya [1 ]
Koizumi, Toru [1 ,2 ]
Nakamura, Tomoki [1 ]
Shioya, Ryota [1 ]
Kadomoto, Junichiro [1 ]
Irie, Hidetsugu [1 ]
Sakai, Shuichi [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138656, Japan
[2] Nagoya Inst Technol, Dept Comp Sci & Engn, Nagoya 4668555, Japan
关键词
instruction fetch; modeling techniques;
D O I
10.1587/transinf.2023PAP0011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the performance bottlenecks of a processor is the front-end that supplies instructions. Various techniques, such as cache re-placement algorithms and hardware prefetching, have been investigated to facilitate smooth instruction supply at the front-end and to improve processor performance. In these approaches, one of the most important factors has been the reduction in the number of instruction cache misses. By using the number of instruction cache misses or derived factors, previous studies have explained the performance improvements achieved by their proposed methods. However, we found that the number of instruction cache misses does not always explain performance changes well in modern processors. This is because the front-end in modern processors handles subsequent in-struction cache misses in overlap with earlier ones. Based on this observa-tion, we propose a novel factor: the number of miss regions. We define a region as a sequence of instructions from one branch misprediction to the next, while we define a miss region as a region that contains one or more instruction cache misses. At the boundary of each region, the pipeline is flushed owing to a branch misprediction. Thus, cache misses after this boundary are not handled in overlap with cache misses before the bound-ary. As a result, the number of miss regions is equal to the number of cache misses that are processed without overlap. In this paper, we demonstrate that the number of miss regions can well explain the variation in perfor-mance through mathematical models and simulation results. The results show that the model explains cycles per instruction with an average error of 1.0% and maximum error of 4.1% when applying an existing prefetcher to the instruction cache. The idea of miss regions highlights that instruction cache misses and branch mispredictions interact with each other in processors with a decoupled front-end. We hope that considering this interaction will motivate the development of fast performance estimation methods and new microarchitectural methods.
引用
收藏
页码:1960 / 1968
页数:9
相关论文
共 50 条
  • [41] RPC performance versus front-end electronics and detector parameters
    Cardarelli, R.
    Aielli, G.
    Camelia, E. Alunno
    Bruno, S.
    Caltabiano, A.
    Camarri, P.
    Di Ciaccio, A.
    Liberti, B.
    Massa, L.
    Pizzimento, L.
    Rocchi, A.
    JOURNAL OF INSTRUMENTATION, 2019, 14
  • [42] CFD predictions of minivan's front-end flow performance
    Wang, Dong
    Yang, Zhigang
    PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON VEHICULAR ELECTRONICS AND SAFETY, 2006, : 152 - +
  • [43] RF Front-End for SEAMS
    Kulkarni, Atharva
    Pingale, Sunil
    Gharpure, Damayanti
    Ananthakrishnan, Subramaniam
    2019 URSI ASIA-PACIFIC RADIO SCIENCE CONFERENCE (AP-RASC), 2019,
  • [44] FRONT-END AUTOMATION CATCHES ON
    WALLER, L
    LINEBACK, JR
    ELECTRONICSWEEK, 1985, 58 (11): : 20 - 20
  • [45] Front-end process simulation
    Rafferty, CS
    SOLID-STATE ELECTRONICS, 2000, 44 (05) : 863 - 868
  • [46] Front-end software proliferates
    Tenopir, C
    LIBRARY JOURNAL, 1996, 121 (08) : 29 - 30
  • [47] Influence of Front-End Electronics on Metrological Performance of QCM Systems
    Fort, Ada
    Landi, Elia
    Moretti, Riccardo
    Mugnaini, Marco
    Liguori, Consolatina
    Paciello, Vincenzo
    Dello Iacono, Salvatore
    SENSORS, 2024, 24 (11)
  • [48] Performance of a front-end preamplifier with a digital output for semiconductor detectors
    Univ of Surrey, Guildford, United Kingdom
    IEEE Nucl Sci Symp Med Imaging Conf, (480-483):
  • [49] POLAR Front-End Electronics: Concept, Performance and Qualification Tests
    Marcinkowski, Radoslaw M.
    Hajdas, Wojtek
    Rybka, Dominik
    Britvitch, Ilia
    Rodriguez, Ismael Traseira
    Gauvin, Neal
    Produit, Nicolas
    Rapin, Divic
    Pohl, Martin
    Orsi, Silvio
    Lechanoine-Leluc, Catherine
    Paniccia, Mercedes
    Batsch, Tadeusz
    Rutczynska, Aleksandra
    Szabelski, Jacek
    Zwolinska, Anna
    Krakowski, Tomasz
    Bao, Tianwei
    Chai, Junying
    Dong, Yongwei
    Kong, Minnan
    Li, Lu
    Liu, Jiangtao
    Liu, Xin
    Shi, Haoli
    Sun, Jianchao
    Wang, Ruijie
    Wen, Xing
    Wu, Bobing
    Xiao, Hualin
    Xu, Hanhui
    Zhang, Li
    Zhang, Laiyu
    Zhang, Shuangnan
    Zhang, Yongjie
    2013 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2013,
  • [50] Latency Performance Evaluation of RF Front-End Transceiver Architecture
    Hussain, Intikhab
    Dyab, Walid
    Sakr, Ahmed A.
    Wu, Ke
    2019 49TH EUROPEAN MICROWAVE CONFERENCE (EUMC), 2019, : 750 - 753