A Principal Factor of Performance in Decoupled Front-End

被引:0
|
作者
Degawa, Yuya [1 ]
Koizumi, Toru [1 ,2 ]
Nakamura, Tomoki [1 ]
Shioya, Ryota [1 ]
Kadomoto, Junichiro [1 ]
Irie, Hidetsugu [1 ]
Sakai, Shuichi [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138656, Japan
[2] Nagoya Inst Technol, Dept Comp Sci & Engn, Nagoya 4668555, Japan
关键词
instruction fetch; modeling techniques;
D O I
10.1587/transinf.2023PAP0011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the performance bottlenecks of a processor is the front-end that supplies instructions. Various techniques, such as cache re-placement algorithms and hardware prefetching, have been investigated to facilitate smooth instruction supply at the front-end and to improve processor performance. In these approaches, one of the most important factors has been the reduction in the number of instruction cache misses. By using the number of instruction cache misses or derived factors, previous studies have explained the performance improvements achieved by their proposed methods. However, we found that the number of instruction cache misses does not always explain performance changes well in modern processors. This is because the front-end in modern processors handles subsequent in-struction cache misses in overlap with earlier ones. Based on this observa-tion, we propose a novel factor: the number of miss regions. We define a region as a sequence of instructions from one branch misprediction to the next, while we define a miss region as a region that contains one or more instruction cache misses. At the boundary of each region, the pipeline is flushed owing to a branch misprediction. Thus, cache misses after this boundary are not handled in overlap with cache misses before the bound-ary. As a result, the number of miss regions is equal to the number of cache misses that are processed without overlap. In this paper, we demonstrate that the number of miss regions can well explain the variation in perfor-mance through mathematical models and simulation results. The results show that the model explains cycles per instruction with an average error of 1.0% and maximum error of 4.1% when applying an existing prefetcher to the instruction cache. The idea of miss regions highlights that instruction cache misses and branch mispredictions interact with each other in processors with a decoupled front-end. We hope that considering this interaction will motivate the development of fast performance estimation methods and new microarchitectural methods.
引用
收藏
页码:1960 / 1968
页数:9
相关论文
共 50 条
  • [21] The Design of High Performance RF Front-End Circuit
    Wei Liancheng
    Song Qinge
    2018 INTERNATIONAL CONFERENCE ON MICROWAVE AND MILLIMETER WAVE TECHNOLOGY (ICMMT2018), 2018,
  • [22] Performance of the front-end electronics of the ANTARES neutrino telescope
    Aguilar, J. A.
    Al Samarai, I.
    Albert, A.
    Anghinolfi, M.
    Anton, G.
    Anvar, S.
    Ardid, M.
    Jesus, A. C. Assis
    Astraatmadja, T.
    Aubert, J-J
    Auer, R.
    Baret, B.
    Basa, S.
    Bazzotti, M.
    Bertin, V.
    Biagi, S.
    Bigongiari, C.
    Bou-Cabo, M.
    Bouwhuis, M. C.
    Brown, A.
    Brunner, J.
    Busto, J.
    Camarena, F.
    Capone, A.
    Caponetto, L.
    Carloganu, C.
    Carminati, G.
    Carr, J.
    Castorina, E.
    Cavasinni, V.
    Cecchini, S.
    Chaleil, Th
    Charvis
    Chiarusi, T.
    Sen, N. Chon
    Circella, M.
    Costantini, H.
    Cottini, N.
    Coyle, P.
    Curtil, C.
    De Bonis, G.
    de Botton, N.
    Dekeyser, I.
    Delagnes, E.
    Deschamps, A.
    Distefano, C.
    Donzaud, C.
    Dornic, D.
    Drouhin, D.
    Druillole, F.
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2010, 622 (01): : 59 - 73
  • [23] Front-end automation
    Williamson, Ian
    EP Electronic Production (London), 1990, 19 (02):
  • [24] High Performance Analog Front-End for Digital Spectroscopy
    Abba, Andrea
    Manenti, Antonio
    Suardi, Andrea
    Riboldi, Stefano
    Geraci, Angelo
    2009 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-5, 2009, : 331 - +
  • [25] NIRPS Front-End: Design, performance, and lessons learned
    Blind, N.
    Conod, U.
    de Meideros, A.
    Wildi, F.
    Bouchy, F.
    Bovay, S.
    Brousseau, D.
    Cabral, A.
    Genolet, L.
    Kolb, J.
    Schnell, R.
    Segovia, A.
    Sordet, M.
    Thibault, S.
    Wehbe, B.
    Zins, G.
    GROUND-BASED AND AIRBORNE INSTRUMENTATION FOR ASTRONOMY IX, 2022, 12184
  • [26] The Impact of Front-End Innovation Activities on Product Performance
    Markham, Stephen K.
    JOURNAL OF PRODUCT INNOVATION MANAGEMENT, 2013, 30 : 77 - 92
  • [27] Design and performance of the ALICE TRD front-end electronics
    Angelov, Venelin
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2006, 563 (02): : 317 - 320
  • [28] High Performance Analog Front-End for Digital Spectroscopy
    Abba, Andrea
    Manenti, Antonio
    Caponio, Francesco
    Geraci, Angelo
    IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2010, 57 (04) : 2173 - 2177
  • [29] Timing performance of the Timepix4 front-end
    Heijhoff, K.
    Akiba, K.
    Ballabriga, R.
    van Beuzekom, M.
    Campbell, M.
    Colijn, A. P.
    Fransen, M.
    Geertsema, R.
    Gromov, V
    Cudie, X. Llopart
    JOURNAL OF INSTRUMENTATION, 2022, 17 (07)
  • [30] PERFORMANCE OF A LHC FRONT-END RUNNING AT 67 MHZ
    BRENNER, R
    KAPLON, J
    VONDERLIPPE, H
    NYGARD, E
    ROE, S
    WEILHAMMER, P
    YOSHIOKA, K
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 1994, 339 (03): : 477 - 484