Automatic measurement of instruction cache capacity

被引:4
|
作者
Yotov, Kamen [1 ]
Jackson, Sandra [1 ]
Steele, Tyler [1 ]
Pingali, Keshav [1 ]
Stodghill, Paul [1 ]
机构
[1] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
关键词
D O I
10.1007/978-3-540-69330-7_16
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There is growing interest in autonomic computing systems that can optimize their own behavior on different platforms without manual intervention. Examples of successful self-optimizing systems are ATLAS, which generates Basic Linear Algebra Subroutine (BLAS) Libraries, and FFTW, which generates FFT libraries. Self-optimizing systems may need the values of hardware parameters such as the number of registers of various types and the capacities of caches at various levels. For example, ATLAS uses the capacity of the L1 cache and the number of registers in determining the size of cache tiles and register tiles. We have built a system called X-Ray(1), which uses micro-benchmarks to measure such parameter values automatically. The micro-benchmarks currently implemented in X-Ray can determine the latency of various instructions, the existence of important instructions like fused multiply-add, the number of registers of various kinds, and parameters of the memory hierarchy. In this paper, we discuss how X-Ray determines the capacity of the instruction cache (I-cache), which is needed for important optimizations such as loop unrolling. We present the micro-benchmark used in X-Ray to measure I-cache capacity, the experimental methodology used to obtain accurate estimates, and experimental results on a large number of current platforms.
引用
收藏
页码:230 / +
页数:3
相关论文
共 50 条
  • [1] Way Stealing: Cache-assisted Automatic Instruction Set Extensions
    Kluter, Theo
    Brisk, Philip
    Ienne, Paolo
    Charbon, Edoardo
    DAC: 2009 46TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2009, : 31 - +
  • [2] AUTOMATIC MEASUREMENT OF CAPACITY BY FREQUENCY DISCRIMINATOR
    NAKONECH.YS
    PRIBORY I TEKHNIKA EKSPERIMENTA, 1973, (05): : 160 - 162
  • [3] AN AUTOMATIC APPARATUS FOR THE MEASUREMENT OF CRANIAL CAPACITY
    Bushkovitch, V. J.
    AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 1927, 10 (03) : 355 - 363
  • [4] Instruction trace compression for rapid instruction cache, simulation
    Janapsatya, Andhi
    Ignjatovic, Aleksandar
    Parameswaran, Sri
    Henkel, Joerg
    2007 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2007, : 803 - +
  • [5] Impact of Instruction Cache and Different Instruction Scratchpads on the WCET Estimate
    Metzlaff, Stefan
    Ungerer, Theo
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 1442 - 1449
  • [6] Instruction Cache Aging Mitigation Through Instruction Set Encoding
    Gebregiorgis, Anteneh
    Oboril, Fabian
    Tahoori, Mehdi B.
    Hamdioui, Said
    PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN ISQED 2016, 2016, : 325 - 330
  • [7] Instruction cache prefetching with extended BTB
    Chi, SA
    Shiu, RM
    Chiu, JC
    Chang, SE
    Chung, CP
    1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 360 - 365
  • [8] A QUEUE BASED INSTRUCTION CACHE MEMORY
    CORAOR, LD
    HULINA, PT
    MANNAI, DN
    CA-DSP 89, VOLS 1 AND 2: 1989 INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND DIGITAL SIGNAL PROCESSING, 1989, : 281 - 286
  • [9] Compositional static instruction cache simulation
    Patil, K
    Seth, K
    Mueller, F
    ACM SIGPLAN NOTICES, 2004, 39 (07) : 136 - 145
  • [10] Improving instruction cache performance in OLTP
    Harizopoulos, Stavros
    Ailamaki, Anastassia
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2006, 31 (03): : 887 - 920