A programmable co-processor for profiling

被引:16
|
作者
Zilles, CB [1 ]
Sohi, GS [1 ]
机构
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
关键词
D O I
10.1109/HPCA.2001.903267
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Aggressive program optimization requires accurate profile information, but such accuracy requires many samples to be collected. We explore a novel profiling architecture that reduces the overhead of collecting each sample by including a programmable co-processor that analyzes a stream of profile samples generated by a microprocessor From this stream of samples, the co-processor can detect correlations between instructions (e.g., memory dependence profiling) as well as those between different dynamic instances of the same instruction (e.g., value profiling). The profiler's programmable nature allows a broad range of data to be extracted, post-processed and formatted as well as provides the flexibility to tailor the profiling application to the program under test. Because the co-processor is specialized for profiling, it can execute profiling applications more efficiently than a general-purpose processor The co-processor should not significantly impact the cost or performance of the main processor because it can be implemented using a small number of transistors at the chip's periphery We demonstrate the proposed design through a detailed evaluation of load value profiling. Our implementation quickly and accurately estimates the value invariance of loads, with rime overhead roughly proportional to the size of the instruction working set of the program. This algorithm demonstrates a number of general techniques for profiling, including: estimating the completeness of a profile, a means to focus profiling on particular instructions management of profiling resources.
引用
下载
收藏
页码:241 / 252
页数:12
相关论文
共 50 条
  • [1] Architecture of a programmable FIR filter co-processor
    Gay-Bellile, O
    Dujardin, E
    ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : D433 - D436
  • [2] A universal and efficient AES co-processor for field programmable logic arrays
    Pramstaller, N
    Wolkerstorfer, J
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2004, 3203 : 565 - 574
  • [3] Implementation of a Clifford algebra co-processor design on a field programmable gate array
    Perwass, C
    Gebken, C
    Sommer, G
    CLIFFORD ALGEBRAS: APPLICATIONS TO MATHEMATICS, PHYSICS, AND ENGINEERING, 2004, 34 : 561 - 575
  • [4] A DSP co-processor for the ARM RISC processor
    Walsh, D
    ELECTRONIC ENGINEERING, 1997, 69 (842): : 43 - +
  • [5] Implementation of a cryptographic co-processor
    Kakarountas, A. P.
    Michail, H.
    PROCEEDINGS OF THE 6TH WSEAS INTERNATIONAL CONFERENCE ON INFORMATION SECURITY AND PRIVACY (ISP '07): ADVANCED TOPICS IN INFORMATION SECURITY AND PRIVACY, 2007, : 160 - 165
  • [6] Data manipulation co-processor
    Labib, GA
    ELECTRONICS WORLD, 2005, 111 (1829): : 20 - 26
  • [7] Implementation of a field programmable logic based co-processor for the acceleration of discrete event simulators
    Touhafi, A.
    Brissinck, W.
    Dirkx, E.F.
    Lecture Notes in Computer Science, 1996, 1142
  • [8] A GRAPHICS CO-PROCESSOR AND ITS DISPLAY PROCESSOR ICS
    QUEINNEC, O
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1987, 33 (04) : 551 - 556
  • [9] Reconfigurable co-processor for software acceleration
    Electronic Product Design, 1997, 18 (11):
  • [10] A design of hardware cryptographic co-processor
    Fan, MY
    Wang, JH
    Wang, GW
    IEEE SYSTEMS, MAN AND CYBERNETICS SOCIETY INFORMATION ASSURANCE WORKSHOP, 2003, : 234 - 236