A programmable co-processor for profiling

被引:16
|
作者
Zilles, CB [1 ]
Sohi, GS [1 ]
机构
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
关键词
D O I
10.1109/HPCA.2001.903267
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Aggressive program optimization requires accurate profile information, but such accuracy requires many samples to be collected. We explore a novel profiling architecture that reduces the overhead of collecting each sample by including a programmable co-processor that analyzes a stream of profile samples generated by a microprocessor From this stream of samples, the co-processor can detect correlations between instructions (e.g., memory dependence profiling) as well as those between different dynamic instances of the same instruction (e.g., value profiling). The profiler's programmable nature allows a broad range of data to be extracted, post-processed and formatted as well as provides the flexibility to tailor the profiling application to the program under test. Because the co-processor is specialized for profiling, it can execute profiling applications more efficiently than a general-purpose processor The co-processor should not significantly impact the cost or performance of the main processor because it can be implemented using a small number of transistors at the chip's periphery We demonstrate the proposed design through a detailed evaluation of load value profiling. Our implementation quickly and accurately estimates the value invariance of loads, with rime overhead roughly proportional to the size of the instruction working set of the program. This algorithm demonstrates a number of general techniques for profiling, including: estimating the completeness of a profile, a means to focus profiling on particular instructions management of profiling resources.
引用
收藏
页码:241 / 252
页数:12
相关论文
共 50 条
  • [21] A Unified Co-Processor Architecture for Matrix Decomposition
    窦勇
    周杰
    邬贵明
    姜晶菲
    雷元武
    倪时策
    Journal of Computer Science & Technology, 2010, 25 (04) : 874 - 885
  • [22] A Unified Co-Processor Architecture for Matrix Decomposition
    Yong Dou
    Jie Zhou
    Gui-Ming Wu
    Jing-Fei Jiang
    Yuan-Wu Lei
    Shi-Ce Ni
    Journal of Computer Science and Technology, 2010, 25 : 874 - 885
  • [23] Simulating and Estimating the Behavior of a Neuromorphic Co-Processor
    Schuman, Catherine D.
    Pooser, Raphael
    Mintz, Tiffany
    Adnan, Md Musabbir
    Rose, Garrett S.
    Ku, Bon Woong
    Lim, Sung Kyu
    PROCEEDINGS OF 2ND INTERNATIONAL WORKSHOP ON POST MOORE'S ERA SUPERCOMPUTING (PMES 2017), 2017, : 8 - 14
  • [24] A pattern matching co-processor for network security
    Cho, YH
    Mangione-Smith, WH
    42ND DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2005, 2005, : 234 - 239
  • [25] A Multi-Algorithm Cryptographic Co-Processor
    Fronte, Daniele
    Perez, Annie
    Payrat, Eric
    IMETI 2008: INTERNATIONAL MULTI-CONFERENCE ON ENGINEERING AND TECHNOLOGICAL INNOVATION, VOL II, PROCEEDINGS, 2008, : 102 - +
  • [26] FPGA CO-PROCESSOR TARGETS EMBEDDED CONTROL
    GALLANT, J
    EDN, 1995, 40 (20) : 24 - 24
  • [27] Efficient A* Co-processor for Reconfigurable Gaming Devices
    Nery, Alexandre S.
    Sena, Alexandre C.
    2018 17TH BRAZILIAN SYMPOSIUM ON COMPUTER GAMES AND DIGITAL ENTERTAINMENT (SBGAMES 2018), 2018, : 97 - 106
  • [28] An FPGA Co-Processor Implementation of Homomorphic Encryption
    Cousins, David Bruce
    Golusky, John
    Rohloff, Kurt
    Sumorok, Daniel
    2014 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2014,
  • [29] Configurable CNN SoC Co-Processor Architecture
    Wijaya, Joshua Adiel
    Adiono, Trio
    2019 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2019, : 281 - 282
  • [30] Architecture for an advanced Java']Java co-processor
    Säntti, T
    Plosila, J
    ISSCS 2005: International Symposium on Signals, Circuits and Systems, Vols 1 and 2, Proceedings, 2005, : 501 - 504