A programmable co-processor for profiling

被引:16
|
作者
Zilles, CB [1 ]
Sohi, GS [1 ]
机构
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
关键词
D O I
10.1109/HPCA.2001.903267
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Aggressive program optimization requires accurate profile information, but such accuracy requires many samples to be collected. We explore a novel profiling architecture that reduces the overhead of collecting each sample by including a programmable co-processor that analyzes a stream of profile samples generated by a microprocessor From this stream of samples, the co-processor can detect correlations between instructions (e.g., memory dependence profiling) as well as those between different dynamic instances of the same instruction (e.g., value profiling). The profiler's programmable nature allows a broad range of data to be extracted, post-processed and formatted as well as provides the flexibility to tailor the profiling application to the program under test. Because the co-processor is specialized for profiling, it can execute profiling applications more efficiently than a general-purpose processor The co-processor should not significantly impact the cost or performance of the main processor because it can be implemented using a small number of transistors at the chip's periphery We demonstrate the proposed design through a detailed evaluation of load value profiling. Our implementation quickly and accurately estimates the value invariance of loads, with rime overhead roughly proportional to the size of the instruction working set of the program. This algorithm demonstrates a number of general techniques for profiling, including: estimating the completeness of a profile, a means to focus profiling on particular instructions management of profiling resources.
引用
收藏
页码:241 / 252
页数:12
相关论文
共 50 条
  • [41] Parallel Data Processing With Magnonic Holographic Co-Processor
    Balynsky, M.
    Gutierrez, D.
    Chiang, H.
    Khitun, A.
    Kozhevnikov, A.
    Khivintsev, Y.
    Dudko, G.
    Filimonov, Y.
    2016 IEEE INTERNATIONAL CONFERENCE ON REBOOTING COMPUTING (ICRC), 2016,
  • [42] Exploring the Vision Processing Unit as Co-processor for Inference
    Rivas-Gomez, Sergio
    Pena, Antonio J.
    Moloney, David
    Laure, Erwin
    Markidis, Stefano
    2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 589 - 598
  • [43] Communication scheme for an advanced Java']Java co-processor
    Säntti, T
    Plosila, J
    22ND NORCHIP CONFERENCE, PROCEEDINGS, 2004, : 229 - 232
  • [44] A Geometric Algebra Co-Processor for Color Edge Detection
    Mishra, Biswajit
    Wilson, Peter
    Wilcock, Reuben
    ELECTRONICS, 2015, 4 (01): : 94 - 117
  • [45] A Reconfigurable Co-processor for GMM-Based Classifier
    Wang, Wei
    Liang, Weiqian
    IEEE INTERNATIONAL SOC CONFERENCE, PROCEEDINGS, 2009, : 343 - +
  • [46] ML Based Co-processor Verification in SoC Environment
    Pruthvi, D.
    Srividya, P.
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2023, 44 (10): : 131 - 142
  • [47] FPGA Implementation of AES Co-processor in Counter Mode
    Singh, Balwinder
    Kaur, Harpreet
    Monga, Himanshu
    INFORMATION PROCESSING AND MANAGEMENT, 2010, 70 : 491 - +
  • [48] Automatic instruction generation for application specific co-processor
    Sang, ST
    Li, XM
    Ye, YZ
    2005 6TH INTERNATIONAL CONFERENCE ON ASIC PROCEEDINGS, BOOKS 1 AND 2, 2005, : 854 - 857
  • [49] Hi-performance co-processor based on FPGA
    Gromilin, GI
    Devjatajkin, AM
    Lysakov, KF
    Shadrin, MJ
    Proceedings of the Second IASTED International Multi-Conference on Automation, Control, and Information Technology - Automation, Control, and Applications, 2005, : 89 - 92
  • [50] FLOATING POINT MICROPROCESSOR IMPLEMENTED AS OPTIONAL CO-PROCESSOR
    LOWENTHAL, R
    COMPUTER DESIGN, 1981, 20 (03): : 178 - &