Accelerating Multi-core Processor Design Space Evaluation Using Automatic Multi-threaded Workload Synthesis

被引:0
|
作者
Hughes, Clay [1 ]
Li, Tao [1 ]
机构
[1] Univ Florida, Dept Elect & Comp Engn, IDEAL, Gainesville, FL 32611 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The design and evaluation of microprocessor architectures is a difficult and time-consuming task. Although small, hand-coded microbenchmarks can be used to accelerate performance evaluation, these programs lack the complexity to stress increasingly complex architecture designs. Larger and more complex real-world workloads should be employed to measure the performance of a given design or to evaluate the efficiency of various design alternatives. These applications can take days or weeks if run to completion on a detailed architecture simulator. In the past, researchers have applied machine learning and statistical sampling methods to reduce the average number of instructions required for detailed simulation. Others have proposed statistical simulation and workload synthesis techniques, which can produce programs that emulate the execution characteristics of the application from which they are derived but have a much shorter execution period than the original. However, these existing methods are difficult to apply to multi-threaded programs and can result in simplifications that miss the complex interactions between multiple, concurrently running threads. This study focuses on developing new techniques for accurate and effective multi-threaded workload synthesis, which can significantly accelerate architecture design evaluation of multi-core processors. We propose to construct synchronized statistical flow graphs that incorporate inter-thread synchronization and sharing behavior to capture the complex characteristics and interactions of multiple threads. Moreover, we develop thread-aware data reference models and wavelet-based branching models to generate accurate memory access and dynamic branch statistics. Experimental results show that a framework integrated with the aforementioned models can automatically generate synthetic programs that maintain characteristics of original workloads but have significantly reduced runtime.
引用
收藏
页码:151 / 160
页数:10
相关论文
共 50 条
  • [1] Performance evaluation of evolutionary multi-core and aggressively multi-threaded processor architectures
    Tirumalai, Partha
    Song, Yonghong
    Kalogeropulos, Spiros
    [J]. ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2007, 4697 : 280 - +
  • [2] Securing Multi-core Multi-threaded Packet Processors
    Chasaki, Danai
    [J]. PROCEEDINGS OF THE EIGHTH ACM/IEEE SYMPOSIUM ON ARCHITECTURES FOR NETWORKING AND COMMUNICATIONS SYSTEMS (ANCS'12), 2012, : 149 - 150
  • [3] Performance Evaluation of Multi-Core, Multi-Threaded SIP Proxy Servers (SPS)
    Krishnamurthy, Ramesh
    Rouskas, George N.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [4] Development of a simultaneously threaded multi-core processor
    Zaghloul, SS
    Mudawar, M
    Darwish, MG
    [J]. ENABLING TECHNOLOGIES FOR THE NEW KNOWLEDGE SOCIETY, 2005, : 913 - 927
  • [5] Lightweight multi-threaded network processor core in FPGA
    Buciak, Piotr
    Botwicz, Jakub
    [J]. PROCEEDINGS OF THE 2007 IEEE WORKSHOP ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, 2007, : 125 - +
  • [6] ANALYSIS OF MULTI-THREADED CODE EXECUTION ON SMALL MULTI-CORE ARCHITECTURES
    Sgroi, Kevin J.
    Spetka, Scott E.
    [J]. PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, DETC 2010, VOL 3, A AND B, 2010, : 807 - 814
  • [7] Design of Multi-threaded Processor's Pause Mechanism
    Ma, Pei-Jun
    Liu, Meng
    Peng, Yu-Jia
    Li, Kang
    Shi, Jiang-Yi
    [J]. 2011 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND CONTROL (ICECC), 2011, : 1416 - 1419
  • [8] Dynamic Partition of Shared Cache for Multi-Threaded Application in Multi-Core System
    Li, Shuo
    Wu, Feng
    [J]. ADVANCED MEASUREMENT AND TEST, PARTS 1 AND 2, 2010, 439-440 : 1587 - +
  • [9] Enhancing an open source resource manager with multi-core/multi-threaded support
    Balle, Susanne M.
    Palermo, Daniel J.
    [J]. JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2008, 4942 : 37 - 50
  • [10] Remarn: A Reconfigurable Multi-threaded Multi-core Accelerator for Recurrent Neural Networks
    Que, Zhiqiang
    Nakahara, Hiroki
    Fan, Hongxiang
    Li, He
    Meng, Jiuxi
    Tsoi, Kuen Hung
    Niu, Xinyu
    Nurvitadhi, Eriko
    Luk, Wayne
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (01)