Accelerating Multi-core Processor Design Space Evaluation Using Automatic Multi-threaded Workload Synthesis

Cited by: 0
Authors
Hughes, Clay [1]
Li, Tao [1]
Affiliations
[1] Univ Florida, Dept Elect & Comp Engn, IDEAL, Gainesville, FL 32611 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The design and evaluation of microprocessor architectures is a difficult and time-consuming task. Although small, hand-coded microbenchmarks can be used to accelerate performance evaluation, these programs lack the complexity to stress increasingly complex architecture designs. Larger and more complex real-world workloads should be employed to measure the performance of a given design or to evaluate the efficiency of various design alternatives. These applications can take days or weeks if run to completion on a detailed architecture simulator. In the past, researchers have applied machine learning and statistical sampling methods to reduce the average number of instructions required for detailed simulation. Others have proposed statistical simulation and workload synthesis techniques, which can produce programs that emulate the execution characteristics of the application from which they are derived but have a much shorter execution period than the original. However, these existing methods are difficult to apply to multi-threaded programs and can result in simplifications that miss the complex interactions between multiple, concurrently running threads. This study focuses on developing new techniques for accurate and effective multi-threaded workload synthesis, which can significantly accelerate architecture design evaluation of multi-core processors. We propose to construct synchronized statistical flow graphs that incorporate inter-thread synchronization and sharing behavior to capture the complex characteristics and interactions of multiple threads. Moreover, we develop thread-aware data reference models and wavelet-based branching models to generate accurate memory access and dynamic branch statistics. Experimental results show that a framework integrated with the aforementioned models can automatically generate synthetic programs that maintain characteristics of original workloads but have significantly reduced runtime.
Pages: 151-160
Number of pages: 10
Related papers
50 in total
  • [41] The Design and Algorithm Mapping of A Heterogeneous Multi-core Processor for SDR
    Fang, Xing
    Chen, Shuming
    [J]. 2008 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2008), VOLS 1-4, 2008, : 1086 - 1089
  • [42] Design and Evaluation of Multi-threaded Optimizations for Individual MPI I/O Operations
    Feki, Raafat
    Gabriel, Edgar
    [J]. 30TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2022), 2022, : 122 - 126
  • [43] Parallel Blob Extraction Using the Multi-core Cell Processor
    Kumar, Praveen
    Palaniappan, Kannappan
    Mittal, Ankush
    Seetharaman, Guna
    [J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2009, 5807 : 320 - +
  • [44] Optimal Workload Allocation for Performance Evaluation on Multi-core Automotive ECUs
    Paranjape, Sneha
    Pillai, Anju S.
    [J]. 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 685 - 690
  • [45] Accelerating Metric Space Similarity Joins with Multi-core and Many-core Processors
    Jin, Shichao
    Kim, Okhee
    Feng, Wenya
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2013, PT V, 2013, 7975 : 166 - 180
  • [46] Accelerating metric space similarity joins with multi-core and many-core processors
    Jin, Shichao
    Kim, Okhee
    Feng, Wenya
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2013, 7971 : 166 - 180
  • [47] Performance Evaluation of Inter-Processor Communication for an Embedded Heterogeneous Multi-Core Processor
    Tsao, Shiao-Li
    Lee, Sung-Yuan
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2012, 28 (03) : 537 - 554
  • [48] Power Channel Design and Verification for Automotive Chipset with Multi-Core Processor
    Chen, Nansen
    [J]. 2019 IEEE CPMT SYMPOSIUM JAPAN (ICSJ), 2019, : 227 - 232
  • [49] ASSESSING SATELLITE CONJUNCTIONS FOR THE ENTIRE SPACE CATALOG USING COTS MULTI-CORE PROCESSOR HARDWARE
    Coppola, Vincent T.
    Dupont, Sylvain
    Ring, Kevin
    Stoner, Frank
    [J]. ASTRODYNAMICS 2009, VOL 135, PTS 1-3, 2010, 135 : 1193 - 1205
  • [50] Design and Implementation of Scalable, Transparent Threads for Multi-Core Media Processor
    Kodaka, Takeshi
    Sasaki, Shunsuke
    Tokuyoshi, Takahiro
    Ohyama, Ryuichiro
    Nonogaki, Nobuhiro
    Kitayama, Koji
    Mori, Tatsuya
    Ueda, Yasuyuki
    Arakida, Hideho
    Okuda, Yuji
    Kizu, Toshiki
    Tsuboi, Yoshiro
    Matsumoto, Nobu
    [J]. DATE: 2009 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2009, : 1035 - 1039