Boosting SMT trace processors performance with data cache miss sensitive thread scheduling mechanism

被引:1
|
作者
Wang, Kai-feng [1 ]
Ji, Zhen-zhou [1 ]
Hu, Ming-zeng [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
关键词
SMT trace processors; thread schedule; data cache miss;
D O I
10.1016/j.micpro.2005.12.008
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The penalty associated with data cache misses is one of the obstacles to the performance of SMT trace processors. The increased latency is not only required to resolve the missing data, the miss will also have negative impact on the PE resources utilization rate of the SMT trace processors. When data cache miss occurs in SMT trace processors, all the completed traces following the data-miss-trace (a trace with at least one data cache miss) will be delayed to commit for the data cache miss event. PE resources occupied by those traces can not be released until traces are committed, which wastes the PE execution resources and hampers the performance of SMT trace processors. In this paper, we propose several data cache miss sensitive thread scheduling mechanisms with the aim to tolerate the penalties of data cache misses. By choosing the thread wisely in trace dispatch and trace commit stages, the SMT trace processors performance can be improved further. Simulation results show that when using L1-L2 thread scheduling mechanism, performance will be improved by 2.8% (2-thread SMT trace processors), 8.0% (4-thread SMT trace processors) and 18.2% (8-thread SMT trace processors) with 8-PE. 512 KB L2 cache configuration. (C) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:225 / 233
页数:9
相关论文
共 26 条
  • [1] Code and data transformations for improving shared cache performance on SMT processors
    Nikolopoulos, DS
    [J]. HIGH PERFORMANCE COMPUTING, 2003, 2858 : 54 - 69
  • [2] Thread Isolation to Improve Symbiotic Scheduling on SMT Multicore Processors
    Feliu, Josue
    Sahuquillo, Julio
    Petit, Salvador
    Eeckhout, Lieven
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (02) : 359 - 373
  • [3] Trace cache miss tolerance for deeply pipelined superscalar processors
    Reinman, G.
    Pitigoi-Aron, G.
    [J]. IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 2006, 153 (05): : 355 - 361
  • [4] CACHE-CONSCIOUS THREAD SCHEDULING FOR MASSIVELY MULTITHREADED PROCESSORS
    Rogers, Timothy G.
    O'Connor, Mike
    Aamodt, Tor M.
    [J]. IEEE MICRO, 2013, 33 (03) : 78 - 85
  • [5] CPU Scheduling for Power/Energy Management on Multicore Processors Using Cache Miss and Context Switch Data
    Datta, Ajoy K.
    Patel, Rajesh
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (05) : 1190 - 1199
  • [6] Using instruction fetch policy to control performance of a thread in SMT processors
    School of Computer Science, National University of Defense Technology, Changsha 410073, China
    [J]. Jisuanji Xuebao, 2008, 2 (309-317):
  • [7] THREAD PRIORITIZATION - A THREAD SCHEDULING MECHANISM FOR MULTIPLE-CONTEXT PARALLEL PROCESSORS
    FISKE, S
    DALLY, WJ
    [J]. FUTURE GENERATION COMPUTER SYSTEMS, 1995, 11 (06) : 503 - 518
  • [8] A trace driven study of the effectiveness of cache mechanism for network processors
    Liu, Z
    Che, H
    Zheng, K
    Chen, SZ
    Liu, B
    [J]. 2005 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING PROCEEDINGS, VOLS 1 AND 2, 2005, : 978 - 981
  • [9] Transparent threads: Resource sharing in SMT processors for high single-thread performance
    Dorai, GK
    Yeung, D
    [J]. 2002 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2002, : 30 - 41
  • [10] Controlling performance of a time-critical thread in SMT processors by instruction fetch policy
    Sun, Caixia
    Tang, Hongwei
    Zhang, Minxuan
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2006, : 217 - +