Improving latency tolerance of network processors through simultaneous multithreading

被引:0
|
作者
Bo, L [1 ]
Hong, A
Fang, L
Rui, G
机构
[1] Univ Sci & Technol China, Dept Comp Sci & Technol, Hefei 230026, Peoples R China
[2] Chinese Acad Sci, Comp Technol Inst, Comp Architecture Lab, Beijing 100086, Peoples R China
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Existing multithreaded network processors architecture with multiple processing engines (PEs), aims at taking advantage of blocked multithreading technique which executes instructions of different user-defined threads in the same PE pipeline, in explicit and interleave way. Multiple PEs, each of which is a multithreaded processor core, process several packets in parallel to hide long memory access latency. Most of them are optimized for throughputs mostly in data-plane. In future network workloads, the boundaries between data-plane and control-plane become blurred, so that PEs are demanded not only wire speed packet forwarding on data-plane, but also highly intelligent and increased complex packet processing function on control-plane. In this paper, we analyze SMT's short latency tolerance potential when used in out-of-order and dynamic scheduling PE cores. We show in this paper that 2-4 issue SMT provides an excellent short memory and branch latency tolerance, which gain higher instructions throughout as well as much simpler structures.
引用
收藏
页码:61 / 70
页数:10
相关论文
共 50 条
  • [1] Improving latency tolerance of multithreading through decoupling
    Parcerisa, JM
    González, A
    IEEE TRANSACTIONS ON COMPUTERS, 2001, 50 (10) : 1084 - 1094
  • [2] Network Applications on Simultaneous Multithreading Processors
    Yi, Kyueun
    Gaudiot, Jean-Luc
    IEEE TRANSACTIONS ON COMPUTERS, 2010, 59 (09) : 1200 - 1209
  • [3] Simultaneous multithreading trace processors: Improving trace processors performance
    Wang, KF
    Ji, ZZ
    Hu, MZ
    MICROPROCESSORS AND MICROSYSTEMS, 2006, 30 (02) : 102 - 116
  • [4] Tolerating memory latency through software-controlled pre-execution in simultaneous multithreading processors
    Luk, CK
    28TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2001, : 40 - 51
  • [5] Latency Hiding through Multithreading on a Network Processor
    Guo, Xiaofeng
    Dai, Jinquan
    Li, Long
    Lv, Zhiyuan
    Chandra, Prashant R.
    PROCEEDINGS OF THE 2007 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING PPOPP'07, 2007, : 130 - 131
  • [6] A Study of Improving Fetch and Execute Engine for Simultaneous Multithreading Processors
    Yang, Shih-Hung
    Shieh, Jong-Jiann
    WMSCI 2008: 12TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL I, PROCEEDINGS, 2008, : 79 - 84
  • [7] Simultaneous multithreading trace processors
    Wang, KF
    Ji, ZZ
    Hu, MZ
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS, 2003, 2834 : 96 - 103
  • [8] On the Power Management of Simultaneous Multithreading Processors
    Youssef, Ahmed
    Zahran, Mohamed
    Anis, Mohab
    Elmasry, Mohamed
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2010, 18 (08) : 1243 - 1248
  • [9] Improving pipelined soft processors with multithreading
    Labrecque, Martin
    Steffan, J. Gregory
    2007 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 2007, : 210 - 215
  • [10] Cray MTA: Multithreading for latency tolerance
    Smith, B
    COMPUTER, 2001, 34 (04) : 69 - 69