Instruction Fusion for Multiscalar and Many-Core Processors

被引:1
|
作者
Lu, Yaojie [1 ]
Ziavras, Sotirios G. [1 ]
机构
[1] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
关键词
Instruction fusion; Superscalar; Benchmarking; Many-core processor;
D O I
10.1007/s10766-015-0386-1
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The utilization wall, caused by the breakdown of threshold voltage scaling, hinders performance gains for new generation microprocessors. We propose an instruction fusion technique for multiscalar and many-core processors to alleviate its impact. With instruction fusion, similar copies of an instruction to be run on multiple pipelines or cores are merged into a single copy for simultaneous execution. Instruction fusion applied to vector code enables the processor to idle early pipeline stages and instruction caches at various times during program implementation with minimum performance degradation, while reducing program size and the required instruction memory bandwidth. Instruction fusion is applied here to a MIPS-based dual-core that resembles an ideal multiscalar of degree two. Benchmarking using an FPGA prototype shows a 6-11 % reduction in the dynamic power dissipation for the targeted applications as well as a 17-45 % decrease in code size with frequent performance improvements due to higher instruction cache hit rates.
引用
收藏
页码:67 / 78
页数:12
相关论文
共 50 条
  • [1] Instruction Fusion for Multiscalar and Many-Core Processors
    Yaojie Lu
    Sotirios G. Ziavras
    International Journal of Parallel Programming, 2017, 45 : 67 - 78
  • [2] Economic models for many-core processors
    Kumar, Rakesh
    DR DOBBS JOURNAL, 2008, 33 (03): : 10 - 10
  • [3] Memory Access and Communication Fusion Compiler Optimization for Sunway Many-core Processors
    Fang Y.-F.
    Li Y.-B.
    Dong E.-M.
    Wang Y.-F.
    Liu Q.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (06): : 2648 - 2667
  • [4] Federated Scheduling in Clustered Many-core Processors
    Koike, Ryotaro
    Azumi, Takuya
    PROCEEDINGS OF THE 2021 IEEE/ACM 25TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT 2021), 2021,
  • [5] Fast Data Delivery for Many-Core Processors
    Bakhshalipour, Mohammad
    Lotfi-Kamran, Pejman
    Mazloumi, Abbas
    Samandi, Farid
    Naderan-Tahan, Mahmood
    Modarressi, Mehdi
    Sarbazi-Azad, Hamid
    IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (10) : 1416 - 1429
  • [6] Emerging Applications for Multi/Many-Core Processors
    Lee, Victor W.
    Chen, Yen-Kuang
    Debuy, Pradeep
    2011 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2011, : 1524 - 1527
  • [7] Efficient Fault Simulation on Many-Core Processors
    Kochte, Michael A.
    Schaal, Marcel
    Wunderlich, Hans-Joachim
    Zoellin, Christian G.
    PROCEEDINGS OF THE 47TH DESIGN AUTOMATION CONFERENCE, 2010, : 380 - 385
  • [8] Many-core processors and GPU opportunities in Particle Detectors
    Neufeld, Niko
    Vilasis-Cardona, Xavier
    2012 13TH INTERNATIONAL WORKSHOP ON CELLULAR NANOSCALE NETWORKS AND THEIR APPLICATIONS (CNNA), 2012,
  • [9] Queuing Ports for Mesh Based Many-Core Processors
    Villaescusa D.G.
    Rivas M.A.
    Harbour M.G.
    Ada User Journal, 2021, 42 (3-4): : 189 - 192
  • [10] A Study of an Infrastructure for Research and Development of Many-Core Processors
    Uehara, Koh
    Sato, Shimpei
    Miyoshi, Takefumi
    Kise, Kenji
    2009 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT 2009), 2009, : 414 - 419