Partially shared cache and adaptive replacement algorithm for NoC-based many-core systems

Cited by: 4
Authors
Yang, Pengfei [1]
Wang, Quan [1]
Ye, Hongwei [1]
Zhang, Zhiqiang [2]
Affiliations
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Shaanxi, Peoples R China
[2] Univ Leeds, Sch Elect & Elect Engn, Leeds, W Yorkshire, England
Keywords
Many-core system; NoC; Cache structure; Replacement algorithm; HIGH-PERFORMANCE; COMMUNICATION; REPLICATION; CAPACITY; POLICIES; LOCALITY; UTILITY
DOI
10.1016/j.sysarc.2019.05.002
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The Network-on-Chip (NoC) is a promising alternative to traditional bus-based architectures and has been widely applied to interconnect multi/many-core systems because of its scalable and modular design. Undoubtedly, the memory wall problem is one of the most important challenges; however, this problem can be somewhat alleviated by cache subsystems. In this paper, to overcome the high resource consumption and low data-sharing rate of the private cache scheme, we propose a partially shared cache structure and a corresponding replacement algorithm based on a mesh NoC. In this scheme, the L2 cache is shared by each group of four cores that are connected as a cluster to a given node by the local bus. To maximize the performance of this partially shared cache structure, we propose a core-aware re-reference interval prediction (CA-RRIP) replacement algorithm. The algorithm performs dynamic virtual partitioning on the partially shared cache; the core that initiated the cache access request is given top priority when a cache area needs to be replaced or inserted. This approach guarantees cache exclusivity and can mitigate interactions among cores that use different access patterns. We implement the traditional private, the proposed partially shared, and the row-shared cache subsystems in our experiments. The comparisons indicate that the overall system resource occupation can be reduced by 20% with the same number of cores, and that the instructions per cycle (IPC) of the system can increase by up to 49.2%. Moreover, the system throughput (STP) increases by an average of 5.89%. Our experimental results show that the proposed CA-RRIP algorithm also reduces the average cache miss rate of the system under various cache access patterns.
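The abstract does not spell out how CA-RRIP selects a victim, so the following is only a minimal sketch of one way a core-aware RRIP policy could behave on a single cache set, not the authors' implementation. It combines the well-known 2-bit static RRIP scheme (insert with a long re-reference prediction, promote to 0 on a hit, age lines until one reaches the maximum value) with an assumed core-aware rule: the requesting core's own lines are searched for a victim first, and other cores' lines are touched only if the requester owns none. The names `Line`, `owner`, `CoreAwareRRIPSet`, and `_pick_victim` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

RRPV_BITS = 2
RRPV_MAX = (1 << RRPV_BITS) - 1   # 3: predicted distant re-reference
RRPV_INSERT = RRPV_MAX - 1        # 2: "long" insertion value used by static RRIP


@dataclass
class Line:
    tag: int
    owner: int   # hypothetical field: the core that brought the line in
    rrpv: int    # re-reference prediction value


class CoreAwareRRIPSet:
    """One cache set shared by a cluster of cores (illustrative only)."""

    def __init__(self, ways: int) -> None:
        self.lines: List[Optional[Line]] = [None] * ways

    def access(self, core: int, tag: int) -> bool:
        """Return True on a hit; on a miss, fill the line and return False."""
        for line in self.lines:
            if line is not None and line.tag == tag:
                line.rrpv = 0   # hit promotion: predict near re-reference
                return True
        way = self._pick_victim(core)
        self.lines[way] = Line(tag, core, RRPV_INSERT)
        return False

    def _pick_victim(self, core: int) -> int:
        # Empty ways are filled before anything is evicted.
        for way, line in enumerate(self.lines):
            if line is None:
                return way
        # ASSUMED core-aware rule: look at the requester's own lines first,
        # and fall back to the whole set only if it owns none.
        own = [w for w, l in enumerate(self.lines) if l.owner == core]
        candidates = own if own else list(range(len(self.lines)))
        # RRIP victim search restricted to the candidates: evict a line whose
        # RRPV is RRPV_MAX, ageing the candidates until one qualifies.
        while True:
            for w in candidates:
                if self.lines[w].rrpv == RRPV_MAX:
                    return w
            for w in candidates:
                self.lines[w].rrpv += 1


if __name__ == "__main__":
    s = CoreAwareRRIPSet(ways=4)
    for core, tag in [(0, 1), (0, 2), (1, 3), (1, 4)]:
        s.access(core, tag)        # four cold misses fill the set
    s.access(1, 5)                 # core 1 misses and evicts one of its own lines
    print(s.access(0, 1), s.access(0, 2))   # core 0's lines are still resident
```

Under this assumed rule, a core with a streaming access pattern mostly recycles its own ways, which is one plausible reading of the "virtual partitioning" the abstract mentions. A real design would also need a policy for rebalancing the partition sizes over time, which this sketch leaves out.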
Pages: 424 - 433
Number of pages: 10
Related papers
50 in total
  • [41] In-Cache Streaming: Morphable Infrastructure for Many-Core Processing Systems
    Neves, Nuno
    Mussio, Adrien
    Goncalves, Fabien
    Tomas, Pedro
    Roma, Nuno
    EURO-PAR 2016: PARALLEL PROCESSING WORKSHOPS, 2017, 10104 : 775 - 787
  • [42] Adaptive Fault Simulation on Many-core Microprocessor Systems
    Haghbayan, Mohammad-Hashem
    Teravainen, Sami
    Rahmani, Amir-Mohammad
    Liljeberg, Pasi
    Tenhunen, Hannu
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFTS), 2015, : 151 - 154
  • [43] Temporized Data Prefetching Algorithm for NoC-based Multiprocessor Systems
    Cireno, Maria
    Aziz, Andre
    Barros, Edna
    2016 IEEE 27TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2016, : 235 - 236
  • [44] Optimized mapping algorithm to extend lifetime of both NoC and cores in many-core system
    Wang, Lihuan
    Jiang, Shuyan
    Chen, Shuyu
    Wang, Junshi
    Huang, Letian
    INTEGRATION-THE VLSI JOURNAL, 2019, 67 : 82 - 94
  • [45] An Agile Tile-based Platform for Adaptive Heterogeneous Many-Core Systems
    Kamaleldin, Ahmed
    Goehringer, Diana
    2022 21ST INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2022), 2022, : 270 - 273
  • [46] Thermally Adaptive Cache Access Mechanisms for 3D Many-Core Architectures
    Xiao, He
    Yueh, Wen
    Mukhopadhyay, Saibal
    Yalamanchili, Sudhakar
    IEEE COMPUTER ARCHITECTURE LETTERS, 2016, 15 (02) : 129 - 132
  • [47] A Dynamic Cache Architecture for Efficient Memory Resource Allocation in Many-Core Systems
    Tradowsky, Carsten
    Cordero, Enrique
    Orsinger, Christoph
    Vesper, Malte
    Becker, Jurgen
    APPLIED RECONFIGURABLE COMPUTING, ARC 2016, 2016, : 343 - 351
  • [48] Reliable Ultra-Low-Voltage Cache Design for Many-Core Systems
    Zhang, Meilin
    Stojanovic, Vladimir M.
    Ampadu, Paul
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2012, 59 (12) : 858 - 862
  • [49] KTS: a real-time mapping algorithm for NoC-based many-cores
    Audrey Queudet
    Nadine Abdallah
    Maryline Chetto
    The Journal of Supercomputing, 2017, 73 : 3635 - 3651
  • [50] Adaptive Algorithm and Tool Flow for Accelerating SystemC on Many-Core Architectures
    Roth, Christoph
    Reder, Simon
    Bucher, Harald
    Sander, Oliver
    Becker, Juergen
    2014 17TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2014, : 137 - 145