Partially shared cache and adaptive replacement algorithm for NoC-based many-core systems

被引:4
|
作者
Yang, Pengfei [1 ]
Wang, Quan [1 ]
Ye, Hongwei [1 ]
Zhang, Zhiqiang [2 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Shaanxi, Peoples R China
[2] Univ Leeds, Sch Elect & Elect Engn, Leeds, W Yorkshire, England
关键词
Many-core system; NoC; Cache structure; Replacemant algorithm; HIGH-PERFORMANCE; COMMUNICATION; REPLICATION; CAPACITY; POLICIES; LOCALITY; UTILITY;
D O I
10.1016/j.sysarc.2019.05.002
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Network-on-Chip(NoC) is a promising alternative to traditional bus-based architectures that has been widely applied to interconnect multi/many-core systems due to its scalable and modular design. Undoubtedly, the memory wall problem is one of the most important challenges; however, this problem can now be somewhat be alleviated by cache subsystems. In this paper, to overcome the high resource consumption and low data-sharing rate problems of the private cache scheme, we propose a partially shared cache structure and a corresponding replacement algorithm based on a mesh NoC. In this scheme, the L2 cache is shared by each group of four cores that connected as a cluster to a given node by the local bus. To maximize the performance of this partially shared cache structure, we propose a core-aware re-reference interval prediction (CA-RRIP) replacement algorithm. The algorithm performs dynamic virtual partitioning on the partially shared cache; the core that initiated the cache access request will be given top priority when a cache area needs to be replaced or inserted. This approach guarantees cache exclusivity and can mitigate interactions among cores using different access patterns. We implement the traditional private, the proposed partially shared and the row-shared cache subsystems in our experiments. The comparisons indicate that the overall system resource occupation can be reduced by 20% with the same number of cores, and the instructions per cycle(IPC) of the system could increase by up to 49.2%. Moreover, the system throughput(STP) increased by an average of 5.89%. Our experimental results showed that the proposed CA-RRIP algorithm also reduces the average cache miss rate of the system under various cache access patterns.
引用
收藏
页码:424 / 433
页数:10
相关论文
共 50 条
  • [21] ROS-lite: ROS Framework for NoC-Based Embedded Many-Core Platform
    Azumi, Takuya
    Maruyama, Yuya
    Kato, Shinpei
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 4375 - 4382
  • [22] Round-trip DRAM Access Fairness in 3D NoC-based Many-core Systems
    Chen, Xiaowen
    Lu, Zhonghai
    Liu, Sheng
    Chen, Shuming
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2017, 16
  • [23] Work In Progress: Considering Heuristic Scheduling Algorithm for NoC-based Clustered Many-core Processor Using LET Model
    Igarashi, Shingo
    Azumi, Takuya
    2019 IEEE 40TH REAL-TIME SYSTEMS SYMPOSIUM (RTSS 2019), 2019, : 516 - 519
  • [24] MACRON: The NoC-based Many-Core Parallel Processing Platform and its Applications in 4G Communication Systems
    Ling, Xiang
    Chen, Yiou
    Yu, Zhiliang
    Chen, Shihua
    Wang, Xiaodong
    Liang, Gui
    23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 396 - 403
  • [25] Open-source NoC-based Many-Core for Evaluating Hardware Trojan Detection Methods
    Weber, Iacana
    Marchezan, Geaninne
    Caimi, Luciano
    Marcon, Cesar
    Moraes, Fernando G.
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [26] Maintaining Real-Time Application Timing Similarity for Defect-Tolerant NoC-Based Many-Core Systems
    Li, Zheng
    Lockom, Frank
    Ren, Shangping
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13
  • [27] Adaptive fault-tolerant architecture and routing algorithm for reliable many-core 3D-NoC systems
    Ben Ahmed, Akram
    Ben Abdallah, Abderazek
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2016, 93-94 : 30 - 43
  • [28] BrNoC: A broadcast NoC for control messages in many-core systems
    Wachter, Eduardo
    Caimi, Luciano L.
    Fochi, Vinicius
    Munhoz, Daniel
    Moraes, Fernando G.
    MICROELECTRONICS JOURNAL, 2017, 68 : 69 - 77
  • [29] Design Space Exploration of Energy Efficient NoC- and Cache-based Many-Core Architectures
    Souza, Matheus A.
    Freitas, Henrique C.
    Mehaut, Jean-Francois
    2018 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2018), 2018, : 402 - 409
  • [30] Multi-rate DAG Scheduling Considering Communication Contention for NoC-based Embedded Many-core Processor
    Igarashi, Shingo
    Kitagawa, Yuto
    Ishigooka, Tasuku
    Horiguchi, Tatsuya
    Azumi, Takuya
    2019 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), 2019, : 283 - 292