Efficient selective multicore prefetching under limited memory bandwidth

被引:3
|
作者
Selfa, Vicent [1 ]
Sahuquillo, Julio [1 ]
Gomez, Maria E. [1 ]
Gomez, Crispin [1 ]
机构
[1] Univ Politecn Valencia, Dept Comp Engn, Valencia, Spain
关键词
Multicore prefetching; Adaptive prefetching; Deactivation policies; Global feedback;
D O I
10.1016/j.jpdc.2018.05.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Current multicore systems implement multiple hardware prefetchers to tolerate long main memory latencies. However, memory bandwidth is a scarce shared resource which becomes critical with the increasing core count. To deal with this fact, recent works have focused on adaptive prefetchers, which control the prefetcher aggressiveness to regulate the main memory bandwidth consumption. Nevertheless, in limited bandwidth machines or under memory-hungry workloads, keeping active the prefetcher can damage the system performance and increase energy consumption. This paper introduces selective prefetching, where individual prefetchers are activated or deactivated to improve both main memory energy and performance, and proposes ADP, a prefetcher that deactivates local prefetchers in some cores when they present low performance and co-runners need additional bandwidth. Based on heuristics, an individual prefetcher is reactivated when performance enhancements are foreseen. Compared to a state-of-the-art adaptive prefetcher, ADP provides both performance and energy enhancements in limited memory bandwidth. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:32 / 43
页数:12
相关论文
共 50 条
  • [21] Phase Detection Based Data Prefetching for Utilizing Memory Bandwidth of 3D Integrated Circuits
    Lim, Hong-Yeol
    Kee, Min-Kwan
    Park, Gi-Ho
    2013 IEEE INTERNATIONAL 3D SYSTEMS INTEGRATION CONFERENCE (3DIC), 2013,
  • [22] MePPM-Memory Efficient Prediction by Partial Match Model for Web Prefetching
    Gracia, Chithra D.
    Sudha, S.
    PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 736 - 740
  • [23] An extended R-Tree indexing method using selective prefetching in main memory
    Kang, Hong-Koo
    Kim, Joung-Joon
    Kim, Dong-Oh
    Han, Ki-Joon
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 1, PROCEEDINGS, 2007, 4487 : 692 - +
  • [24] Plutus: Bandwidth-Efficient Memory Security for GPUs
    Abdullah, Rahaf
    Zhou, Huiyang
    Awad, Amro
    2023 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA, 2023, : 543 - 555
  • [25] Energy Efficient and Hign Bandwidth Embedded Memory Implementation
    Mohammad, Baker
    Saleh, Hani
    2013 8TH INTERNATIONAL CONFERENCE ON DESIGN & TECHNOLOGY OF INTEGRATED SYSTEMS IN NANOSCALE ERA (DTIS), 2013, : 117 - 121
  • [26] Memory bandwidth efficient hardware architecture for AVS encoder
    Ding, Dandan
    Yao, Shuo
    Yu, Lu
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (02) : 675 - 680
  • [27] Dynamic Cache Tuning for Efficient Memory Based Computing in Multicore Architectures
    Hajimiri, Hadi
    Mishra, Prabhat
    Bhunia, Swarup
    2013 26TH INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2013 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID), 2013, : 49 - 54
  • [28] Multicore DIMM: an Energy Efficient Memory Module with Independently Controlled DRAMs
    Ahn, Jung Ho
    Leverich, Jacob
    Schreiber, Robert S.
    Jouppi, Norman P.
    IEEE COMPUTER ARCHITECTURE LETTERS, 2009, 8 (01) : 5 - 8
  • [29] Efficient Tensor Slicing for Multicore NPUs using Memory Burst Modeling
    Sousa, Rafael
    Jung, Byungmin
    Kwak, Jaehwa
    Frank, Michael
    Araujo, Guido
    2021 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2021), 2021, : 84 - 93
  • [30] DetLock: Portable and Efficient Deterministic Execution for Shared Memory Multicore Systems
    Mushtaq, Hamid
    Al-Ars, Zaid
    Bertels, Koen
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 721 - 730