Efficient selective multicore prefetching under limited memory bandwidth

被引:3
|
作者
Selfa, Vicent [1 ]
Sahuquillo, Julio [1 ]
Gomez, Maria E. [1 ]
Gomez, Crispin [1 ]
机构
[1] Univ Politecn Valencia, Dept Comp Engn, Valencia, Spain
关键词
Multicore prefetching; Adaptive prefetching; Deactivation policies; Global feedback;
D O I
10.1016/j.jpdc.2018.05.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Current multicore systems implement multiple hardware prefetchers to tolerate long main memory latencies. However, memory bandwidth is a scarce shared resource which becomes critical with the increasing core count. To deal with this fact, recent works have focused on adaptive prefetchers, which control the prefetcher aggressiveness to regulate the main memory bandwidth consumption. Nevertheless, in limited bandwidth machines or under memory-hungry workloads, keeping active the prefetcher can damage the system performance and increase energy consumption. This paper introduces selective prefetching, where individual prefetchers are activated or deactivated to improve both main memory energy and performance, and proposes ADP, a prefetcher that deactivates local prefetchers in some cores when they present low performance and co-runners need additional bandwidth. Based on heuristics, an individual prefetcher is reactivated when performance enhancements are foreseen. Compared to a state-of-the-art adaptive prefetcher, ADP provides both performance and energy enhancements in limited memory bandwidth. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:32 / 43
页数:12
相关论文
共 50 条
  • [31] Efficient OpenMP Data Mapping for Multicore Platforms with Vertically Stacked Memory
    Marongiu, Andrea
    Ruggiero, Martino
    Benini, Luca
    2010 DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2010), 2010, : 105 - 110
  • [32] Minimizing Average Schedule Length under Memory Constraints by Optimal Partitioning and Prefetching
    Zhong Wang
    Timothy W. O'Neil
    Edwin H.-M. Sha
    Journal of VLSI signal processing systems for signal, image and video technology, 2001, 27 : 215 - 233
  • [33] 2X-Bandwidth Burst 6T-SRAM for Memory Bandwidth Limited Workloads
    Augustine, Charles
    Paul, Somnath
    Majumder, Turbo
    Tschanz, James
    Khellah, Muhammad
    De, Vivek
    2020 IEEE SYMPOSIUM ON VLSI CIRCUITS, 2020,
  • [34] Minimizing average schedule length under memory constraints by optimal partitioning and prefetching
    Wang, Z
    O'Neil, TW
    Sha, EHM
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2001, 27 (03): : 215 - 233
  • [35] Learning Agent Communication under Limited Bandwidth by Message Pruning
    Mao, Hangyu
    Zhang, Zhengchao
    Xiao, Zhen
    Gong, Zhibo
    Ni, Yan
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5142 - 5149
  • [36] Effective Multi-Agent Communication Under Limited Bandwidth
    Yu, Lebin
    Wang, Qiexiang
    Qiu, Yunbo
    Wang, Jian
    Zhang, Xudong
    Han, Zhu
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (07) : 7771 - 7784
  • [37] Optimization of Video Parameters for Teleoperation Under Limited Communication Bandwidth
    Xu, Jianqiang
    Li, Xiaohui
    Sun, Zhenping
    Bu, Yafeng
    Peng, Yuzhe
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2671 - 2680
  • [38] A bandwidth-efficient cooperative relaying scheme with limited feedback information
    Van Khuong, Ho
    Le-Ngoc, Tho
    2008 24TH BIENNIAL SYMPOSIUM ON COMMUNICATIONS, 2008, : 175 - 178
  • [39] Efficient routing strategies in scale-free networks with limited bandwidth
    Tang, Ming
    Zhou, Tao
    PHYSICAL REVIEW E, 2011, 84 (02):
  • [40] Bandwidth scaling for efficient inference over a power-limited MAC
    Marano, Stefano
    Matta, Vincenzo
    Tong, Lang
    Willett, Peter
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PTS 1-3, PROCEEDINGS, 2007, : 597 - +