Transactional Prefetching: Narrowing the Window of Contention in Hardware Transactional Memory

Authors
Negi, Anurag [1]
Armejach, Adria [2, 3]
Cristal, Adrian [2, 4]
Unsal, Osman S. [2]
Stenstrom, Per [1]
Affiliations
[1] Chalmers Univ Technol, Gothenburg, Sweden
[2] Barcelona Supercomp Ctr, Barcelona, Spain
[3] Univ Politecn Cataluna, Barcelona, Spain
[4] Spanish Natl Res Council, CSIC, IIIA, Barcelona, Spain
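Keywords
hardware transactional memory; multicores; prefetching
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract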
Memory access latency is the primary performance bottleneck in modern computer systems. Prefetching data before it is needed by a processing core allows substantial performance gains by overlapping significant portions of memory latency with useful work. Prior work has investigated this technique and measured potential benefits in a variety of scenarios. However, its use in speeding up Hardware Transactional Memory (HTM) has so far remained unexplored. In several HTM designs, transactions invalidate speculatively updated cache lines when they abort. Such cache lines tend to have high locality and are likely to be accessed again when the transaction re-executes. Coarse-grained transactions that update several cache lines are particularly susceptible to performance degradation even under moderate contention. However, such transactions show strong locality of reference, especially when contention is high. Prefetching cache lines with high locality can, therefore, improve overall concurrency by speeding up transactions and thereby narrowing the window of time in which such transactions persist and can cause contention. Such transactions are important since they are likely to form a common TM use case. We note that traditional prefetch techniques may not be able to track such lines adequately or issue prefetches quickly enough. This paper investigates the use of prefetching in HTMs, proposes a simple design to identify and request prefetch candidates, and measures the performance gains obtained for several representative TM workloads.
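The effect described in the abstract can be illustrated with a toy model. The sketch below is not the design proposed in the paper; it only assumes, as the abstract states, that speculatively updated lines are invalidated on abort and are likely to be touched again on re-execution. The names `ToyCore`, `prefetch_candidates`, and `demand_misses_on_retry` are hypothetical, and prefetches are treated as completing instantly, which a real system would not guarantee.

```python
from dataclasses import dataclass, field


@dataclass
class ToyCore:
    """Toy model of one core's private cache in an HTM (illustrative only)."""
    cache: set = field(default_factory=set)                # cached line addresses
    write_set: set = field(default_factory=set)            # speculatively updated lines
    prefetch_candidates: set = field(default_factory=set)  # lines lost on the last abort
    misses: int = 0                                        # demand misses observed

    def begin(self):
        # Assumption: prefetches issued at transaction start complete instantly.
        self.cache |= self.prefetch_candidates
        self.prefetch_candidates.clear()

    def access(self, line, is_write=False):
        if line not in self.cache:
            self.misses += 1
            self.cache.add(line)
        if is_write:
            self.write_set.add(line)

    def abort(self, use_prefetch):
        # Speculatively updated lines are invalidated when the transaction aborts.
        self.cache -= self.write_set
        if use_prefetch:
            # Remember the invalidated lines as candidates for the retry.
            self.prefetch_candidates = set(self.write_set)
        self.write_set.clear()


def demand_misses_on_retry(use_prefetch):
    """Run a transaction, abort it, and count demand misses in the re-execution."""
    core = ToyCore()
    lines = [0x40, 0x80, 0xC0]  # hypothetical cache-line addresses
    core.begin()
    for line in lines:
        core.access(line, is_write=True)
    core.abort(use_prefetch)
    core.misses = 0  # count only the retry
    core.begin()
    for line in lines:
        core.access(line, is_write=True)
    return core.misses
```

In this model the retry takes a demand miss on every previously written line unless the lines are prefetched first, which is the latency the abstract argues can be hidden to shorten the transaction and narrow its window of contention.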
Pages: 181-190
Number of pages: 10