Data Marshaling for Multi-core Architectures

被引:0
|
作者
Suleman, M. Aater [1 ]
Mutlu, Onur
Joao, Jose A. [1 ]
Khubaib [1 ]
Patt, Yale N. [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
关键词
Staged Execution; Critical Sections; Pipelining; CMP;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Previous research has shown that Staged Execution (SE), i.e., dividing a program into segments and executing each segment at the core that has the data and/or functionality to best run that segment, can improve performance and save power. However, SE's benefit is limited because most segments access inter-segment data, i.e., data generated by the previous segment. When consecutive segments run on different cores, accesses to inter-segment data incur cache misses, thereby reducing performance. This paper proposes Data Marshaling (DM), a new technique to eliminate cache misses to inter-segment data. DM uses profiling to identify instructions that generate inter-segment data, and adds only 96 bytes/core of storage overhead. We show that DM significantly improves the performance of two promising Staged Execution models, Accelerated Critical Sections and producer-consumer pipeline parallelism, on both homogeneous and heterogeneous multi-core systems. In both models, DM can achieve almost all of the potential of ideally eliminating cache misses to inter-segment data. DM's performance benefit increases with the number of cores.
引用
收藏
页码:441 / 450
页数:10
相关论文
共 50 条
  • [1] Heterogeneous multi-core architectures
    Mitra, Tulika
    IPSJ Transactions on System LSI Design Methodology, 2015, 8 : 51 - 62
  • [2] ParIS plus : Data Series Indexing on Multi-Core Architectures
    Peng, Botao
    Fatourou, Panagiota
    Palpanas, Themis
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (05) : 2151 - 2164
  • [3] Multi-core Architectures and Streaming Applications
    Smit, Gerard J. M.
    Kokkeler, Andre B. J.
    Wolkotte, Pascal T.
    de Burgwal, Marcel D. van
    SLIP '08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL WORKSHOP ON SYSTEM LEVEL INTERCONNECT PREDICTION, 2008, : 35 - 42
  • [4] Scheduling Techniques for Multi-Core Architectures
    Hatanaka, Akira
    Bagherzadeh, Nader
    PROCEEDINGS OF THE 2009 SIXTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, VOLS 1-3, 2009, : 865 - 870
  • [5] Improving Branch Prediction Accuracy on Multi-Core Architectures for Big Data
    Ma, Jianqiao
    Yu, Qi
    Huang, Libo
    Guo, Hui
    Qian, Cheng
    Ma, Sheng
    Wang, Zhiying
    2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 377 - 382
  • [6] Branch Prediction Migration for Multi-core Architectures
    Zhang, Tan
    Zhou, Chaobing
    Huang, Libo
    Xiao, Nong
    2017 INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE, AND STORAGE (NAS), 2017, : 282 - 283
  • [7] Cholesky factorization on SIMD multi-core architectures
    Lemaitre, Florian
    Couturier, Benjamin
    Lacassagne, Lionel
    JOURNAL OF SYSTEMS ARCHITECTURE, 2017, 79 : 1 - 15
  • [8] Parallel Programming Tools for Multi-core Architectures
    Mohr, Bernd
    Krammer, Bettina
    Mix, Hartmut
    PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 643 - 652
  • [9] Impact of Network Sharing in Multi-core Architectures
    Narayanaswamy, G.
    Balaji, R.
    Feng, W.
    2008 PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, VOLS 1 AND 2, 2008, : 249 - +
  • [10] Synchronization mechanisms on modern multi-core architectures
    Liu, Shaoshan
    Gaudiot, Jean-Luc
    ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2007, 4697 : 290 - +