Bandwidth based performance optimization of Multi-threaded applications

被引:2
|
作者
Manakkadu, Sheheeda [1 ]
Dutta, Sourav [1 ]
机构
[1] So Illinois Univ, Dept Elect & Comp Engn, Carbondale, IL 62901 USA
关键词
D O I
10.1109/PAAP.2014.51
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Multiple threads running on a multi-core processor can improve the performance of a parallel application significantly. However, effective scaling of threads and cores plays a key role to achieve optimal performance because performance does not necessarily improve with increasing number of cores. Multi-threaded applications suffer due to thread synchronization, negative interference in shared memory including last level cache and main memory. Memory bandwidth also often limits the performance of a multi-threaded workload. In this paper we propose a method to achieve optimal scalability on multi-core platform and predict the bandwidth requirement of parallel workloads for a given number of threads. We employ the proposed method to improve the performance of bandwidth limited parallel applications. We find that DRAM access has various phases and use the highest bandwidth among all phases to predict the performance of a given workload on multi-threaded environment. We evaluate our proposed method using Gem5 multi-core simulator and the experimental results show that the phase based bandwidth utilization method can estimate the optimal number of threads for a given parallel workload and has low prediction error.
引用
收藏
页码:118 / 122
页数:5
相关论文
共 50 条
  • [31] On multi-threaded paging
    Feuerstein, E
    de Loma, AS
    ALGORITHMS AND COMPUTATION, 1996, 1178 : 417 - 426
  • [32] Dynamic Cache Contention Detection in Multi-threaded Applications
    Zhao, Qin
    Koh, David
    Raza, Syed
    Bruening, Derek
    Wong, Weng-Fai
    Amarasinghe, Saman
    ACM SIGPLAN NOTICES, 2011, 46 (07) : 27 - 37
  • [33] An Offline Demand Estimation Method for Multi-Threaded Applications
    Perez, Juan F.
    Pacheco-Sanchez, Sergio
    Casale, Giuliano
    2013 IEEE 21ST INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS & SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2013), 2013, : 21 - +
  • [34] Verifying a compiler optimization for multi-threaded Java']Java
    Reus, B
    Knapp, A
    Cenciarelli, P
    Wirsing, M
    RECENT TRENDS IN ALGEBRAIC DEVELOPMENT TECHNIQUES, 1998, 1376 : 402 - 417
  • [35] Workshop on multi-threaded architectures and applications-MTAAP
    Derose, Luiz, 1600, IEEE Computer Society
  • [36] Decoupling contention with VRB mechanism for multi-threaded applications
    Gao, Ke
    Fan, Dongrui
    Liu, Zhiyong
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (11): : 2577 - 2588
  • [37] Multi-Threaded Parallel I/O for OpenMP Applications
    Mehta, Kshitij
    Gabriel, Edgar
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2015, 43 (02) : 286 - 309
  • [38] A multi-threaded particle swarm optimization-kmeans algorithm based on MapReduce
    Wang, Xikang
    Wang, Tongxi
    Xiang, Hua
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (06): : 8031 - 8044
  • [39] Performance measurements of multi-threaded servers in a distributed environment
    Somadder, G
    Petriu, D
    OPEN DISTRIBUTED PROCESSING AND DISTRIBUTED PLATFORMS, 1997, : 151 - 165
  • [40] On Improving the Performance of Multi-threaded CUDA Applications with Concurrent Kernel Execution by Kernel Reordering
    Wende, Florian
    Cordes, Frank
    Steinke, Thomas
    2012 SYMPOSIUM ON APPLICATION ACCELERATORS IN HIGH PERFORMANCE COMPUTING (SAAHPC), 2012, : 74 - 83