Dynamic Cache Contention Detection in Multi-threaded Applications

被引:26
|
作者
Zhao, Qin [1 ]
Koh, David [1 ]
Raza, Syed [1 ]
Bruening, Derek [2 ]
Wong, Weng-Fai [3 ]
Amarasinghe, Saman [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] Google Inc, Mountain View, CA USA
[3] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
关键词
Performance; False Sharing; Cache Contention; Shadow Memory; Dynamic Instrumentation; DATA RACE DETECTOR;
D O I
10.1145/2007477.1952688
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In today's multi-core systems, cache contention due to true and false sharing can cause unexpected and significant performance degradation. A detailed understanding of a given multi-threaded application's behavior is required to precisely identify such performance bottlenecks. Traditionally, however, such diagnostic information can only be obtained after lengthy simulation of the memory hierarchy. In this paper, we present a novel approach that efficiently analyzes interactions between threads to determine thread correlation and detect true and false sharing. It is based on the following key insight: although the slowdown caused by cache contention depends on factors including the thread-to-core binding and parameters of the memory hierarchy, the amount of data sharing is primarily a function of the cache line size and application behavior. Using memory shadowing and dynamic instrumentation, we implemented a tool that obtains detailed sharing information between threads without simulating the full complexity of the memory hierarchy. The runtime overhead of our approach - a 5x slowdown on average relative to native execution - is significantly less than that of detailed cache simulation. The information collected allows programmers to identify the degree of cache contention in an application, the correlation among its threads, and the sources of significant false sharing. Using our approach, we were able to improve the performance of some applications by up to a factor of 12x. For other contention-intensive applications, we were able to shed light on the obstacles that prevent their performance from scaling to many cores.
引用
收藏
页码:27 / 37
页数:11
相关论文
共 50 条
  • [31] Multi-Threaded Actors
    Azadbakht, Keyvan
    de Boer, Frank S.
    Serbanescu, Vlad
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2016, (223): : 51 - 66
  • [32] A scalable architecture for multi-threaded JAVA']JAVA applications
    Mrva, M
    Buchenrieder, K
    Kress, R
    [J]. DESIGN, AUTOMATION AND TEST IN EUROPE, PROCEEDINGS, 1998, : 868 - 874
  • [33] On multi-threaded paging
    Feuerstein, E
    de Loma, AS
    [J]. ALGORITHMS AND COMPUTATION, 1996, 1178 : 417 - 426
  • [34] An Offline Demand Estimation Method for Multi-Threaded Applications
    Perez, Juan F.
    Pacheco-Sanchez, Sergio
    Casale, Giuliano
    [J]. 2013 IEEE 21ST INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS & SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2013), 2013, : 21 - +
  • [35] Safe and Timely Dynamic Updates for Multi-threaded Programs
    Neamtiu, Iulian
    Hicks, Michael
    [J]. ACM SIGPLAN NOTICES, 2009, 44 (06) : 13 - 24
  • [36] Bandwidth based performance optimization of Multi-threaded applications
    Manakkadu, Sheheeda
    Dutta, Sourav
    [J]. 2014 SIXTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP), 2014, : 118 - 122
  • [37] Scalable Multi-threaded Community Detection in Social Networks
    Riedy, Jason
    Bader, David A.
    Meyerhenke, Henning
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 1619 - 1628
  • [38] Workshop on multi-threaded architectures and applications-MTAAP
    [J]. Derose, Luiz, 1600, IEEE Computer Society
  • [39] Dynamic formation of multi-threaded prominences in arcade configurations
    Jercic, V.
    Keppens, R.
    [J]. ASTRONOMY & ASTROPHYSICS, 2023, 670
  • [40] Safe and Timely Dynamic Updates for Multi-threaded Programs
    Neamtiu, Iulian
    Hicks, Michael
    [J]. PLDI'09 PROCEEDINGS OF THE 2009 ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION, 2009, : 13 - 24