A Data-centric Profiler for Parallel Programs

被引:16
|
作者
Liu, Xu [1 ]
Mellor-Crummey, John [1 ]
机构
[1] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
关键词
Data-centric profiling; scalable profiler; data locality;
D O I
10.1145/2503210.2503297
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
It is difficult to manually identify opportunities for enhancing data locality. To address this problem, we extended the HPCToolkit performance tools to support data-centric profiling of scalable parallel programs. Our tool uses hardware counters to directly measure memory access latency and attributes latency metrics to both variables and instructions. Different hardware counters provide insight into different aspects of data locality (or lack thereof). Unlike prior tools for data-centric analysis, our tool employs scalable measurement, analysis, and presentation methods that enable it to analyze the memory access behavior of scalable parallel programs with low runtime and space overhead. We demonstrate the utility of HPCToolkit's new data-centric analysis capabilities with case studies of five well-known benchmarks. In each benchmark, we identify performance bottlenecks caused by poor data locality and demonstrate non-trivial performance optimizations enabled by this guidance.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Data-Centric Execution of Speculative Parallel Programs
    Jeffrey, Mark C.
    Subramanian, Suvinay
    Abeydeera, Maleen
    Emer, Joel
    Sanchez, Daniel
    [J]. 2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2016,
  • [2] Toward a Data-Centric Profiler for PGAS Applications
    Zhang, Hui
    Hollingsworth, Jeffrey K.
    [J]. 2015 9TH INTERNATIONAL CONFERENCE ON PARTITIONED GLOBAL ADDRESS SPACE PROGRAMMING MODELS (PGAS), 2015, : 93 - 95
  • [3] ChplBlamer: A Data-centric and Code-centric Combined Profiler for Multi-locale Chapel Programs
    Zhang, Hui
    Hollingsworth, Jeffrey K.
    [J]. INTERNATIONAL CONFERENCE ON SUPERCOMPUTING (ICS 2018), 2018, : 252 - 262
  • [4] Detecting Deadlock in Programs with Data-Centric Synchronization
    Marino, Daniel
    Hammer, Christian
    Dolby, Julian
    Vaziri, Mandana
    Tip, Frank
    Vitek, Jan
    [J]. PROCEEDINGS OF THE 35TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2013), 2013, : 322 - 331
  • [5] Data-centric Combinatorial Optimization of Parallel Code
    Luo, Hao
    Chen, Guoyang
    Li, Pengcheng
    Ding, Chen
    Shen, Xipeng
    [J]. ACM SIGPLAN NOTICES, 2016, 51 (08) : 379 - 380
  • [6] A data-centric framework for debugging highly parallel applications
    Minh Ngoc Dinh
    Abramson, David
    Jin, Chao
    Gontarek, Andrew
    Moench, Bob
    DeRose, Luiz
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2015, 45 (04): : 501 - 526
  • [7] Purity: An Integrated, Fine-Grain, Data-Centric, Communication Profiler for the Chapel Language
    Johnson, Richard B.
    Hollingsworth, Jeffrey K.
    [J]. 2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 934 - 942
  • [8] Data-Centric AI
    Malerba, Donato
    Pasquadibisceglie, Vincenzo
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024,
  • [9] Data-centric automated data mining
    Campos, MM
    Stengard, PJ
    Milenova, BL
    [J]. ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 97 - 104
  • [10] An Efficient Data-Dependence Profiler for Sequential and Parallel Programs
    Li, Zhen
    Jannesari, Ali
    Wolf, Felix
    [J]. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 484 - 493