Performance Patterns and Hardware Metrics on Modern Multicore Processors: Best Practices for Performance Engineering

被引:0
|
作者
Treibig, Jan [1 ]
Hager, Georg [1 ]
Wellein, Gerhard [1 ]
机构
[1] Univ Erlangen Nurnberg, Erlangen Reg Comp Ctr RRZE, D-91058 Erlangen, Germany
来源
EURO-PAR 2012: PARALLEL PROCESSING WORKSHOPS | 2013年 / 7640卷
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many tools and libraries employ hardware performance monitoring (HPM) on modern processors, and using this data for performance assessment and as a starting point for code optimizations is very popular. However, such data is only useful if it is interpreted with care, and if the right metrics are chosen for the right purpose. We demonstrate the sensible use of hardware performance counters in the context of a structured performance engineering approach for applications in computational science. Typical performance patterns and their respective metric signatures are defined, and some of them are illustrated using case studies. Although these generic concepts do not depend on specific tools or environments, we restrict ourselves to modern x86-based multicore processors and use the likwid-perfctr tool under the Linux OS.
引用
收藏
页码:451 / 460
页数:10
相关论文
共 50 条
  • [21] Performance aware shared memory hierarchy model for multicore processors
    Ahmed M. Mohamed
    Nada Mubark
    Saad Zagloul
    Scientific Reports, 13 (1)
  • [22] RPPM: Rapid Performance Prediction of Multithreaded Workloads on Multicore Processors
    De Pestel, Sander
    Van den Steen, Sam
    Akram, Shoaib
    Eeckhout, Lieven
    2019 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2019, : 257 - 267
  • [23] High Performance Parallelization of COMPSYN on a Cluster of Multicore Processors with GPUs
    Alessi, Ferdinando
    Massini, Annalisa
    Basili, Roberto
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012, 2012, 9 : 966 - 975
  • [24] High Performance Memory Requests Scheduling Technique for Multicore Processors
    El-Reedy, Walid
    El-Moursy, Ali A.
    Fahmy, Hossam A. H.
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 127 - 134
  • [25] OpenCL Performance Evaluation on Modern Multicore CPUs
    Lee, Joo Hwan
    Nigania, Nimit
    Kim, Hyesoon
    Patel, Kaushik
    Kim, Hyojong
    SCIENTIFIC PROGRAMMING, 2015, 2015
  • [26] Efficient Portfolio Construction with the Use of Multiobjective Evolutionary Algorithms: Best Practices and Performance Metrics
    Liagkouras, K.
    Metaxiotis, K.
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2015, 14 (03) : 535 - 564
  • [27] Performance Analysis with Unified Hardware Counter Metrics
    Gravelle, Brian J.
    Nystrom, William David
    Norris, Boyana
    2022 IEEE/ACM INTERNATIONAL WORKSHOP ON PERFORMANCE MODELING, BENCHMARKING AND SIMULATION OF HIGH PERFORMANCE COMPUTER SYSTEMS (PMBS), 2022, : 60 - 70
  • [28] Performance evaluation on work-stealing featured parallel programs on asymmetric performance multicore processors?
    Adnan
    ARRAY, 2023, 19
  • [29] A best practice for performance engineering
    Hanmer, RS
    Letourneau, JP
    BELL LABS TECHNICAL JOURNAL, 2003, 8 (03) : 75 - 89
  • [30] Performance Analysis of Multi-threaded Applications in NUMA Multicore Processors
    Fang, Juan
    Fan, Qing-Wen
    Hao, Xiao-Ting
    Cai, Min
    Song, Shu-Ying
    Li, Bin
    2015 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND INFORMATION SYSTEM (SEIS 2015), 2015, : 257 - 262