Architecture slack exploitation for phase classification and performance estimation in server-class processors

被引:0
|
作者
Chinnakkonda, Diyanesh [1 ]
Rajamani, Karthick [2 ]
Srinivas, M.B. [3 ]
机构
[1] EEE Department, BITS Pilani, Hyderabad Campus, Hyderabad, India
[2] IBM Cloud Infrastructure Operations, AI Ops department, Austin,TX,78758, United States
[3] EEE Department, BITS Pilani, Dubai campus, Dubai, United Arab Emirates
关键词
Benchmarking - Memory architecture - Pipeline processing systems;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we present a highly accurate performance estimation methodology that accounts for architecture slack in workloads. Our work leverages the advanced instrumentation available in POWER8 processor that monitors core pipeline activity in relation to off-core memory accesses to build metrics for architecture slack characterization for workloads. Using these metrics, we construct a workload classifier that classifies workloads as core-bound and memory-bound and propose a performance prediction model for change in processor frequency for each class of workload – cPerf and mPerf, respectively. We evaluated these models with SPECCPU and PARSEC benchmark suites on a POWER8 based OpenPOWER system. We observed that the predicted performance with our models has high accuracy (97%) for both CPU and memory intensive benchmarks. We validated that the classifier is suitable to accurately classify phase of workloads during execution intervals. We developed an algorithm that uses classifier for phase classification and prediction models for performance estimation at runtime. We leveraged this algorithm and evaluated the execution time impacts of CPU and memory classified benchmarks. © 2022 Elsevier Inc.
引用
收藏
页码:157 / 170
相关论文
共 9 条
  • [1] Architecture slack exploitation for phase classification and performance estimation in server-class processors
    Chinnakkonda, Diyanesh
    Rajamani, Karthick
    Sriniyas, M. B.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 169 : 157 - 170
  • [2] Stretching the limits of clock-gating efficiency in server-class processors
    Jacobson, H
    Bose, P
    Hu, ZG
    Buyuktosunoglu, A
    Zyuban, V
    Eickemeyer, R
    Eisen, L
    Griswell, J
    11TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2005, : 238 - 242
  • [3] Optimizing Energy and Performance for Server-Class File System Workloads
    Sehgal, Priya
    Tarasov, Vasily
    Zadok, Erez
    ACM TRANSACTIONS ON STORAGE, 2010, 6 (03)
  • [4] High-Performance Deep-Learning Coprocessor Integrated into x86 SoC with Server-Class CPUs
    Henry, Glenn
    Palangpour, Parviz
    Thomson, Michael
    Gardner, J. Scott
    Arden, Bryce
    Donahue, Jim
    Houck, Kimble
    Johnson, Jonathan
    O'Brien, Kyle
    Petersen, Scott
    Seroussi, Benjamin
    Walker, Tyler
    2020 ACM/IEEE 47TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2020), 2020, : 15 - 26
  • [5] Bridging the architecture gap: Abstracting performance-relevant properties of modern server processors
    Hofmann J.
    Alappat C.L.
    Hager G.
    Fey D.
    Wellein G.
    Supercomputing Frontiers and Innovations, 2020, 7 (02) : 54 - 78
  • [6] Improving Performance per Watt of Non-Monotonic Multicore Processors via Bottleneck-based Online Program Phase Classification
    Srinivasan, Sudarshan
    Koren, Israel
    Kundu, Sandip
    PROCEEDINGS OF THE 34TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2016, : 528 - 535
  • [7] Improving Performance per Watt of Asymmetric Multi-Core Processors via Online Program Phase Classification and Adaptive Core Morphing
    Rodrigues, Rance
    Annamalai, Arunachalam
    Koren, Israel
    Kundu, Sandip
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2013, 18 (01)
  • [8] Efficient phase estimation for the classification of digitally phase modulated signals using the cross-WVD: a performance evaluation and comparison with the S-transform
    Mei, Chee Yen
    Sha'ameri, Ahmad Zuri
    Boashash, Boualem
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
  • [9] Efficient phase estimation for the classification of digitally phase modulated signals using the cross-WVD: a performance evaluation and comparison with the S-transform
    Chee Yen Mei
    Ahmad Zuri Sha'ameri
    Boualem Boashash
    EURASIP Journal on Advances in Signal Processing, 2012