GPU Auto-tuning Framework for Optimal Performance and Power Consumption

被引:0
|
作者
Cheema, Sunbal [1 ]
Khan, Gul N. [1 ]
机构
[1] Toronto Metropolitan Univ, Dept Elect Comp & Biomed Engn, Toronto, ON, Canada
关键词
Auto-tuning; Code transformation; Multi-objective optimization; GPU code regeneration; Performance power optimization; EFFICIENCY;
D O I
10.1145/3589236.3589241
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An auto-tuning framework for GPU devices is presented for tuning application kernels of OpenCL. The GPU tuner employs multi-objective optimization methodology to improve the performance and power consumption of applications. It efficiently explores a user defined solution space comprising of possible tunable algorithmic and hardware counter variations through code transformations. The methodology targets GPU code tuning situations where performance and energy consumption are critical. The proposed framework is evaluated for 2D convolution kernels. It utilizes a non-dominated sorting Genetic Algorithm with hardware power sensor data for application code transformation through code rewrite and validation. Various algorithmic variations such as loop unrolling, caching, workgroup size and memory utilization are applied. The final pareto optimal configurations code utilized around 30% less power and 4% faster execution time. The analysis shows the convergence of optimization, and 45% improvement in standard deviation.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [41] Online Thread Auto-Tuning for Performance Improvement and Resource Saving
    Luan, Guangqiang
    Pang, Pu
    Chen, Quan
    Xue, Shuai
    Song, Zhuo
    Guo, Minyi
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 3746 - 3759
  • [42] Performance benchmarking and auto-tuning for scientific applications on virtual cluster
    Hsu, Ke-Jou
    Chou, Jerry
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (05): : 6174 - 6206
  • [43] Toward an End-to-End Auto-tuning Framework in HPC PowerStack
    Wu, Xingfu
    Marathe, Aniruddha
    Jana, Siddhartha
    Vysocky, Ondrej
    John, Jophin
    Bartolini, Andrea
    Riha, Lubomir
    Gerndt, Michael
    Taylor, Valerie
    Bhalachandra, Sridutt
    2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 473 - 483
  • [44] An Auto-Tuning Framework for a NUMA-Aware Hessenberg Reduction Algorithm
    Eljammaly, Mahmoud
    Karlsson, Lars
    Kagstrom, Bo
    COMPANION OF THE 2018 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE '18), 2018, : 5 - 8
  • [45] LLAMA: A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics Pipelines
    Romero, Francisco
    Zhao, Mark
    Yadwadkar, Neeraja J.
    Kozyrakis, Christos
    PROCEEDINGS OF THE 2021 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '21), 2021, : 1 - 17
  • [46] Auto-Tuning for GPGPU Applications Using Performance and Energy Model
    Lin, Chih-Sheng
    Teng, Shih-Meng
    Hsiung, Pao-Ann
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 195 - 204
  • [47] Auto-tuning of Optimal PI Controller Satisfying Sensitivity Value Constrains
    Chen, Yong-hui
    Zeng, Xiao-hui
    Tan, Gong-quan
    ADVANCED RESEARCH ON INDUSTRY, INFORMATION SYSTEMS AND MATERIAL ENGINEERING, PTS 1-7, 2011, 204-210 : 1938 - 1943
  • [48] SHAMan: A Versatile Auto-tuning Framework for Costly and Noisy HPC Systems
    Robert, S.
    Zertal, S.
    Couvee, P.
    OPTIMIZATION AND LEARNING, OLA 2022, 2022, 1684 : 87 - 104
  • [49] Auto-tuning for GPGPU applications using performance and energy model
    Lin, Chih-Sheng
    Teng, Shih-Meng
    Hsiung, Pao-Ann
    JOURNAL OF SYSTEMS ARCHITECTURE, 2016, 62 : 40 - 53
  • [50] Mjolnir: A framework agnostic auto-tuning system with deep reinforcement learning
    Nourchene Ben Slimane
    Houssem Sagaama
    Maher Marwani
    Sabri Skhiri
    Applied Intelligence, 2023, 53 : 14008 - 14022