GPU Auto-tuning Framework for Optimal Performance and Power Consumption

被引:0
|
作者
Cheema, Sunbal [1 ]
Khan, Gul N. [1 ]
机构
[1] Toronto Metropolitan Univ, Dept Elect Comp & Biomed Engn, Toronto, ON, Canada
关键词
Auto-tuning; Code transformation; Multi-objective optimization; GPU code regeneration; Performance power optimization; EFFICIENCY;
D O I
10.1145/3589236.3589241
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An auto-tuning framework for GPU devices is presented for tuning application kernels of OpenCL. The GPU tuner employs multi-objective optimization methodology to improve the performance and power consumption of applications. It efficiently explores a user defined solution space comprising of possible tunable algorithmic and hardware counter variations through code transformations. The methodology targets GPU code tuning situations where performance and energy consumption are critical. The proposed framework is evaluated for 2D convolution kernels. It utilizes a non-dominated sorting Genetic Algorithm with hardware power sensor data for application code transformation through code rewrite and validation. Various algorithmic variations such as loop unrolling, caching, workgroup size and memory utilization are applied. The final pareto optimal configurations code utilized around 30% less power and 4% faster execution time. The analysis shows the convergence of optimization, and 45% improvement in standard deviation.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [31] ATF: A generic directive-based auto-tuning framework
    Rasch, Ari
    Gorlatch, Sergei
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (05):
  • [32] Auto-Tuning for Military Microgrids
    Podlesak, Thomas
    Vitale, Joseph
    Wilson, Blane
    Bohn, Frank
    Gonzalez, Michael
    Bosse, Richard
    Siegfried, Stefan
    Lynch, Jaclyn
    Barnhill, William
    2019 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2019, : 6270 - 6277
  • [33] The interpolation method for auto-tuning
    Skvortsov, L.M.
    Shuiyun Gongcheng/Port & Waterway Engineering, 1998, (09):
  • [34] A Multivariable Auto-Tuning Digital Controller for Switching Power Converters
    Huang, Wangxin
    Abu Qahouq, Jaber A.
    2014 TWENTY-NINTH ANNUAL IEEE APPLIED POWER ELECTRONICS CONFERENCE AND EXPOSITION (APEC), 2014, : 1053 - 1058
  • [35] Benefits of auto-tuning VFDs
    Avery, Paul
    Control Engineering, 2021, 68 (09)
  • [36] AUTO-TUNING PARALLEL SKELETONS
    Collins, Alexander
    Fensch, Christian
    Leather, Hugh
    PARALLEL PROCESSING LETTERS, 2012, 22 (02)
  • [37] Least squares auto-tuning
    Barratt, Shane T.
    Boyd, Stephen P.
    ENGINEERING OPTIMIZATION, 2021, 53 (05) : 789 - 810
  • [38] Optimal Buffer Management Algorithm with Auto-tuning Reference Queue Length
    Zhang Heying
    Song Lei
    Fan Baohua
    Jiang Jie
    HPCC 2008: 10TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2008, : 418 - 424
  • [39] A Fine-grained Prefetching Scheme for DGEMM Kernels on GPU with Auto-tuning Compatibility
    Li, Jialin
    Ye, Huang
    Tian, Shaobo
    Li, Xinyuan
    Zhang, Jian
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 863 - 874
  • [40] Performance benchmarking and auto-tuning for scientific applications on virtual cluster
    Ke-Jou Hsu
    Jerry Chou
    The Journal of Supercomputing, 2022, 78 : 6174 - 6206