GPU Auto-tuning Framework for Optimal Performance and Power Consumption

被引:0
|
作者
Cheema, Sunbal [1 ]
Khan, Gul N. [1 ]
机构
[1] Toronto Metropolitan Univ, Dept Elect Comp & Biomed Engn, Toronto, ON, Canada
关键词
Auto-tuning; Code transformation; Multi-objective optimization; GPU code regeneration; Performance power optimization; EFFICIENCY;
D O I
10.1145/3589236.3589241
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An auto-tuning framework for GPU devices is presented for tuning application kernels of OpenCL. The GPU tuner employs multi-objective optimization methodology to improve the performance and power consumption of applications. It efficiently explores a user defined solution space comprising of possible tunable algorithmic and hardware counter variations through code transformations. The methodology targets GPU code tuning situations where performance and energy consumption are critical. The proposed framework is evaluated for 2D convolution kernels. It utilizes a non-dominated sorting Genetic Algorithm with hardware power sensor data for application code transformation through code rewrite and validation. Various algorithmic variations such as loop unrolling, caching, workgroup size and memory utilization are applied. The final pareto optimal configurations code utilized around 30% less power and 4% faster execution time. The analysis shows the convergence of optimization, and 45% improvement in standard deviation.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [21] GeST: Generalized Stencil Auto-tuning Framework on GPUs
    Sun, Qingxiao
    PROCEEDINGS OF THE ACM TURING AWARD CELEBRATION CONFERENCE-CHINA 2024, ACM-TURC 2024, 2024, : 199 - 200
  • [22] Monte Carlo Optimisation Auto-Tuning on a Multi-GPU Cluster
    Paukste, Andrius
    2012 2ND IEEE INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2012, : 894 - 898
  • [23] Tensile: Auto-tuning GEMM GPU Assembly for All Problem Sizes
    Tanner, David E.
    2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 1066 - 1075
  • [24] PERI Auto-Tuning
    Bailey, David H.
    Chame, Jacqueline
    Chen, Chun
    Dongarra, Jack
    Hall, Mary
    Hollingsworth, Jeffrey K.
    Hovland, Paul
    Moore, Shirley
    Seymour, Keith
    Shin, Jaewook
    Tiwari, Ananta
    Williams, Sam
    You, Haihang
    SCIDAC 2008: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2008, 125
  • [25] Auto-tuning self-compatibility of power converters
    Attaianese, C
    Tomasso, G
    APEC 2005: Twentieth Annual IEEE Applied Power Electronics Conference and Exposition, Vols 1-3, 2005, : 1964 - 1969
  • [26] PI Auto-tuning and Performance Assessment in HVAC Systems
    Zhao, Futao
    Fan, James
    Mijanovic, Stevo
    2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, : 1783 - 1788
  • [27] CESMTuner: An Auto-Tuning Framework for the Community Earth System Model
    Ding Nan
    Xue Wei
    Ji Xu
    Xu Haoyu
    Song Zhenya
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 282 - 289
  • [28] Structured Merge with Auto-Tuning: Balancing Precision and Performance
    Apel, Sven
    Lessenich, Olaf
    Lengauer, Christian
    2012 PROCEEDINGS OF THE 27TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2012, : 120 - 129
  • [29] GLAF: A Visual Programming and Auto-Tuning Framework for Parallel Computing
    Krommydas, Konstantinos
    Sasanka, Ruchira
    Feng, Wu-chun
    2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 859 - 868
  • [30] GPU-FPtuner: Mixed-precision Auto-tuning for Floating-point Applications on GPU
    Gu, Ruidong
    Becchi, Michela
    2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020), 2020, : 294 - 304