moTuner: A Compiler-based Auto-tuning Approach for Mixed-precision Operators

被引:0
|
作者
Mo, Zewei [1 ]
Lin, Zejia [2 ]
Zhang, Xianwei [1 ]
Lu, Yutong [1 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Peoples R China
[2] Northwestern Polytech Univ, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
mixed-precision operator; auto-tuning; compiler; performance and accuracy; GPUs;
D O I
10.1145/3528416.3530231
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Arithmetic operators are now used in a wide spectrum of domains, including artificial intelligence, data analytics and scientific computing. Meanwhile, specialized hardware components to enable low-precision computing are increasingly deployed in GPUs and accelerators. Whereas promising to boost performance, accelerating the operators on the hardware necessitates manually tuning the mixed-precision knobs to balance the performance and accuracy, which can be extremely challenging in real practices. To address the issue, we present moTuner, an automatic framework for efficiently tuning mixed-precision operators. moTuner works on compiler-level to automatically enable the mixed-precision computation, without involving any manual modifications of source code and/or the operator library, thus significantly alleviating the programming burden. Owing to be implemented in compilation phase, moTuner can be more widely applicable with lessened efforts on the libraries. Further, moTuner adopts optimized search strategy in tuning to effectively narrow down the configuration space. The evaluations on GEMM operators and real applications demonstrate that moTuner achieves performance improvement up to 3.13x and 1.15x respectively, while guaranteeing considerably high accuracy.
引用
收藏
页码:94 / 102
页数:9
相关论文
共 50 条
  • [1] Auto-Tuning Mixed-Precision Computation by Specifying Multiple Regions
    Ren, Xuanzhengbo
    Kawai, Masatoshi
    Hoshino, Tetsuya
    Katagiri, Takahiro
    Nagai, Toru
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2025, 37 (02):
  • [2] GPU-FPtuner: Mixed-precision Auto-tuning for Floating-point Applications on GPU
    Gu, Ruidong
    Becchi, Michela
    2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020), 2020, : 294 - 304
  • [3] A Bayesian Network Approach for Compiler Auto-tuning for Embedded Processors
    Ashouri, Amir Hossein
    Mariani, Giovanni
    Palermo, Gianluca
    Silvano, Cristina
    2014 IEEE 12TH SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA (ESTIMEDIA), 2014, : 90 - 97
  • [4] A Scalable Auto-tuning Framework for Compiler Optimization
    Tiwari, Ananta
    Chen, Chun
    Chame, Jacqueline
    Hall, Mary
    Hollingsworth, Jeffrey K.
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 796 - +
  • [5] TAFFO: The compiler-based precision tuner
    Cattaneo, Daniele
    Chiari, Michele
    Agosta, Giovanni
    Cherubin, Stefano
    SOFTWAREX, 2022, 20
  • [6] Compiler Auto-tuning via Critical Flag Selection
    Zhu, Mingxuan
    Hao, Dan
    2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1000 - 1011
  • [7] A Mixed-precision DNN Compiler for Robotic Computing on FPGA
    Li, Qiufeng
    Huang, Hantao
    Shen, Ao
    Li, Kai
    Huang, Sixiao
    Yu, Hao
    2023 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND ARTIFICIAL INTELLIGENCE, RAAI 2023, 2023, : 206 - 212
  • [8] EAtuner: Comparative Study of Evolutionary Algorithms for Compiler Auto-tuning
    Xiao, Guojian
    Qin, Siyuan
    Li, Kuan
    Chen, Juan
    Yin, Jianping
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 419 - 426
  • [9] FBTuner: A Feedback-Directed Approach for Safe Mixed-Precision Tuning
    Li, Xinyi
    Gopalakrishnan, Ganesh
    2024 IEEE 24TH INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID 2024, 2024, : 638 - 639
  • [10] Referee: A Pattern-Guided Approach for Auto Design in Compiler-Based Analyzers
    Lv, Fang
    Li, Hao
    Wang, Lei
    Liu, Ying
    Cui, Huimin
    Xue, Jingling
    Feng, Xiaobing
    PROCEEDINGS OF THE 2020 IEEE 27TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER '20), 2020, : 1 - 12