Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems

被引:2
|
作者
Boratto, Murilo [1 ]
Alonso, Pedro [2 ]
Gimenez, Domingo [3 ]
Lastovetsky, Alexey [4 ]
机构
[1] Univ Estado Bahia, Nucleo Arquitetura Comp & Sistemas Operacionais, Salvador, BA, Brazil
[2] Univ Politecn Valencia, Dept Sistemas Informat & Comp, Valencia, Spain
[3] Univ Murcia, Dept Sistemas Informat, Murcia, Spain
[4] Univ Coll Dublin, Sch Comp Sci, Heterogeneous Comp Lab, Dublin, Ireland
来源
JOURNAL OF SUPERCOMPUTING | 2017年 / 73卷 / 01期
关键词
Automatic tuning; Matrix polynomials; Performance; Multicore; Multi-GPU;
D O I
10.1007/s11227-016-1694-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic tuning methodologies have been used in the design of routines in recent years. The goal of these methodologies is to develop routines which automatically adapt to the conditions of the underlying computational system so that efficient executions are obtained independently of the end-user experience. This paper aims to explore programming routines that can automatically be adapted to the computational system conditions thanks to these automatic tuning methodologies. In particular, we have worked on the evaluation of matrix polynomials on multicore and multi-GPU systems as a target application. This application is very useful for the computation of matrix functions like the sine or cosine but, at the same time, the application is very time consuming since the basic computational kernel, which is the matrix multiplication, is carried out many times. The use of all available resources within a node in an easy and efficient way is crucial for the end user.
引用
收藏
页码:227 / 239
页数:13
相关论文
共 50 条
  • [1] Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems
    Murilo Boratto
    Pedro Alonso
    Domingo Giménez
    Alexey Lastovetsky
    The Journal of Supercomputing, 2017, 73 : 227 - 239
  • [2] Automatic routine tuning to represent landform attributes on multicore and multi-GPU systems
    Boratto, Murilo
    Alonso, Pedro
    Gimenez, Domingo
    Barreto, Marcos
    JOURNAL OF SUPERCOMPUTING, 2014, 70 (02): : 733 - 745
  • [3] Automatic routine tuning to represent landform attributes on multicore and multi-GPU systems
    Murilo Boratto
    Pedro Alonso
    Domingo Gimenéz
    Marcos Barreto
    The Journal of Supercomputing, 2014, 70 : 733 - 745
  • [4] Modelling Multi-GPU Systems
    Spampinato, Daniele G.
    Elster, Anne C.
    Natvig, Thorvald
    PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 562 - 569
  • [5] Auto-Tuning TRSM with an Asynchronous Task Assignment Model on Multicore, Multi-GPU and Coprocessor systems
    Pinto, Clicia
    Barreto, Marcos
    Boratto, Murilo
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [6] Parallel Algorithm for Landform Attributes Representation on Multicore and Multi-GPU Systems
    Boratto, Murilo
    Alonso, Pedro
    Ramiro, Carla
    Barreto, Marcos
    Coelho, Leandro
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT I, 2012, 7333 : 29 - 43
  • [7] Benchmarking multi-GPU applications on modern multi-GPU integrated systems
    Bernaschi, Massimo
    Agostini, Elena
    Rossetti, Davide
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (14):
  • [8] Data Partitioning on Multicore and Multi-GPU Platforms Using Functional Performance Models
    Zhong, Ziming
    Rychkov, Vladimir
    Lastovetsky, Alexey
    IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (09) : 2506 - 2518
  • [9] Heterogeneous Computational Model for Landform Attributes Representation on Multicore and Multi-GPU Systems
    Boratto, Murilo
    Alonso, Pedro
    Ramiro, Carla
    Barreto, Marcos
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2012, 2012, 9 : 47 - 56
  • [10] Performance Optimization of Allreduce Operation for Multi-GPU Systems
    Nukada, Akira
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3107 - 3112