ABCLib_DRSSED: A parallel eigensolver with an auto-tuning facility

被引:19
|
作者
Katagiri, T
Kise, K
Honda, H
Yuba, T
机构
[1] Univ Electrocommun, Grad Sch Informat Syst, Tokyo 1828585, Japan
[2] Japan Sci & Technol Agcy, PRESTO, Tokyo 1828585, Japan
基金
日本科学技术振兴机构;
关键词
ABCLib; auto-tuning facility; FIBER; sampling point; load-balancer;
D O I
10.1016/j.parco.2005.10.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Conventional auto-tuning numerical software has the drawbacks of (1) fixed sampling points for the performance estimation, (2) inadequate adaptation to heterogeneous environments. To solve these drawbacks, we developed ABCLib(-)DRSSED, which is a parallel eigensolver with an auto-tuning facility. ABCLib(-)DRSSED has (1) functions based on the sampling points which are constructed with an end-user interface; (2) a load-balancer for the data to be distributed; (3) a new auto-tuning optimization timing called Before Execute-time Optimization (BEO). In our performance evaluation of the BEO, we obtained speedup factors from 10% to 90%, and 340% in the case of a failed estimation. In the evaluation of the load-balancer, the performance was 220% improved. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:231 / 250
页数:20
相关论文
共 50 条
  • [1] AUTO-TUNING PARALLEL SKELETONS
    Collins, Alexander
    Fensch, Christian
    Leather, Hugh
    PARALLEL PROCESSING LETTERS, 2012, 22 (02)
  • [2] Efficient Auto-Tuning of Parallel Programs with Interdependent Tuning Parameters via Auto-Tuning Framework (ATF)
    Rasch, Ari
    Schulze, Richard
    Steuwer, Michel
    Gorlatch, Sergei
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2021, 18 (01)
  • [3] Parallel GMRES Incomplete Orthogonalization Auto-Tuning
    Aquilanti, Pierre-Yves
    Petiton, Serge
    Calandra, Henri
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 2246 - 2256
  • [4] Formal Techniques for Development and Auto-tuning of Parallel Programs
    Doroshenko A.
    Ivanenko P.
    Yatsenko O.
    SN Computer Science, 4 (2)
  • [5] Taming Parallel I/O Complexity with Auto-Tuning
    Behzad, Babak
    Huong Vu Thanh Luu
    Huchette, Joseph
    Byna, Surendra
    Prabhat
    Aydt, Ruth
    Koziol, Quincey
    Snir, Marc
    2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2013,
  • [6] Knowledge discovery in auto-tuning parallel numerical library
    Kuroda, Hisayasu
    Katagiri, Takahiro
    Kanada, Yasumasa
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2002, 2281 : 628 - 639
  • [7] Auto-tuning Mapping Strategy for Parallel CFD Program
    Liu Fang
    Wang Zhenghua
    Che Yonggang
    2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 1, 2012, : 222 - 226
  • [8] Adaptive parallel tiled code generation and accelerated auto-tuning
    Tavarageri, Sanket
    Ramanujam, J.
    Sadayappan, P.
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2013, 27 (04): : 412 - 425
  • [9] MaSiF: Machine Learning Guided Auto-tuning of Parallel Skeletons
    Collins, Alexander
    Fensch, Christian
    Leather, Hugh
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'12), 2012, : 437 - 438
  • [10] MaSiF: Machine Learning Guided Auto-tuning of Parallel Skeletons
    Collins, Alexander
    Fensch, Christian
    Leather, Hugh
    Cole, Murray
    2013 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2013, : 186 - 195