A Hybrid Parallel Tridiagonal Solver on Multi-core Architectures

被引:5
|
作者
Tang, Guangping [1 ]
Li, Kenli [1 ]
Li, Keqin [1 ,2 ]
Chen, Hang [1 ]
Du, Jiayi [1 ]
机构
[1] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[2] State Univ New York New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
parallel; tridiagonal; hybrid algorithm; multi-threading; CYCLIC REDUCTION; ALGORITHM; SYSTEMS; EQUATION;
D O I
10.1109/IPDPSW.2014.71
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
An optimized parallel algorithm is proposed to solve the problem occurred in the process of complicated backward substitution of cyclic reduction during solving tridiagonal linear systems. Adopting a hybrid parallel model, this algorithm combines the cyclic reduction method and the partition method. This hybrid algorithm has simple backward substitution on parallel computers comparing with the cyclic reduction method. In this paper, the operation count and execution time are obtained to evaluate and make comparison for these methods. On the basis of results of these measured parameters, the hybrid algorithm using the hybrid approach with a multi-threading implementation achieves better efficiency than the other parallel methods, i.e., the cyclic reduction and the partition methods. Among them, the cyclic reduction method is previously found to be the fastest algorithm in many ways for solutions. In particular, the approach involved in this paper has the least scalar operation count and the shortest execution time on multi-core computer when the size of an equation is large enough. The hybrid parallel algorithm improves the performance of the cyclic reduction and partition methods by 30% and 20% respectively.
引用
收藏
页码:605 / 614
页数:10
相关论文
共 50 条
  • [1] An iteration-based hybrid parallel algorithm for tridiagonal systems of equations on multi-core architectures
    Tang, Guangping
    Yang, Wangdong
    Li, Kenli
    Ye, Yu
    Xiao, Guoqing
    Li, Keqin
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17): : 5076 - 5095
  • [2] Implementation of Hybrid Total FETI (HTFETI) Solver for Multi-core Architectures
    Riha, Lubomir
    Brzobohaty, Tomas
    Markopoulos, Alexandros
    Jarosova, Marta
    Kozubek, Tomas
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2014 (ICNAAM-2014), 2015, 1648
  • [3] Parallel Programming Tools for Multi-core Architectures
    Mohr, Bernd
    Krammer, Bettina
    Mix, Hartmut
    [J]. PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 643 - 652
  • [4] Parallel Finite Element Solver for Multi-Core Computers
    Fialko, Sergiy
    [J]. 2012 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2012, : 525 - 532
  • [5] Parallel B&B Algorithm for Hybrid Multi-core/GPU Architectures
    Bendjoudi, A.
    Chekini, M.
    Gharbi, M.
    Mehdi, M.
    Benatchba, K.
    Sitayeb-Benbouzid, F.
    Melab, N.
    [J]. 2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 914 - 921
  • [6] A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)
    Valero-Lara, Pedro
    Andrade, Diego
    Sirvent, Raul
    Labarta, Jesus
    Fraguela, Basilio B.
    Doallo, Ramon
    [J]. IEEE ACCESS, 2019, 7 : 23365 - 23378
  • [7] Scalable Direct-Iterative Hybrid Solver for Sparse Matrices on Multi-Core and Vector Architectures
    Ono, Kenji
    Kato, Toshihiro
    Ohshima, Satoshi
    Nanri, Takeshi
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2020), 2020, : 11 - 21
  • [8] Automated Tuning in Parallel Sorting on Multi-core Architectures
    Lin, Haibo
    Li, Chao
    Wang, Qian
    Zhao, Yi
    Pan, Ninghe
    Zhuang, Xiaotong
    Shao, Ling
    [J]. EURO-PAR 2010 PARALLEL PROCESSING, PT I, 2010, 6271 : 14 - +
  • [9] MODELS FOR PARALLEL WORKFLOW PROCESSING ON MULTI-CORE ARCHITECTURES
    Rauber, Thomas
    Ruenger, Gudula
    [J]. ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL ISAS-2: INFORMATION SYSTEMS ANALYSIS AND SPECIFICATION, VOL 2, 2008, : 220 - +
  • [10] MR3-SMP: A symmetric tridiagonal eigensolver for multi-core architectures
    Petschow, M.
    Bientinesi, P.
    [J]. PARALLEL COMPUTING, 2011, 37 (12) : 795 - 805